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Glossary 

C0 2 capture Any process whereby C0 2 is sustainably 
removed from the atmosphere for a long period. 
Dunite Common rock type consisting for more than 
90% of the mineral olivine. 

Enhanced weathering Any process whereby 
weathering reactions are sped up, like crushing the 
rock into fine particles and spreading these in suit¬ 
able climates. 

Laterite Iron-rich tropical soil, which is the insoluble 
residue left after the weathering of the rock. 


Mineral carbonation The reaction of C0 2 with min¬ 
erals (particularly Mg or Ca silicates), leading to the 
formation of solid and stable carbonates. This car¬ 
bonation is preceded by the transformation of C0 2 
gas to bicarbonate solutions. 

Olivine A silicate which is a mixed crystal of Mg 2 Si0 4 
and Fe 2 Si0 4 . It is the fastest weathering common 
silicate. 

Weathering The process whereby rocks are 
decomposed by reaction with water and acid (usu¬ 
ally carbonic acid). 

Definition of the Subject 

The aim of enhanced weathering is to capture C0 2 by 
the carbonation of silicates, or by dissolution of these 
silicates during which the greenhouse gas C0 2 is 
converted to bicarbonate in solution. Research in this 
field is still focused on increasing the rate of reaction, 
but the required additional technologies add consider¬ 
ably to the cost of the process. In this entry, the focus is 
on the optimization of the weathering conditions, by 
selecting the most reactive abundantly available min¬ 
erals, grinding them, and spreading the grains over 
land. Thereafter nature takes its course. Since its for¬ 
mulation in the late 1990s, more and more people 
realize that this simple and natural approach may well 
turn out to be one of the most promising and environ¬ 
mentally friendliest ways to counteract climate change 
and ocean acidification. 

Introduction 

C0 2 is a greenhouse gas. The rising concentration of 
C0 2 in the atmosphere by the burning of fossil fuels is 
considered by many to be the main cause of climate 
change. Most efforts on carbon sequestration so far 
have focused on the physical removal of C0 2 , by 
locking it up as a supercritical fluid in available spaces 
in the subsoil, like abandoned oil and gas fields 
or aquifers. Sequestration by mineral carbonation* 
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has long been neglected, as it was deemed to be too slow 
or too costly. The main arguments for a slow rate of 
reaction stem from an extrapolation of abiotic experi¬ 
ments in the laboratory. Observations from the “real 
world” only now begin to show that weathering rates 
are often 1-2 orders of magnitude faster than rates 
determined in the laboratory. Arguments based on 
the apparent high-cost aspect are due to the fact that 
almost all researchers assume that mineral carbonation 
must be coupled to a technology to speed up the 
reaction. This may involve thermal treatment of the 
mineral, a chemical or mechanical pretreatment or 
a technology based on subjecting the mineral to high 
pressures and temperatures in autoclaves. Most of these 
approaches are successful to some extent, but their high 
cost makes such mineral carbonation technologies 
unattractive [1], and some produce large amounts of 
C0 2 . If there is no objection to the reaction taking a few 
months or a few years to run to completion, then there 
is no need for the use of expensive equipment, which 
must meet minimum throughput rates for financial 
reasons. Given these conditions, chemical weathering 
in favorable environments becomes an attractive prop¬ 
osition. It can be applied on a world scale, it is a 
low-cost operation per ton of C0 2 captured, it is sus¬ 
tainable, it provides employment opportunities in 
developing countries, and unpleasant environmental 
surprises are unlikely, as the same process has already 
operated over the entire geological history. 

This entry, therefore, will focus on the enhance¬ 
ment of the process of chemical weathering, by which 
a mineral is converted to a bicarbonate solution. These 
bicarbonate solutions are transported by rivers to the 
oceans, where they will be captured as carbonate rocks, 
which are the ultimate sink for C0 2 . During the pre¬ 
cipitation of solid carbonates, half of the C0 2 is 
released again, but this process takes on average several 
hundreds to thousands of years, so the bicarbonate step 
is the relevant step for climate change mitigation. 

Materials 

Most materials to be used for carbonation are naturally 
occurring rocks or minerals, although residues from 
industrial processes are also considered. Industrial 
residues have the advantage of being readily available 
and sometimes need to be removed or treated anyhow. 


However, their total volume is too small to make more 
than a small dent in the C0 2 problem, but they may 
provide an easily accessible and cost-effective part of 
the solution. Some of the materials in this category are 
(coal) fuel ashes, oil shale ashes, metallurgical slags, 
including slags from the production of elementary 
phosphorus, red mud from the alumina industry, and 
wastes from the demolition of concrete buildings [2, 3] . 
It was shown that wastes from the demolition of 
concrete are rapidly carbonated in soils, and their 
carbonation compensates for decarbonation during 
manufacture. 

Among the naturally occurring materials for car¬ 
bonation, olivine and olivine rocks, called dunites, take 
a prominent place because olivine weathers fast and is 
abundantly available (see Fig. 1). Other candidate 
materials include basalts and basaltic tuffs, and possibly 
anorthosites. A mineral that is also frequently men¬ 
tioned is wollastonite (CaSi0 3 ). Wollastonite does 
react quickly with C0 2 and water, but wollastonite 
deposits are rare and small. It is evident that priority 
will be given to material that is already crushed or 
milled, as this saves mining and milling costs. Many 
ore deposits of chromite, nickel, magnesite, peridot 
(a semiprecious variety of olivine), or even diamond 
have olivine-rich host rocks. Billions of tons of such 
rocks can be found in crushed form as mine tailings. 
Later on, in the section on rate of weathering, some 
spectacular evidence on the rate of weathering of pow¬ 
dered olivine or serpentine (the hydration product of 
olivine) on mine dumps will be presented. 

Modes of Application 

a. In dedicated industrial installations 

b. In situ injection in suitable rock types 

c. Ex situ after crushing and spreading suitable rock 

types 

a. Industrial installations. These require an input of 
fairly pure C0 2 to reach a maximum efficiency. This 
means that they can be dependent on point sources of 
pure C0 2 , or on more dilute sources of C0 2 from 
which the C0 2 must first be captured and purified. 
Industrial treatments, involving thermal or chemical 
activations, or large autoclaves through which the 
mineral powders must pass as fast as possible have 
significant energy requirements. It is doubtful that 
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large-scale application of thermal or chemical treat¬ 
ment of olivine, for instance, can be carried out at the 
required scale of 25 billion tons of olivine annually, 
let alone in a cost-effective way. 

b. In situ injection in suitable rock types. The major 
distinction between this type of injection and the most 
commonly proposed CCS (Carbon Capture and 
Storage) is that injection in flood basalts or olivine 
rocks aims at converting the injected C0 2 gas into 
carbonate rock by reaction with the host rock. This 
reaction produces some heat, which may further 
speed up the reaction. In oil and gas fields, or in saline 
aquifers, the C0 2 remains in the reservoir as a gas (or 
more correctly as a supercritical fluid), with the inher¬ 
ent risk of leakage, or even explosive escape. The main 
conditions for a reservoir rock in which the C0 2 will be 
injected and converted to solid carbonates are reactiv¬ 
ity and, what can be called, “selective” permeability. 
Thick sections composed of many superimposed vol¬ 
canic flows with a horizontal extension of sometimes 
more than 1 million km 2 , so-called flood basalts, may 
be an option. A number of features of lava flows, 
including flows that are separated from the overlying 
flow by a rubble zone, which can have high permeabil¬ 
ity, often finding a high concentration of bubbles near 
the top of the flow where these bubbles contribute to 
the accessibility of the rock for rock/fluid interaction; 
and some lava flows vertical structure with more or less 
hexagonal columnar jointing that originates from 
shrinkage during cooling, which permits easy access 
for fluids throughout the basalt flow. The combination 
of these properties makes piles of basalt flows almost 
ideal targets for injection, although their reactivity is 
considerably lower than that of olivine-rich rock types. 
Flood basalts can occupy enormous volumes, in several 
cases in excess of 1 million km 3 . Well-known examples 
are the Deccan traps in India, the Siberian traps, the 
Karroo basalts, and the flood basalts of the Parana 
basin, each covering more than 1 million km 2 . The 
Columbia River flood basalts and the Deccan traps 
are presently under investigation on how they can be 
used for carbon capture [4]. Logistics of C0 2 transport 
and its associated cost, as well as the rate at which it can 
be injected, and the reactivity of the rock will probably 
limit their use as a C0 2 trap to locations close to 
voluminous point sources of C0 2 . They potentially 
constitute a safer storage than aquifers or abandoned 


oil and gas fields because after reaction with the rock, 
the C0 2 is no longer stored as a gas but as a solid 
carbonate. 

The situation with direct injection into olivine-rich 
rocks is more or less the reverse of injection into basalt 
piles. Permeability is often much smaller, as most 
dunites (dunite is the rock type that consists for more 
than 90% of olivine) are massive rocks. They may 
contain joints, but these are often sealed with a thin 
veneer of serpentine or magnesite (MgC0 3 ). On the 
other hand, their reactivity is considerably better. Spec¬ 
tacular examples (Fig. 2) of carbonation of olivine-rich 
rocks in Oman were presented by [5]. These authors 
propose that annually up to one billion tons of C0 2 can 
be injected into these rocks in Oman. Transport of such 
volumes of C0 2 from industries and coal-fired power 
plants in the industrialized world, as well as purifying it 
and bringing it to a remote site, poses, however, a huge 
logistical problem. Even if the logistical problems are 
not insurmountable in terms of cost, there remains the 
problem that the rocks must be intensely fracced for 
injection. Kelemen and Matter suggest that the consid¬ 
erable heat of reaction from the hydration [5] and 
carbonation [6] may help to crack the rocks further. 
Although from the reactivity point of view direct injec¬ 
tion of C0 2 + water into dunites is an attractive 
proposition, the likely high costs may limit its use to 
locations where a coal-fired power plant is situated at 
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Multiple generations of carbonate cement around dunite 
pieces in alluvial terraces in Oman [5] 
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short distance from a dunite massif like near Orhaneli/ 
Turkey [7], or where, like in Oman, the C0 2 produced 
by the installations on oil and gas fields can be injected 
directly into the dunites nearby. 

c. Ex situ after crushing and spreading suitable rock 
types. This option is better known as enhanced 
weathering [8]. This is probably the most promising 
solution for the sustainable capture of C0 2 on the scale 
of billions of tons of C0 2 annually. It is only for this 
option that a more complete description of the pro¬ 
posed process will be given, as well as some quantitative 
estimates of its costs and energy requirements. Further¬ 
more, the possible collateral benefits of the method will 
be considered. 

Weathering 

It is estimated that the Earth produces around 
0.5 billion tons of C0 2 each year, mainly from volcanic 
sources [9]. Other important sources of C0 2 are car¬ 
bonate rocks that decompose after having been 
transported to great depth and subjected to high tem¬ 
peratures in subduction zones [10], or limestones and 
dolomites in the contact aureole of intrusive rocks, or 
marbles or siliceous dolomites undergoing high-grade 
regional metamorphism. If there were no feedback 
mechanism or mechanisms by which this C0 2 is con¬ 
stantly removed from the atmosphere and sustainably 
stored in rocks, the Earth would now have an atmo¬ 
sphere similar to Venus, where the C0 2 pressure of the 
atmosphere is 80 bars, and the surface temperature 
of the planet is 450° C, due to the excessive green¬ 
house effect. The major feedback mechanism is the 
weathering of silicate minerals, in particular Ca and 
Mg silicates, which is only possible thanks to the pres¬ 
ence of liquid water on Earth, whereas it is absent on 
Venus. A smaller feedback mechanism is the storage of 
CO 2 as organic carbon (oil and natural gas, coal, and 
organic carbon dispersed in sediments). Chemical 
weathering can be described as the neutralization of 
an acid by rocks. H 2 C0 3 is the dominant acid during 
weathering. It is converted to bicarbonate in solution. 
The rock is partly dissolved. A residue of clay minerals 
can form as the solid end product. Weathering has been 
a life-saving process for our planet. Not only has the 
interaction of C0 2 and water with rocks saved us from 
a greenhouse problem as found on Venus, but it has 


provided us also with fertile soils, without which 
it would have been impossible for any evolved life 
forms to develop, and it has released the mineral nutri¬ 
ents from the rocks which are a basic requirement for 
plant life. 

Some typical weathering reactions are as follows, 
weathering of olivine: 

Mg 2 Si0 4 + 4C0 2 + 4H 2 0 2Mg 2+ + 4HC0 3 “ + H 4 Si0 4 

(1) 

and weathering of anorthite: 

CaSi 2 Al 2 0 8 + C0 2 + 2H 2 0 ->• CaC0 3 + Al 2 Si 2 0 5 (OH) 4 

( 2 ) 

The first reaction, the weathering of olivine, 
describes an intermediate step in the weathering pro¬ 
cess. After this dissolution step, the Mg-bicarbonate 
solution is transported to the sea, where ultimately it 
will precipitate in the form of carbonate sediments 
(limestones and dolomites). That process, however, 
takes on average hundreds, if not thousands of years, 
so the first step, the formation of Mg- or Ca- 
bicarbonate waters is the relevant step for C0 2 capture 
on a time scale of a few tens of years. The second 
reaction (Eq. 2) demonstrates the formation of a car¬ 
bonate and a clay mineral by the weathering of anhy¬ 
drous silicates. 

Many nongeologists think that the C0 2 that is 
dissolved in the oceans, present as C0 2 in the air, or 
contained in biomass plays the major role in the C0 2 
balance of the Earth. In fact, these reservoirs are only 
short-lived and minor transitory storage rooms. The 
major part of the geochemical C0 2 cycle is represented 
by emission of C0 2 from the Earth, its transformation 
by weathering into a bicarbonate solution, followed by 
its transport to the oceans and deposition as carbonate 
rocks. These carbonate sediments form the ultimate 
sink on a human time scale. Geologically speaking, 
even these solid carbonate sinks are not eternal, but 
are recycled as well, albeit with time scales of hundreds 
of millions of years. The relations can be demonstrated 
with the following table (Table 1) [11]. 

The process of chemical weathering has always pro¬ 
vided the main mechanism to keep the C0 2 levels of 
the atmosphere within reasonable bounds. Of course, 
there have been fluctuations in the course of geological 
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Carbon Dioxide Sequestration, Weathering Approaches 
to. Table 1 Distribution of carbon on Earth. Modified 
after [11] 



Amount of carbon 
(x10 15 kg) 

Relative 
amount (%) 

Limestone 

(CaC0 3 ) 

35,000 

46.6 

Dolomite 

25,000 

33.3 

Sedimentary 

carbon 

15,000 

20 

Recoverable 
fossil fuels 

4 

0.005 

Oceanic C0 2 

42 

0.056 

Atmospheric C0 2 

3 

0.004 

Biomass 

0.056 

0.0007 

Anthropogenic 

emission 

0.03/year 


Input from 

Earth's interior 

0.002/year 



history because the available surface area where reactive 
rocks were exposed to the atmosphere varied; during or 
shortly after major orogenic periods, or after large-scale 
volcanic activity, the area of fresh rocks available to 
weathering was larger than in periods where the land¬ 
scape was more mature, and where many easily 
weatherable rocks were covered by a thick weathering 
crust, which virtually stopped their interaction with 
C0 2 and water. At the same time, it is likely that the 
annual emissions of C0 2 from the Earth were also 
subject to large variations caused by the intensity of 
volcanism, related to the rate of plate spreading. 

To a certain extent , such variations are counteracted 
by the C0 2 concentration of the atmosphere itself which 
acts as its own negative feedback. The higher the C0 2 
pressure , the more acidic the water, and the faster the 
weathering which removes that same C0 2 from the 
atmosphere. 

There is geological evidence that factors like moun¬ 
tain building or massive volcanism, through their 
effects on rates of weathering and emission of C0 2 , 
have influenced the C0 2 pressures of the atmosphere 
in the past and may have led to periods of glaciation 
[12-14]. Since the industrial revolution, atmospheric 


C0 2 levels have risen rapidly (from 260 to 392 ppm), 
largely due to mankind burning in a few 100 years the 
fossil fuels that took hundreds of millions of years to 
form. If weathering is going to be used as a tool to 
counteract this rise in C0 2 , caused by mankind annu¬ 
ally emitting more than ten times as much C0 2 than 
the Earth normally emits, then the weathering process 
must also be made more than ten times more effective 
than normal to reach a new balance. The process as 
discussed here is to select widely available rock types 
that weather easily, mine them and increase their reac¬ 
tive surface area by crushing them, and spread the 
crushed rocks over land or along beaches in areas 
with the most suitable climate for weathering. 

In subsequent sections, only olivine and olivine 
rocks will be discussed. Other rock types can also be 
used, but they are less effective or less available, 
although locally, there may be good reasons to use 
other rock types, such as basalts, anorthosites, or neph- 
eline syenites. In the following weathering reactions, 
the formula of olivine will be simplified to Mg 2 Si0 4 , 
although olivine normally is a mixed crystal of 
Mg 2 Si0 4 and Fe 2 Si0 4 , with the Mg-endmember usu¬ 
ally dominant. One finds a number of ways in which 
the weathering reaction of olivine is described. As this 
affects the amount of C0 2 that can be captured for 
a given amount of olivine, these different weathering 
reactions will be briefly discussed, and the major reac¬ 
tion that takes place in nature will be identified. 
A common way is the reaction in which during 
weathering two new solid minerals are formed, namely, 
magnesite (MgC0 3 ) and serpentine (Mg 3 Si 2 0 5 (0H) 4 ). 

Weathering Reactions of Olivine 

2Mg 2 Si0 4 + C0 2 + 2H 2 0 MgC0 3 +Mg 3 Si 2 0 5 (0H) 4 

(3) 

In this reaction, 2 olivine moles capture 1 C0 2 
mole. 

A second reaction is 

Mg 2 Si0 4 + 2C0 2 + xH 2 0 —> 2MgC0 3 + Si0 2 xH 2 0 

(4) 

In this reaction, 1 olivine mole captures 2 C0 2 
moles. 
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The reaction that occurs normally in places with 
sufficient rainfall 

Alg2Si04 T 4 CO 2 T 4 H 2 O 

Mg 2+ + 4HC0 3 “ + H 4 Si0 4 '' J> 

In seawater, it is convenient to split the reaction into 

M!g2Si04 T 4 H 2 O —> 2Mg 2 + T 40H H4Si04 

(6a) 

followed by 

40H - + 4C0 2 -> 4HC0 3 “ (6b) 

In reactions (5) and (6a and 6b), each olivine mole 
captures 4 moles of C0 2 . Evidence will be presented 
that reaction (5) is the common weathering reaction. It 
has been mentioned already before that the resulting 
Mg-bicarbonate waters will ultimately form carbonate 
rocks in the oceans. During the precipitation of those 
carbonates, half of the captured C0 2 returns to the 
atmosphere, but as this process may take thousands 
of years, we will neglect it as far as it affects climate 
change in the short run. 


In order to find out what is the normal reaction 
mechanism for olivine weathering, a suite of springwa- 
ters issuing from olivine rocks in Turkey were collected 
[15]. One can distinguish the following steps during 
the formation of such springwaters: 

1. Rainwater falls on the ground. 

2. It infiltrates the soil. The C0 2 concentration in soil 
atmospheres is usually around a 100 times larger 
than in the open air, because litter in the soil decays 
and soil fauna respires, both contributing to a high 
concentration of C0 2 in the soil [16, 17]. 

3. After equilibration with this C0 2 -rich soil atmo¬ 
sphere, the water infiltrates the underlying rock and 
reacts with it (the weathering step). 

4. Finally, the water emerges again as a spring at some 
lower point. 

Olivine-rich rocks contain generally, next to the 
dominant mineral olivine, some other silicates, includ¬ 
ing calcium silicates. From Fig. 3, it is evident that there 
is a close relation between the (Mg + Ca) content of the 
water, expressed in milliequivalents, and the amount of 
C0 2 that has been converted to bicarbonate. This close 
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Relationship between calcium + magnesium extracted from the rock and amount of C0 2 captured as bicarbonate in 
springwaters issuing from olivine rocks in Turkey 
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relation has been confirmed in a number of cases all 
over the world, and it can be concluded that the reac¬ 
tion leading to bicarbonate in the solution is the major 
pathway in which C0 2 is removed from the air, taken 
up as bicarbonate in the water and transported to 
the oceans. 

Even when the problem is considered on the scale of 
the whole Earth, the picture remains essentially the 
same. All rivers together transport annually 3.6 x 
10 16 kg of water to the oceans. The weighted average 
of their concentration of Ca is 11 ppm, of Mg 2.6 ppm, 
and of HC0 3 42.4 ppm. Expressed again in 
milliequivalents, this means that Ca + Mg make up 
0.77 meq and bicarbonate 0.7 meq, so, despite all 
complications, there is still a satisfactory agreement. If 
we calculate from these figures the amount of C0 2 that 
is annually brought to the sea by rivers, this comes out 
as 1.1 billion tons of C0 2 , the same order of magnitude 
as the annual emission of C0 2 by the Earth, which is 
obviously larger, as emissions of C0 2 from the seafloor 
are only partly taken into account. Some of this C0 2 
returns to land, takes part in weathering reactions, and 
returns as bicarbonate to the oceans. 


Enhanced Weathering: What Does It Involve? 

Having established that the principal weathering reac¬ 
tion is reaction (5), the formation of magnesium bicar¬ 
bonate solutions, it can now be calculated from the 
stoichiometry of this reaction how much olivine is 
required each year to sequester all the emitted C0 2 by 
enhanced weathering, under the assumption that 
energy savings, switching to green energy and change 
of lifestyle, will not lead fast enough to a lowering of the 
world’s C0 2 emissions, and other technologies for C0 2 
capture will remain negligible. In this scenario, the 
required volume turns out to be approximately 7 km 3 
of olivine rock. This is, of course, large because it must 
match the carbon contained in all the oil, gas, and coal 
that is burnt. It is, however, within the range of modern 
large-scale mining. This volume of olivine is equivalent 
to 10 km 3 of oil. It is worthwhile to come to grips with 
the scale of this mitigation effort (Fig. 4). The largest 
mine in the world, the copper mine at Bingham, Utah, 
USA, has an excavated volume of 25 km 3 . Olivine 
mining at this scale, however, means that olivine 
moves up from a modest commodity to third place in 
the mining industry, after construction materials and 
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It is a sobering thought that these huge bands of listwanite (completely carbonated dunites consisting of 
magnesite + quartz) in Oman, containing 500 million tons of C0 2/ are equivalent to only 1 week of the world's 
anthropogenic C0 2 emissions [4] 
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coal. One should, obviously, not open only one single, 
huge olivine mine but spread the mining operations 
over 30-50 open-pit olivine mines. In order to profit 
from the most suitable climatic conditions for 
weathering, these mines should be strategically spread 
along the tropical zone. A larger number of olivine 
mines will be able to service wider areas without requir¬ 
ing large transport. Each of the selected sites for olivine 
mining must produce in the order of 500 million tons 
of olivine each year to reach the required 7 km 3 per 
year. It may well turn out that it is more favorable both 
economically and environmentally to increase the 
number of olivine mines and decrease their size. At 
the same time, this will spread the economical benefits 
of olivine mining over a larger number of countries and 
further limit the required transport distances. This 
volume of olivine rock must be crushed and milled to 
grains of around 100 pm in diameter. If 7 km 3 is spread 
over an area of 10 million km 2 , it will occupy a layer of 
0.7-mm thickness. In the section on rate of weathering, 
it will be shown that grains of olivine of 100 pm will 
weather in approximately 5 years in tropical soils. It 
will, therefore, be cheaper to spread a layer of 3.5-mm 
thickness each year over an area of 2 million km 2 , shift 
to the next area in the following year, and come back to 
the first after 5 years. 

In a few instances outside the tropical zone, condi¬ 
tions may be so favorable that large olivine exploita¬ 
tions can be located there. For example, a possible site 
for an olivine mine outside the tropical zone might be 
Oman, which has the largest ophiolite zone in the 
world and a long coastline along the Indian Ocean. 
Deeply weathered sections of dunite, dating from ear¬ 
lier more humid periods, can be found, but in its 
present desert climate, the olivine does not weather, 
but forms huge talus deposits on the slopes of the 
mountains, at a short distance from the Indian 
Ocean. These scree deposits have no overburden 
which must first be removed, like in most tropical 
dunite deposits. In Oman, the loose pieces, after some 
additional crushing, can be loaded into ships that have 
direct access to the Indian Ocean. 

Rate of Weathering of Olivine 

Much of the debate on the potential of enhanced 
weathering using olivine grains to counteract climate 


change centers on the problem of the rate of 
weathering. The rate of weathering is enhanced in hot 
humid conditions, hence why this idea is focused on 
the application of crushed olivine in the wet tropical 
regions. 

There are a number of reasons to focus on the wet 
tropics for large-scale applications: 

• Weathering is faster under hot, humid, climatic 

conditions. 

• Tropical soils are usually very poor and can benefit 

from the addition of mineral nutrients. 

This does not mean that the olivine option is useless 
in temperate climates, it is just that the rate of 
weathering will be slower. 

In abiotic laboratory experiments, it was found that 
the surface of olivine grains retreats at a few tenths 
of a micron per year [18]. This is described by the 
shrinking-sphere concept. Such low rates would make 
it difficult to use enhanced weathering to mitigate the 
greenhouse effect. Fortunately, there is observational 
evidence on rates of weathering of olivine in the real 
world (see below), which shows that the rates are more 
than tenfold, and probably 100-fold larger, than those 
found in the laboratory. Qualitative information on 
fast rates of weathering is obtained from volcanic ter¬ 
rains with rocks containing olivine. When volcanism 
started in the Eifel/Germany, synchronous Rhine sedi¬ 
ments downstream in the Netherlands immediately 
started to contain a wealth of volcanic minerals, but 
no olivine , despite the fact that these volcanic rocks 
contain plenty of that mineral. Contrary to the other 
minerals of volcanic origin, olivine has not survived the 
short trip from Bonn to the Dutch border. Similar 
observations are reported from many other volcanic 
terrains in the world. Although suggestive of fast 
weathering, this evidence is difficult to quantify. 

In contrast, the rate of weathering of dunite massifs 
in the tropical zone can be quantified, or at least 
a minimum rate of weathering can be firmly 
established. The first example is the dunite massif of 
Conakry/Guinea. This dunite occupies the entire pen¬ 
insula on which Conakry, the capital of Guinea, is 
situated. It has an approximate length of 50 km and 
an average width of 5 km. Over its entire surface, it is 
covered by a thick lateritic weathering crust, which is 
very clearly visible as a purplish red area on satellite 
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The dunite massif of Conakry, Guinea. The length of the dunite massif (in purplish red) is approximately 50 km 


pictures (see Fig. 5). This lateritic crust, which is the 
iron-rich insoluble red residue of the dunite after deep 
tropical weathering, contains virtually no silica, mag¬ 
nesium or calcium oxides which were completely 
leached out during the weathering process [19]. These 
components make up around 90% of the original 
dunite. This means that 1 m of laterite is equivalent 
to 10 m of dunite, or even more if the remaining 
components of the laterite were not completely immo¬ 
bile but have also been leached to some extent. The 
same author presents evidence that iron has in fact 
been fairly mobile and was partially leached out as 
well, which means that 1 m of laterite is equivalent to 
more than 10 m of dunite. The weathering crust has 
a thickness between 30 and 100 m. The age of the 
dunite (that is to say the time at which this dunite 
intrusion formed) has been determined as 195 million 
years. From these data, it is simple to calculate the 
minimum rate of weathering as follows: 50 m of laterite 
is equivalent to 500 m of dunite, 500 m (= 500 million 
microns) divided by 195 million years is 2.6 pm/year. 
This is already ten times faster than deduced from 
laboratory experiments, but the real rate of weathering 
must have been considerably faster. The rock is an 
intrusion. That means it was emplaced between rocks 
at some depth and covered by other rocks, which had to 


be removed first by erosion before the dunite became 
exposed and could start its weathering process. If the 
dunite intrusion has taken place at 2 km depth, it 
would take 100 million years before the dunite massif 
was entirely laid bare by erosion at an estimated erosion 
rate of the order of 1-2 cm/1,000 years [20]. This is the 
average erosion rate for all continents. This correction 
alone more than doubles the calculated rate of 
weathering. That is not the only positive correction 
that must be made. In more recent times, the 
weathering process, under such a thick weathering 
crust, has virtually come to a standstill, as the thick 
laterite crust effectively shields the underlying rock 
from further interaction with the atmosphere. This 
shortens again the time span over which weathering 
was active, and thereby increases the rate of weathering. 

A further positive correction concerns the differ¬ 
ence between weathering of a solid rock as opposed to 
loose grains. A rock is attacked by weathering from 
above along a two-dimensional front, whereas loose 
olivine grains in soil are attacked from all sides. It 
seems certain that olivine grains in tropical soils dis¬ 
solve at least at a rate of 10 pm/year, but most likely 
even faster. Even when their surface retreats only by 
10 pm/year, a grain of 100 pm will disappear in 5 years. 
A similar calculation can be made for the dunite body 
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at Jacupiranga, Brazil [21]. Here, the rock has an age of 
130 million years, and it is covered by a weathering 
crust of >40 m (this is where the drill hole stopped, but 
at 40 m, the drill was still in lateritic weathering crust). 
The minimum rate of weathering turns out to be 
>3.1 pm/year, but the same positive corrections have 
to be applied as in the Conakry case. 

From a global balance of weathering and erosion, 
similar minimum rates of weathering emerge. The 
average rate of erosion of the continents is 1-2 cm in 
a 1,000 years [20] . As olivine grains from the interior of 
the continents do not make it to the oceans, this means 
that olivine rocks dissolve (= weather) at least at the 
same speed, which is 10-20 pm/year. The most dra¬ 
matic evidence for fast weathering of crushed magne¬ 
sium silicate rocks comes from observations of 
weathering rates of mine dumps of such rocks [22]. 
By measuring the amount of a suite of newly formed 
Mg carbonates, it was shown that the mine tailings of 
two abandoned asbestos mines in British Columbia 
weather extremely fast. In this case, it does not involve 
fresh olivine, but its hydration product serpentine 
(Mg 3 Si 2 0 5 ( 0 H) 4 ) that weathers and produces carbon¬ 
ates. This carbonation proceeds as follows: 

Mg 3 Si 2 0 5 (OH) 4 + 3C0 2 + 2H 2 0 -► 3MgC0 3 + 2H 4 Si0 4 

(7) 

At low temperatures, magnesite seldom forms, but 
in its place, hydrated magnesium carbonates, like 
nesquehonite Mg(HC0 3 )(0H)-2H 2 0, are found 
instead (Fig. 6). 

In order to make sure that these carbonates have 
indeed newly formed, 14 C analyses were performed on 
these carbonates which gave an age of about 0, showing 
that the carbon in these minerals really represents the 
sequestration of present-day atmospheric carbon [23]. 
In one of the cases, the mine dump, occupying a surface 
area of 0.5 km 2 , had captured 82,000 t of C0 2 between 
1978 and 2004, more than 50 times the maximum ever 
recorded for natural weathering under the most favor¬ 
able conditions. The real rate of weathering is even 
higher because the authors have only taken the solid 
products into account, whereas the waters that perco¬ 
late through the mine dumps carry an additional load 
of dissolved weathering products. These waters become 
quite alkaline, and their high silica content leads to 
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Efflorescence of nesquehonite on the serpentinite tailing 
heap at Clinton Creek (Photograph by S.A. Wilson) 

small diatom blooms in a pool at the foot of the tailings 
dump and in at least one of the mine pits [23-25]. 

One may wonder why there is such a large discrep¬ 
ancy between laboratory experiments, showing low 
rates of weathering, and the real world, where 
weathering rates are 100 times larger. The answer is 
relatively simple. Higher plants live in symbiosis with 
mycorrhizal fungi in and around their root system. 
These fungi secrete low molecular organic acids like 
acetic acid, malic acid and oxalic acid that rapidly 
attack mineral grains in the soil [26]. This liberates 
mineral nutrients that are subsequently taken up by 
the higher plants. In turn, the higher plants “reward” 
the fungi by providing them sugars. Lichens act in 
a similar way by secreting oxalic acid that “eats” the 
underlying rock [27]. In the laboratory, mycorrhizal 
fungi and lichens are absent, and this is the reason why 
the abiotic reaction rates that were found in the labo¬ 
ratory are much lower than weathering rates in nature. 

This all refers to weathering rates of olivine on land. 
Much less information is available for olivine on 
beaches, in the surf zone, shallow seas, and tidal 
flats. To understand what happens to olivine upon 
weathering we must distinguish between the chemical 
reaction of olivine with seawater and mechanical 
impacts during grain transport. To measure quantita¬ 
tively the chemical weathering rate of olivine in seawa¬ 
ter, one could use a so-called olivine reactor (see Fig. 7), 
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The olivine reactor (artist's impression) 


a concrete box 50 m long and 4 m wide, constructed 
perpendicular to the coast and filled with olivine sand. 
To obtain even more information from this experi¬ 
ment, the reactor can be partitioned lengthwise, and 
the two compartments can be filled with olivine sand of 
different grain sizes. On its coastal side, there is a low 
inlet, permitting seawater to enter the box during high 
tide. The seawater will drain from the other side toward 
the sea during ebb, after having passed through 50 m of 
olivine sand. By analyzing the water before and after it 
has passed the olivine reactor, one can calculate how 
much olivine reacts during each tidal cycle, how much 
C0 2 it captures, and if and when the reaction starts to 
be hampered by reaction products attached to the 
grains. Once the C0 2 that is transformed into bicar¬ 
bonate arrives in the seawater, it will take time before it 
is transformed into carbonates in the form of shells and 
corals, but no figures are available for this final uptake. 
Upon the formation of carbonate, half of the captured 
C0 2 is liberated again, but that may take centuries by 
when C0 2 -neutral energy production may have been 
well established. It is reported that olivine dissolves 
somewhat faster in saline water than in fresh water [28] . 


The mechanical action, the grinding down of oliv¬ 
ine grains, by waves and currents largely determines the 
rate of weathering of olivine on beaches and in shallow 
seas with strong bottom currents. There is a paper in 
which the rate of weathering of olivine grains on 
beaches is calculated [29], but that is based on theoret¬ 
ical modeling and overlooks the mechanical conse¬ 
quences of the surf, where grains are wearing down by 
the constant rubbing and bumping against each other. 
Olivine beaches are practically restricted to places 
where cliffs of olivine-rich rocks overlook the beach. 
Like on any other beach sand, the olivine grains are 
rounded, which means that they have lost little slivers 
of olivine due to bumping and scraping against each 
other. This has the advantage that one does not need to 
mill the olivine at high cost to very small grain sizes 
because the sea takes the place of the ball mill. These 
little slivers will quickly disappear by chemical 
weathering. In a recent experiment, this surf action 
was reproduced [30]. Grains of olivine were rotated 
in conical flasks. Within 24 h, the crushed olivine grains 
that were originally angular, with a rough surface, had 
transformed into rounded and polished grains (Fig. 8). 
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Carbon Dioxide Sequestration, Weathering Approaches to. Figure 8 

Simulation of surf action. Left : Olivine grit (2-5 mm) at start. Right : Olivine grit, rounded and polished after 3 days shaking 


The clear water at the start had become an opaque 
white suspension of very tiny olivine slivers, half of 
which had a grain size of less than 5 pm. The system 
reacts fast, the pH shoots up to 9.4, and a clay-type 
magnesium mineral is newly formed. The surf is clearly 
the world’s largest, most efficient and cheapest ball mill. 
The experiments also showed that a mixture of differ¬ 
ent grain sizes of olivine wears down more quickly than 
single grain sizes. There are strong currents along the 
bottom of the Southern North Sea, the Channel, and 
the Irish Sea. If one would cover this sea bottom with 
a layer of 1 cm of crushed olivine, this would compen¬ 
sate 5% of the global C0 2 emissions, more than the 
combined C0 2 emissions of the adjacent countries, 
England, France, The Netherlands, Belgium, and 
Ireland, which together produce about 4% of the 
world’s emission. 

For tidal flats, a very different picture emerges. 
Tidal flats are inhabited by huge populations of lug- 
worms (Arenicola maritima). The upper few centime¬ 
ters of the mud pass on average three times each year 
through the guts of the lugworm. In its digestive sys¬ 
tem, weathering is 700-1,000 times faster than outside 
[31]. In experiments in which lugworms were fed with 
small fragments of basaltic rock, these pieces came out 
as small heaplets of clay [32]. As basalt weathers much 
slower than dunite, it can be expected that spreading of 
olivine grains over tidal flats is a very effective applica¬ 
tion of enhanced weathering/sequestration of C0 2 . 


Costs 

Except in cases where the required olivine-rich rock is 
already available in crushed form on mine dumps, the 
application of enhanced weathering implies mining 
and milling of solid dunite rock from large open-pit 
mines. In 2002, two Swedish mining engineers calcu¬ 
lated the cost of mining and milling of bulk rocks in 
large open-pit mines [33]. This turns out to be 
6 Euros/t. After passing the primary and secondary 
crushers at the mine site, the granulated material 
must then be transported from the mine to the points 
of use. If we restrict the transport distance to a maxi¬ 
mum of 300 km around the mine, this adds on average 
6 Euros/t for transport. From the stoichiometry of 
reaction (5), one can calculate that 1 t of olivine will 
sequester 1.25 t of C0 2 . This brings the price for the 
sequestration of C0 2 to slightly under 10 Euros/t, 
making it by far the cheapest method to sustainably 
capture C0 2 . In comparison, CCS, the most commonly 
discussed technology for the sequestration of C0 2 , 
costs between 70 and 100 Euros/t of C0 2 [34]. Even 
though the costs are modest, compared to other options 
that are considered, the cost to sequester all man-made 
C0 2 emissions by enhanced weathering of olivine still 
comes to around 250 billion US$/year worldwide. 

The energy costs, expressed in terms of the C0 2 
expenditure for mining, milling, and transport, have 
been calculated in an LCA (life cycle analysis) [35]. The 
total C0 2 requirement for the operation as a whole 
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turns out to be 4% of the C0 2 that will later be cap¬ 
tured by the extracted olivine. In order to bring the 
olivine grains from the mine to the point of use, several 
transport systems can be considered, including road, 
railroad, river, or sea. Making maximum use of the 
opportunities provided by local conditions will keep 
costs lower. 

Mitigating Environmental and Social Costs 

(This section is mainly based on a personal communica¬ 
tion by Mrs. Gwendolyn Wellmann, community develop¬ 
ment specialist, Port Elizabeth, S.A.) 

There are, however, also environmental costs (and 
benefits) to the olivine option. Every mining operation 
has a negative impact on the natural and social envi¬ 
ronment, even if the mining is done for the purpose of 
solving a worldwide environmental problem. Fortu¬ 
nately, there are ways to overcome the environmental 
damage, or at least diminish it. All dunite bodies in the 
tropical zone are covered by a thick laterite (laterite is 
the typical red soil of tropical countries, an iron-rich 
residue that is left after the weathering of rocks in 
tropical climates). Dunites contain on average 
0.2-0.3% nickel, which is not very soluble during 
weathering. As most of the major components (Si0 2 , 
MgO, CaO) are leached out during weathering, the Ni 
content of the residue rises accordingly and may reach 
2% to sometimes over 3% Ni, making these nickel 
laterites a rich nickel ore [36] . In a number of countries 
(e.g., Brazil, Cuba, Philippines, Indonesia, New 
Caledonia, Madagascar, Malawi), these Ni laterites are 
mined, or mining options are being studied. By con¬ 
tinuing the mining below the laterite crust and also 
mining the underlying dunite, one can avoid the 
impact of clearing a new mining site; the infrastructure 
for mining is already in place at such locations, and 
there is a local population that depends on mining for 
their living. The removal of the overburden consisting 
of Ni-laterite ore is an additional financial bonus for 
the mining of dunite below. This way, one can save the 
environment, help the people, and at the same time 
make olivine mining cheaper. 

Even though mining brings with it many positives 
to an area, including employment opportunities, infra¬ 
structure development, and local economic develop¬ 
ment; and even if it is an olivine mine that will benefit 


the earth, mining also brings with it many negatives, 
such as an influx of job seekers, the increase of HIV/ 
AIDS, violence, and discontent. In terms of mitigating 
the negative impact on the social fabric in a mining 
area, what is most important is the attitude of the 
mining company. The majority of mining companies 
adhere to the International Council on Mining and 
Metals’ Sustainable Development Principles [37], with 
its underlying message that the overall success of any 
mining venture in a populated area is to consult, con¬ 
sult, consult with the local people. This requires the 
company from the onset, which might mean as early as 
the time of exploration activities, to meet with the local 
people; gain their views; understand their concerns; 
seek first to understand what makes the local commu¬ 
nity tick, before expecting them to understand your 
needs; and to devise long-term mitigating actionable 
plans for the rehabilitation of the area. 

Should an olivine mine be established in a poor, 
developing country, it is vital that as the mining com¬ 
pany starts to invest in the community, whether 
through infrastructure, local economic development, 
or alternative livelihood projects, it also invests in the 
development and promotion of a functional local gov¬ 
ernment. This is important because the management 
and maintenance of these projects will ultimately be the 
responsibility of the local government. A weak, nearly 
nonexistent local government will lack the correct pol¬ 
icies, procedures, structures, and systems to ensure 
sustainability of any project invested in by the local 
mine. The result of this is that there is a tendency for 
both the affected communities and the government to 
rely on the mine to assume local government roles. This 
can be problematic as implementing local governance 
is not the mine’s expertise or responsibility, and as 
has been demonstrated numerous times in many 
countries - when the community feels the lack of 
service delivery - they will express their discontent 
by attacking the mine. One of the ways of investing in 
local governance is to expand the skills base of local 
government officials through training. The skills that 
are most frequently lacking in these areas are those 
needed for planning, budgeting, project implementa¬ 
tion, and public consultation. Once olivine mining is 
a certified means of obtaining carbon credits, it is 
recommended that it is supervised by a special inter¬ 
national body. 



Carbon Dioxide Sequestration, Weathering Approaches to 


C 


1923 


Applications of the Olivine Option 

It should be emphasized that the presence of liquid 
water is essential. This rules out all countries with 
a desert climate except along their coasts, where olivine 
grains will disintegrate in the surf and be spread by 
wind and currents. The rate of reaction is increased by 
a high temperature. There are a number of arguments 
to focus on the wet tropics for large-scale applications: 

• Weathering is fastest under these climatic 
conditions. 

• Tropical soils are usually very poor, and can benefit 
from the addition of mineral nutrients. 

For all mines, not just the ones in the wet tropics, 
the following arguments hold: 

• Large mines profit from the economy of scale. 

• Olivine rocks sometimes contain marginal chro¬ 
mite, nickel, or magnesite deposits, and the related 
rock kimberlite is the normal host rock for dia¬ 
monds. If the host rock itself is also put to good 
use, and brings a profit in the form of carbon 
credits, instead of being dumped as a mining 
waste, this may make such marginal deposits 
economical. 

• Once the olivine option is accepted as a legitimate 
means of carbon capture, the olivine mines can sell 
carbon credits. 

There may be additional novel or niche applications 
suitable to certain regions, such as using olivine sand 
instead of quartz sand to strew into Astroturf playing 
fields, using olivine gravel on tennis fields, spreading 
olivine grains on lawns to reduce the growth of moss, 
using olivine as the top layer on footpaths or bicycle 
paths, adding a layer of olivine as roof covering, using 
olivine instead of quartz in sandblasting, using olivine 
grit on icy roads, adding olivine powder to digesters to 
improve the quality of the biogas by transferring the 
C0 2 part of the biogas to bicarbonate in the liquid, and 
using olivine sand in sound barriers along main roads. 
These are all fairly minor applications, although taken 
together they are not negligible. 

A larger application that can be used in the tropics 
as well as in temperate climates is to cover beaches that 
are subject to erosion or tidal flats by olivine sand. 
Olivine is considerably heavier than quartz and will 


not be eroded and transported away from the beach 
as easily as quartz sands. The tides will alternately wet 
the beach and drain the pore water. During such a tidal 
cycle, the olivine will react with the seawater, thus 
adding alkalinity to the sea, which makes it possible 
to store more C0 2 as bicarbonate in the seawater with¬ 
out acidifying it. The olivine grains bump into each 
other or scratch each other when they are moved by the 
surf. They become rounded, and the tiny slivers that 
have come off the grains will very rapidly weather [38] . 
If marine constructions (dams or artificial reefs) are 
built with olivine blocks and sand, this will conceivably 
lead to well-cemented structures after some time. Sea¬ 
water is saturated with CaC0 3 . If the water between the 
olivine pieces is only slowly replaced, it can react with 
the olivine for some time. This raises the pH of the 
interstitial seawater, causing a shift in the carbonate 
equilibria. This leads to a supersaturation of calcite 
which precipitates as a cement between the olivine. 
This process is similar to the formation of beach 
rocks. Filling the solution holes left by underground 
solution mining of salt with olivine powder serves two, 
possibly three purposes. After the cavity is filled with 
olivine, it is no longer necessary to keep the brine-filled 
hole permanently pressurized to prevent it from col¬ 
lapsing because the olivine will support the cavity. By 
injecting C0 2 into the mixture of olivine and brine, one 
captures C0 2 as bicarbonate. The heat of reaction of 
carbonatation + hydratation may heat the water suffi¬ 
ciently to use it for warm thermal medicinal baths, or 
swimming pools. 

There are possible ideas for large-scale or econom¬ 
ical applications of olivine. One such novel idea is the 
application of olivine in biodigesters. Biodigesters are 
installations where agricultural waste, urban sewage 
sludge, and organic waste stream from industry are 
anaerobically digested; during digestion, the organic 
material is turned into biogas. The biogas that is pro¬ 
duced consists for around 2/3 of methane and 1/3 of 
C0 2 . By adding olivine powder to the digester, part of 
the C0 2 gas will be converted to bicarbonate in solu¬ 
tion, so the quality of the biogas will improve. As 
yet unpublished experiments at the Wageningen Agri¬ 
cultural University have shown that this is indeed the 
case, but there were two unexpected additional positive 
results. Most of the smell disappears. This can be 
explained by the fact that the FeO that is liberated 
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when the iron part of the olivine dissolves reacts imme¬ 
diately with H 2 S and forms solid iron sulfides. A second 
effect was even more unexpected. The methane 
increased not only in relative amount but also in abso¬ 
lute amount. In nature, serpentinization of olivine 
under exclusion of free oxygen is often accompanied 
by methane emissions. These are due to the following 
reaction: 

6Fe 2 Si0 4 T C0 2 T 14H 2 0 —> CH 4 T 6 H 4 Si 04 T 4 Fe 304 

( 8 ) 

Sometimes this leads to methane flames, like in the 
Turkish Yanartasi (the rock that always burns), where a 
peridotitic rock is being serpentinized at some depth 
and in Los Fuegos Eternos (the eternal fires) in the 
Zambales ophiolite complex on the main island of the 
Philippines, Luzon. The increase in methane production 
in the biodigesters is probably due to the same reaction. 

Collateral Benefits 

It was already pointed out in the report entitled 
“Geoengineering the climate” of the commission on 
geoengineering to the Royal Society that the conse¬ 
quences of enhanced weathering could be benign in 
principle [39]. One of the most obvious advantages of 
enhanced weathering is that it has the potential to 
mitigate ocean acidification, the decrease of pH in 
response to rising atmospheric C0 2 concentrations 
[40]. The ongoing lowering of the pH endangers the 
growth of coral reefs and likely affects marine life in 
general. A number of other collateral benefits of the 
olivine option can be mentioned. 

Improvement of Soil Productivity 

Tropical soils, except in areas with recent volcanism, are 
generally very poor, as they lack important mineral 
nutrients like magnesium, calcium, potassium, and 
phosphate. A limited amount of these nutrients is 
recycled by the standing forest, but once this is cut, 
the trees are removed and the soil is used for agricul¬ 
tural crops, it loses its productivity very fast. Spreading 
crushed dunite rock will solve the magnesium problem, 
and the calcium problem as well to some degree. Kim¬ 
berlites, the host rock of diamonds, are also olivine- 
rich, but in addition they contain a potassium mineral 
(phlogopite) as well as often some phosphate. In order 


to provide a more balanced mixture of major nutrients, 
one should spread the crushed kimberlite of which 
hundreds of millions of tons are lying on the mine 
dumps of diamond mines. At the request of the author, 
seepage waters from kimberlite tailings in India and 
South Africa were analyzed. It was found that the 
waters that seep through these kimberlite tailings have 
a high pH and sequester large amounts of C0 2 as 
bicarbonate. Moreover, they can contain significant 
concentrations of potassium (data from the de Beers 
mine tailings in South Africa and the Panna diamond 
mine in India, Schuiling, unpublished). Again, one 
should avoid as much as possible mining wastes that 
contain chrysotile, although this white asbestos is con¬ 
siderably less dangerous than crocidolite (blue asbes¬ 
tos) or amosite (brown asbestos), which are not 
associated with dunites or peridotites. Recent research 
by Arcadis has shown that chrysotile weathers quite 
fast, and thereby loses its fibrous character [41]. 

Mindful of the beneficial effects of volcanic rock on 
the fertility of soils, it may also be a good idea to spread 
mixtures of crushed dunite and rock meal, like volcanic 
tuffs in areas with poor soils. Although the rock meal 
weathers considerably slower than dunite powder, and 
thus contributes much less to C0 2 sequestration, the 
combination is likely to increase productivity of the soil 
for a number of years, by which it can contribute to 
world food production. In view of the scarcity of potash 
fertilizer and its high price, it is recommended [42] to 
use powdered nepheline syenites to provide potassium 
to poor tropical soils, as the nepheline (Na,K)AlSi0 4 
weathers considerably faster than potash feldspar. Acid 
sulfate soils form another problem that can be solved 
by spreading olivine. The Mekong delta, as well as a 
number of soils in estuaries along the East coast of 
Australia, suffers from high acidity, once the soil is 
plowed in preparation for agriculture. This exposes the 
pyrite in these soils to the oxygen in the air. The pyrite 
oxidizes and produces sulfuric acid, which reduces their 
rice productivity (Mekong) or sugarcane production 
(Australia) considerably [43]. If crushed olivine is 
spread on these acid sulfate soils, its weathering will 
neutralize the acid and thus increase their productivity. 
If the neutralization reaction is simplified to 

2 FeS 2 + 2Mg 2 Si0 4 + 7.50 2 4MgS0 4 + Fe 2 0 3 + 2Si0 2 

(9) 
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this would mean that 1 t of soil with 1% of pyrite 
would require close to 12 kg of olivine for complete 
neutralization. 

Production of Biofuels from Siliceous Algae 

In a very different manner, olivine can help to produce 
biofuel from algae instead of from land-based crops 
which use up land that is thereby lost for world food 
production and cost large amounts of scarce irrigation 
water [44, 45]. Diatoms (siliceous algae) grow very fast 
and are a proven raw material for the production of 
biodiesel, as they have a lipid content around 50%. 
Their growth, unfortunately, is often limited by the 
availability of silica, which they need to build their 
silica skeletons. During weathering, olivine releases 
large amounts of silica in solution. If one constructs 
a lagoon, preferably where the coastal waters carry large 
loads of urban or agricultural waste, diatoms can be 
grown in the following manner. The lagoon should be 
along a coastal stretch where the beach can be covered 
by olivine sand. A dam must be constructed around the 
lagoon to separate it from the open sea, except for one 
or a few U tubes through this dam. These U tubes 
permit the exchange of water between the lagoon and 
the open sea. The opening on the lagoon side of each 
tube should be covered by a perforated metallic plate, 
which acts as a support for a plankton net. During high 
tide, the water will flow into the lagoon and wet the 
olivine sand on the beach. During low tide, there will be 
an outflowing current of the water, including the part 
that formed the pore water between the olivine grains 
on the beach that is drained during ebb. This outflow 
will also carry the diatoms, but these are retained on the 
plankton net, where they can be harvested to be used as 
the raw material for biodiesel production. It is expected 
that in such diatom farms almost pure diatom cultures 
will form, as they have a competitive edge over organ¬ 
isms that do not use silica. 

The growth of diatoms might also be stimulated for 
quite a different purpose. More and more often, coastal 
waters are threatened by poisonous dinoflagellate 
blooms (“red tides”), causing massive fish kills, and 
threatening human health. The main reason is believed 
to be the supply of untreated urban waters that contain 
high levels of nutrients. If one could stimulate the 
growth of fast-growing and nonpoisonous diatoms to 


consume the contained nutrients in these coastal 
waters, this may reduce or eliminate the dinoflagellate 
blooms. Besides, diatoms are an excellent fish food. The 
stimulation can be done by applying the same method, 
covering the beaches with olivine sand which will 
provide the silica that is necessary for the growth of 
the diatoms. 

Geopolitical Implications of the Olivine Option 

Although many scientists realize that CCS is a very 
costly operation [34], and is unlikely to solve the C0 2 
problem at the required volumes, it is still the most 
widely discussed and preferred option, particularly 
among Western politicians and those that may benefit 
from the uptake of the technology. This can make for 
a powerful lobby. Many scientists urgently call for 
a reduction of C0 2 emissions, by increasing efficiency, 
changing our lifestyles, and switching to sustainable 
energy (wind, water, solar power, biomass, geothermal 
energy, or even nuclear energy). In the long run, man¬ 
kind will be obliged to make the change to renewable 
energy anyhow, as our reserves of oil and gas are dwin¬ 
dling. Although there are still vast reserves of coal, these 
too are not endless. It is evident that the world should 
reduce its carbon footprint as quickly as possible. Some 
are concerned that the possibility of C0 2 sequestration 
may provide an excuse not to change current lifestyles, 
sometimes called the “moral hazard” argument. Trying 
to change people’s attitudes and lifestyles is necessary, 
but it may be a slow process, whereas the danger of 
climate change requires immediate action. The two 
sides have a different time frame, and both are needed 
to avoid the imminent climate change caused by rap¬ 
idly rising C0 2 levels in the atmosphere. 

The picture is complicated by the fact that the 
largest industrializing nations (such as China, India, 
Brazil, South Africa, and Indonesia) all possess vast 
coal reserves and want to lift the standard of living of 
their people as quickly as possible. One requirement to 
reach that goal is the unlimited access to cheap energy. 
Reduction of emissions is, therefore, not necessarily 
a top priority. Enhanced weathering ideas may be 
a way to break this deadlock by providing a means of 
compensating their emissions by using olivine. All five 
industrializing nations mentioned also possess vast 
reserves of olivine rocks. They can exploit these with 
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their own workforce for 10% of the cost of CCS for the 
same amount of C0 2 sequestered, and it will give them 
large employment opportunities. Moreover, if they 
produce more olivine than required to meet their 
own agreed quota, they can sell surplus carbon credits 
to the other countries for 15 Euros/t of C0 2 , and still 
make a profit. This will permit them to continue to use 
their cheap energy based on coal, while still doing their 
part to counteract climate change. 

Future Directions 

A number of niche applications of the concept of 
enhanced weathering of olivine have already been set 
into motion, like the inclusion of olivine in roof cov¬ 
erings, the addition of olivine to fertilizers or potting 
soil, or the covering of (bicycle) paths with olivine 
sand. Several other niche applications are under 
development. A breakthrough, though, depends on 
the certification of the olivine option for carbon 
credits, and the start of large-scale olivine mining. In 
the global climate debate, enhanced weathering can 
play a major role to reconcile the views and aims of 
the West and those of the emerging economies (China, 
India, Brazil, South Africa), if it is accepted that 
these countries may continue to use their vast coal 
reserves as a source of cheap energy, on condition that 
they compensate their C0 2 emissions by enhanced 
weathering of olivine. 
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Glossary 

Dynamic models Models that incorporate a time- 
varying element. Such models may have no spatial 
dimension or extend to modeling spatial variation 
in all three dimensions. 

Integrated multi-trophic aquaculture (IMTA) 

Cocultivation of aquatic species at different trophic 
levels, such as salmon, mussels, and kelp or scallop 
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and sea cucumber, in order to maximize yield and 
minimize the environmental footprint. 

Modeling framework Models are a representation of 
reality, which they aim to reproduce in terms of 
generality, realism, accuracy, and simplicity. As a 
rule, generality and simplicity are maximized. 
A model framework usually consists of two or 
more models, which are combined to address dif¬ 
ferent levels of complexity. 

Research models Models that are more detailed, usu¬ 
ally complex to develop, implement, and apply. 
Aimed mainly at the scientist, rather than the 
manager. 

Screening models Models that use a limited set of 
inputs, and provide highly aggregated outputs, 
such as an index of suitability or an environmental 
score card. 

Virtual technology for aquaculture Any artificial rep¬ 
resentation of ecosystems that support aquaculture, 
whether directly or indirectly. Such representations, 
exemplified by mathematical models, are designed 
to help measure, understand, and predict the 
underlying variables and processes, in order to 
inform an Ecosystem Approach to Aquaculture. 

Definition of the Subject and Its Importance 

Aquaculture, defined simply as the cultivation of 
aquatic organisms, has many similarities to agriculture, 
most notably that it is based on the interaction between 
humans and other elements of the natural system, 
converting the latter (at least in part) into a managed 
system. 

In parts of the world, such as Southeast Asia, the 
distinction between the two types of cultivation 
becomes increasingly fuzzy, especially if they take 
place on land. Cocultivation of rice and tilapia in 
paddy fields [ 1 ] or the combination of penaeid shrimp 
(e.g., the whiteleg shrimp Penaeus vannamei ) and water 
spinach (Ipomoea aquatica) in earthen ponds is com¬ 
mon, as are many other combinations (Fig. 1). Carry¬ 
ing capacity in such intensive systems, whether in 
monoculture or in integrated multi-trophic aquacul¬ 
ture (IMTA), might at first glance seem equivalent to 
assimilative capacity. 

Aquaculture in open water, whether in reservoirs, 
lakes, or coastal systems, must take into account the 
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Pigs might fly? Cocultivation of pigs and fish such as carp or 
rohu in India http://harfish.gov.in/technology.htm 

complexities of water circulation, together with the 
harmonization of different uses. In the context of 
organically extractive open-water culture, Bacher et al. 
[2] and Smaal et al. [3] defined carrying capacity as: 

The standing stock at which the annual production 
of the marketable cohort is maximized. 

Although this definition was proposed for bivalve 
shellfish culture, it is sufficiently broad to be relevant 
for production in open freshwater, coastal, and off¬ 
shore environments, as well as in land-based systems 
using ponds or raceways. However, production carry¬ 
ing capacity needs to be further qualified, because in 
economic terms the maximization of annual produc¬ 
tion is not the objective function, and brings with it 
increased environmental costs. Commercial produc¬ 
tion must seek to achieve optimal profit , well before 
the inflection point in the production function, where 
total physical product (TPP) maximizes income (e.g., 
[4,5]). 

This production-oriented view of carrying capacity 
for aquaculture, whether in terms of assimilative capac¬ 
ity for fed aquaculture such as finfish or shrimp or with 
respect to food depletion in the case of shellfish such as 
oysters, clams, or abalone, has been expanded over the 
last decade into a four pillar approach based on phys¬ 
ical, production, ecological, and social carrying capac¬ 
ity [6, 7]. These pillars encompass the three elements of 
sustainability, viz., planet, people, and profit. 

The terms carrying capacity and assimilative capac¬ 
ity are frequently associated with models of aquacul¬ 
ture impacts. Because these terms attempt to define the 
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Marginal analysis indicates that the seed that provides maximum profit (red arrow) falls well to the left of the maximum 
production, shown by the dotted line. Production beyond the optimal profit point adds no value and potentially 
increases environmental pressure (Results from the FARM model. Adapted from [4]) 


limits of sustainable aquaculture, predictive capability 
is highly sought (Fig. 2). 

This chapter reviews the state of the art in modeling 
frameworks that assist with that prediction and sup¬ 
port proactive management of aquaculture. 

Introduction 

The establishment of aquaculture activities in different 
geographical areas has historically been a bottom-up 
process, without any systemic planning or definition of 
a zoning framework. This is seen throughout the world, 
from the development of salmon cage culture in 
Scottish lochs to the incremental destruction of man¬ 
groves for construction of shrimp farms (Fig. 3). 

This approach to licensing (or in many cases just to 
development) has been based on space availability and 
limits to production rather than on any environmental 
criteria, and has led to undesirable ecosystem effects, 
including habitat destruction both on land and in 
open waters, coastal eutrophication through increased 
nutrient loading from land, organic enrichment of 
sediments, loss of benthic biodiversity, and major out¬ 
breaks of disease. 


In the last decade, better regulatory frameworks 
have led to a more stringent approach to licensing, 
most notably in the European Union, the United States, 
and Canada. The European Unions Marine Strategy 
Framework Directive (EU MSFD - EC [80]), together 
with guidelines for an Ecosystem Approach to Aqua¬ 
culture (EAA - [8]), highlights the ecological compo¬ 
nent and aims to optimize production without 
compromising ecosystem services. Part of the challenge 
of determining carrying capacity is the quantification 
of negative externalities as a first step toward improved 
management. 

In Brazil, for instance, where aquaculture has grown 
very rapidly over the past decade (Fig. 4), environmen¬ 
tal permitting of new tilapia farms in reservoirs is 
determined through the application of the Dillon and 
Rigler [9] phosphorus loading model, a rather simplis¬ 
tic view of carrying capacity. 

A maximum limit of 30 pg L -1 of P has been 
established for reservoir waters, 5 pg L -1 of which is 
reserved for fish farming, to allow for multiple uses 
including cattle ranching, sugar cane production, 
urban discharges, and the natural background. 
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Expansion of shrimp ponds over a 13-year period in Estero Real, Nicaragua [83] 


Fish farms are licensed sequentially, based on the 
contribution to P loading of their declared production. 

Although this approach does address carrying 
capacity at the system scale (i.e., AP = 5 pg L _1 ), it 
does not consider any spatial or temporal variation, 
nor does it account for factors such as organic enrich¬ 
ment, disease, or impacts on biodiversity - all of which 
may be linked. 

Whereas production and social (e.g., tourism) 
impacts are often local, ecological impacts of 


aquaculture can have far broader scales, as can large- 
scale effects on aquaculture, such as advection of harm¬ 
ful algae from offshore [10]. Sustainability is “easier to 
plan than to retrofit,” [10] which makes a case for the 
analysis of carrying capacity at the system scale 
(e.g., Nobre et al. [11]), i.e., defining and quantifying 
the overall potential for different types of aquaculture 
prior to local-scale assessment (e.g., [12]) of new 
operations. Simulation models of varying degrees of 
complexity must play a role in the determination of 
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Production of farmed tilapia in Brazil (MPA, personal communication, 2011) 


carrying capacity for aquaculture, often combined as 
model frameworks, given the range of time and space 
scales and the number of processes involved. 

Over the next decades, the growth of aquaculture 
will take place in developing nations [13-15], which 
makes it paramount that the digital divide, which is 
already considerable (as is the legal divide; see [16]), 
does not become wider. 

Simulation technology that can support planning 
should be close to the production centers and be able to 
deal with data-poor environments and limited compu¬ 
tational access and skills. 

Modeling Frameworks for Carrying Capacity 

The concept of carrying capacity in aquaculture, based 
on four pillars of sustainability, has been adapted in 
Fig. 5 to include governance [16,17]. This is considered 
more relevant than the physical element, which in 
many respects is encapsulated in the production pillar. 
Governance, on the other hand, is clearly missing from 
the original model of Inglis et al. [6], and the quality of 
balanced regulation, stakeholder involvement, and 
community-based management [18] often constitutes 
the difference between sustainable aquaculture and an 
environmental time bomb. 

The social (here used in the context of social oppo¬ 
sition to visual or other impacts of aquaculture devel¬ 
opment, such as conflict with leisure areas) pillar is at 
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The relative importance of the four pillars of carrying 
capacity (Adapted from [17]) 


the forefront of decision-making for aquaculture in the 
EU, the US, and Canada and can frequently be identi¬ 
fied as the single most important criterion for carrying 
capacity assessment and site selection (Fig. 5). By con¬ 
trast, in Asia and other parts of the world where food 
production is the paramount concern, licensing criteria 
are more frequently based on the production pillar, 
with ecological considerations assuming less relevance. 
In China and Southeast Asia, the social component acts 
as a driver for increased aquaculture, for reasons of 
economy and food security. Governance is not usually 
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limited by a lack of legal instruments [16] but often by 
their adequacy and acceptance by stakeholders. 

Two of the pillars illustrated in Fig. 5, production 
and ecology, are amenable to mathematical modeling, 
and two are not. This does not mean that those math¬ 
ematical models will be entirely successful in describing 
growth, environmental effects, and particularly ecosys¬ 
tem responses, but they do make a significant contri¬ 
bution to the improved evaluation of carrying capacity. 
Part of the difficulty lies with our understanding of the 
relevant processes, parameters, and rates, part with 
other factors, such as the lack of a paradigm in ecology 
to support prediction. For instance, in the EU MSFD, 
as in other legal instruments, there are complex 
descriptors of ecosystem health such as biodiversity, 
and the scientific community struggles to establish 
meaningful classification systems and their relation¬ 
ship to anthropogenic pressures. 

There are also interactions with the human compo¬ 
nent of cultivation that constitute simulation chal¬ 
lenges. Culture practice, for instance, is widely 
variable and can generally only be modeled in average 
terms [19]. 

Issues related to disease, which fall squarely between 
production and ecology and are extremely difficult to 
model, are a huge challenge in aquaculture and 
often include a significant element of poor governance, 
such as relaying of infected animals to hitherto 
uncontaminated areas. 

Rearing large numbers of animals of the same spe¬ 
cies in close proximity to each other favors the estab¬ 
lishment and spread of infectious diseases within those 
farmed populations [84]. Disease can affect both sur¬ 
vival and growth of animals. Outbreaks of highly viru¬ 
lent bacterial and viral diseases can result in high 
mortality of an affected farm stock. For example, out¬ 
breaks of white spot syndrome virus (WSSV) in farmed 
whiteleg shrimp can result in greater than 60% mor¬ 
tality, with an attendant dramatic impact on the farms 
and regions dependent on farming this species. 

Additionally, with ever increasing demands from 
consumers for high-quality products, the effects that 
disease can have on product quality are of increasing 
importance. For example, tilapia that have survived 
outbreaks of Francisella often have unsightly lesions 
in the fillets. There are also examples of diseases, 
such as Red Mark Syndrome in rainbow trout [85], 


that do not result in any significant mortality or mor¬ 
bidity, but still result in the product being downgraded 
or rejected by processors after harvest, imposing signif¬ 
icant economic costs on the farmer. 

The ecological effects of disease in aquaculture can 
also be profound. This includes spread of pathogens 
from farmed fish to wild fish and vice versa [86]. 
Disease agents that affect aquaculture species generally 
have their origins in wild aquatic animals [86]. Many 
systems (e.g., salmonid netcage and bivalve culture) 
involve rearing the farmed animals in relatively open 
systems where the reared animals are in direct contact 
or close proximity with wild animals. Water is an ideal 
medium for the dissemination of many pathogens, 
leading to a high risk of transfer. For instance, in 
Norway and Scotland, exchange of sea lice 
Lepeophtheirus salmonis between farmed Atlantic 
salmon and wild salmonids has been implicated in 
causing significant declines in populations of wild 
Atlantic salmon and sea trout. 

In order to examine modeling frameworks, the 
following sections review the definitions of, and dis¬ 
tinctions between, the pillars of carrying capacity that 
are more amenable to mathematical modeling. 

Assimilative Capacity and Carrying Capacity 

Assimilative capacity is sometimes considered a sub¬ 
class of the more general term carrying capacity, but 
other more specific definitions have been applied. We 
make a distinction as detailed below. Assimilative 
capacity refers to the ability of biological systems in 
the water column or sediments to process organic 
matter, nutrients, therapeutants, or contaminants 
without alteration of ecosystem state or function. 

Carrying capacity refers to the biomass of cultured 
organisms that can be supported without altering sys¬ 
tem state or function measured by water or sediment 
quality standards and processes [20, 21]. The latter is 
thus determined by aquacultured biomass [22], where 
the former is independent of aquaculture and deter¬ 
mined by biological properties of the habitat. 

The application of standards to models of impact 
takes two forms: 

1. Absolute criteria determined by regulatory bodies 

such as the Water Framework Directive of the EU 

(EUWFD-EC [79]) 
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2. Relative standards compared to reference condi¬ 
tions or the range of variation observed in the 
environment 

In both cases, there is an attempt to use these values 
as sustainability criteria. In the case of absolute stan¬ 
dards, the background of the environmental conditions 
is not considered. For example, naturally eutrophic 
waters may show higher nutrient levels, with no rela¬ 
tionship to aquaculture. This is particularly true of 
chlorophyll impacted by shellfish depletion. It may be 
more realistic to consider depletion in the context of 
the range of values observed in the environment as 
a means of establishing whether aquaculture signals 
can be detected against background noise. This type 
of standard has been applied to shellfish depletion of 
chlorophyll by Filgueira and Grant [20] and to shellfish 
biodeposits by Grant et al. [23]. 

Tett et al. [22] formalize both carrying capacity and 
assimilative capacity as the result of dose-response 
curves, couched in the terminology of DPSIR 
(Drivers-Pressure-State-Impact-Response; see also 
[24]; Fig. 6), where pressure is the farmed biomass 
and state is the system response modeled as 
a water quality standard. By comparing physical and 


geochemical rate processes, a net balance is deter¬ 
mined, whereby the quantity of interest displays an 
increase (e.g., ammonia) or decrease (e.g., dissolved 
oxygen) relative to a water quality standard. When 
two of these processes are balanced, an index is created, 
constituting some of the earliest models in this research 
area, sometimes referred to as screening models. 

A further distinction is that assimilative capacity be 
a function only of biological processing. Tett et al. [22] 
use the net results of water quality models which 
include advection-diffusion as well as biological 
processing to define assimilative capacity. Following 
bioenergetic lexicon, assimilation is a metabolic pro¬ 
cess involving digestion and/or decomposition, but 
excluding physical exchange. 

There has been some decoupling of the measure¬ 
ments used to characterize aquaculture impacts in sed¬ 
iments (redox, sulfides; [25]) with models that are 
based on oxygen fluxes. Brigolin et al. [26] used 
a more classical early diagenesis model to examine 
coupled redox reactions at fish farm sites. This is one 
of the few cases where the field measurements used in 
regulation could be compared to model output includ¬ 
ing total sulfides and seabed oxygen consumption. 
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Conceptual framework of assimilative and carrying capacities [22] 
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In terms of benthic impacts, assimilative capacity 
as the ability of sediments to accept some degree of 
organic enrichment without generating anoxic crises is 
well grounded conceptually. A comparison of benthic 
carbon supply via fish feed and fecal input to oxygen 
demand of the sediment forms the basis of some the 
original models of benthic impact [27] including the 
long-standing Norwegian MOM approach [21]. This 
concept was widely applied to multiple salmon sites by 
Morrisey et al. [28]. De Gaetano et al. [78] produced 
a more sophisticated version of the model for Mediter¬ 
ranean fish culture. 

The intermediate disturbance hypothesis (e.g., 
[29]) provides a functional form to the model by 
suggesting a parabolic response of benthic diversity 
and activity to organic loading. Stimulation of aerobic 
demand occurs at low enrichment, peaking at interme¬ 
diate levels and declining at high levels (Fig. 7; [30]). 

The latter authors define “acceptable aquaculture” 
as keeping sediment oxygen demand at or below the 
peak. They produced a numerical model which 
included a 3D circulation component, organic deposi¬ 
tion, and decomposition based on a sediment diagen¬ 
esis model using oxygen fluxes and anaerobic 
processes. Output is in the form of mapped values of 
organic loading based on fish stocking density that 
produces maximum aerobic oxygen concentration 
(Fig. 8). 

The map shows that farms within the inner parts of 
the inlet have less permissible organic sedimentation 
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Acceptable limits for organic matter loading from fish 
farms and the relationship to benthic oxygen uptake and 
sediment sulfide content [30] 


due to reduced flushing. We suggest that this example is 
the appropriate approach to assimilative capacity since 
it models system function and its response to organic 
loading based on the capacity of the benthos to absorb 
organic matter while maintaining oxic conditions. In 
addition, inclusion of a spatial diffusion-advection 
model allows incorporation of nonlocal processes and 
provides mapped output. Results are inherently inclu¬ 
sive of far-held effects and include interaction of mul¬ 
tiple farms. Although the paper did not have the 
context of marine spatial planning, it is clearly appli¬ 
cable in terms of both the approach and the results. The 
comprehensive nature of this study and its faithfulness 
to the concept of assimilative capacity make it note¬ 
worthy in the literature. 

Models of assimilative capacity, like those for shell¬ 
fish carrying capacity, differ in spatial resolution. The 
original index models are 1-box, 0D models treating 
any body of water as a single basin. Physical exchange is 
thus averaged over the entire domain. This is often 
problematic since estuaries or other embayments are 
places with strong gradients in flushing. In addition, 
aquaculture sites are averaged in this scheme, so no 
questions regarding optimal location can be addressed. 

One hybrid approach which solves some of these 
problems is the use of a full circulation model whose 



□ Fish Farming site 
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Isopleths of the limit values of the organic matter loading 
rate (pmol0 2 /cm 2 /day) in Gokasho Bay, estimated by 
a numerical model [30] 
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output is applied to local models. In this case, at least 
the physics is location specific, and subregions of the 
system may be considered without the OD averaging. 

Lee et al. [31] applied this approach to yellowtail 
culture in Hong Kong Harbour by comparing oxygen 
delivery to fish oxygen consumption and determining 
net oxygen concentration relative to water quality 
standards for different levels of stocked biomass. 
Similar schemes have been used for finfish culture in 
Scotland [22]. 

Models for Finfish, Shrimp, and Bivalve Culture 

Models for Open Water Culture Models for organic 
loading by finfish include submodels of circulation, 
particle transport, and benthic response [21]. Models 
may also include a phytoplankton-nutrients- 
zooplankton (PNZ) component, simulating 
trophodynamics in the water column. This may be 
necessary due to the importance of phytoplankton 
and the microbial loop as nutrient processors [22]. 
For either finfish or shellfish, some version of a farm 
production or bioenergetics model is essential to esti¬ 
mate waste outputs from cultured animals. Benthic 
models applied to aquaculture are typically diagenetic, 
aimed at resolving sediment decomposition and nutri¬ 
ent regeneration [78]. 

For finfish, benthic deposition of farm wastes and 
consequent impacts are the primary emphasis; since 
fish are not dependent on the environment for food, 
trophodynamics are less relevant. Similarly, the models 
are usually localized since much of the waste material 
remains near the cage site (e.g., DEPOMOD). None¬ 
theless, there is concern that wastes reach the far field 
and produce negative benthic impacts. Despite this 
potential, far-held models of finfish cage culture are 
uncommon. Some early examples used 3D circulation 
models to examine waste dispersal over kilometer 
scales [87]. 

In a more recent example, Symonds [32] compared 
near-held models such as DEPOMOD to a far-held 
model based on a 3D circulation. He found that the 
dependence of near-held models on limited current 
meter observations was subject to the noise and uncer¬ 
tainty inherent in those measurements. In addition, the 
far-held model had the potential for bidirectional 
transport of waste, whereas the near-held model had 


permanent escape from its limited domain causing 
potential underestimation of benthic impacts. In addi¬ 
tion, the far-held model allows consideration of mul¬ 
tiple farms and their interaction. Skogen et al. [33] used 
a 3D circulation model coupled to a full ecosystem 
model to study the effects of hsh farms on far-held 
oxygen, nutrients, and primary production, conclud¬ 
ing that eutrophication in the far held of a Norwegian 
fjord was not enhanced by the farm. 

The goal of most bivalve shellfish models is to 
understand seston depletion as a limiting food resource 
for farmed animals. This requires a trophodynamic 
model which includes primary production, bivalve 
grazing, and advection-diffusion. Because farmed 
shellfish feed on phytoplankton from distant sources, 
recent examples involve models which include the far 
held. Nevertheless, there are many examples of 1-box, 
OD models in which the physics and biology are 
averaged over the basin. Environmental impact of 
biodeposition to the benthos has been considered in 
several models including spatial examples [30, 34], but 
this is less frequently addressed compared to seston 
depletion. The opposite is true for held studies where 
benthic impacts are emphasized, and seston depletion 
is rarely addressed due to the difficulty in observing it. 
Because seston depletion occurs hrst at a farm scale, 
models of the depletion process have also been created 
at the local scale, prominently the FARM model [4]. 
Far-held models of seston depletion are increasingly 
common [20, 35]. 

It may be concluded that models of hnhsh aquacul¬ 
ture impacts would beneht from more spatial realism 
and far-held content, as well as further emphasis on 
assimilative capacity. Similarly, shellfish models would 
beneht from inclusion of benthic impact prediction in 
association with existing focus on seston depletion. The 
development of assimilative capacity models would be 
identical for both hnhsh and shellfish culture. This also 
places the context to that of ecosystem function. The 
present context of faunal indices based on carbon 
deposition in local models is important [36, 37], but 
is predicted on the basis of somewhat tenuous empir¬ 
ical relationships which are a departure from the more 
quantitative nature of the models. 

Models for Land-Based Culture Models that simu¬ 
late land-based culture taking place in ponds, tanks, or 
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raceways use many of the biogeochemical features 
described above, but the physics is simplified to 
a reactor type of system and serves to determine 
water exchange and effluent loading to adjacent water 
bodies. Such models are cheaper to develop and imple¬ 
ment than the examples given for open waters (here 
including lakes and estuaries, where water circulation 
should also be accounted for). According to the latest 
figures from FAO, over 70% of freshwater aquaculture 
in China takes place in ponds, corresponding to an area 
the size of New Jersey and an annual production of over 
15 million tons [88]. 

This number is triple the total aquaculture produc¬ 
tion of America, Europe, and Africa combined, which 
suggests that substantial emphasis should be placed on 
models that can assist with site selection, optimization 


of carrying capacity, and evaluation of negative envi¬ 
ronmental externalities of pond culture. 

Various examples of this type of approach exist 
(e.g., [38, 77]. Figure 9 shows mass balance results 
obtained with the Pond Aquaculture Management 
and Development (POND) model [39], which simu¬ 
lates the production and environmental effects of 
shrimp, fish, and bivalves cultivated in ponds, in 
monoculture or IMTA. 

The model was run for a site at 23°N, with a daily 
water renewal of 3% of the pond volume, and estimates 
an environmental discharge of about 45 kg of nitrogen 
(mostly dissolved, but also as algae), roughly 14 popu¬ 
lation equivalents (PEQ) per year for the 90-day culti¬ 
vation cycle. The cost of offsetting these emissions is 
over 500 USD [40]. In developing countries, these 
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Mass balance of whiteleg shrimp (Penaeus vannamei) culture, including production and environmental externalities for 
a 1-ha pond 
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waste costs are not internalized but rather imposed on 
to the environment, contradicting the first principle of 
the ecosystem approach to aquaculture [8] that: 

► Aquaculture should be developed in the context of 
ecosystem functions and services (including biodiver¬ 
sity) with no degradation of these beyond their 
resilience. 

By contrast, pond production in the United States 
already requires a National Pollutant Discharge Elimi¬ 
nation System (NPDES) permit [41]. Often, this means 
that large agro-industrial companies from developed 
countries price-leverage the lack of environmental reg¬ 
ulation and/or implementation in the developing 
world. 

Models of this kind can be coupled with a range of 
other models to provide a decision support framework. 
POND uses the well-tested Assessment of Estuarine 
Trophic Status (ASSETS) model [42], providing 
a color-coded assessment (Fig. 9) of the degradation 
of water quality over the culture cycle. 

Models for Integrated Multi-trophic Aquaculture 

IMTA has been practised in Asia for thousands of 
years. In the fifth century B.C., the Chinese 
aquaculturalist Fan Lee [81] wrote: 

► You construct a pond out of six mou of land. In the 
pond you build nine islands. Place into the pond plenty 
of aquatic plants that are folded over several times. 
Then collect twenty gravid carp that are three chih in 
length and four male carp that are also three chih 
in length. Introduce these carp into the pond 
during the early part of the second moon of the year. 
Leave the water undisturbed, and the fish will spawn. 
During the fourth moon, introduce into the pond one 
turtle, during the sixth moon, two turtles: during the 
eighth moon, three turtles. The turtles are heavenly 
guards, guarding against the invasion of flying 
predators. 

A substantial proportion of Asian aquaculture 
currently takes place in cocultivation, improving 
production, optimizing resources, and reducing envi¬ 
ronmental waste [14]. Despite this oriental wisdom, 
multi-trophic aquaculture is still rare in North America 
and Europe, although commercial interest is growing 
rapidly. This is reflected in research (e.g., [43,44] ), with 


the annual number of scientific publications on IMTA 
doubling from 2007 to 2010 (SCOPUS). 

The combinations of species, relevant proportions, 
and culture practice are key to successful IMTA. There 
is traditional knowledge in China and other parts of 
Asia on what works best, but advances in mathematical 
modeling can make a substantial contribution by quan¬ 
tifying energy flows, production, and environmental 
externalities ([45], Nobre et al. [11], [16]). 

Figure 10 shows a simulation of IMTA for a 
shrimp farm, with cocultivation of the Pacific oyster 
Crassostrea gigas at different densities, using the POND 
model. As the oyster density in the ponds increases, 
the net primary production (NPP) of phytoplankton 
is reduced due to top-down control, although 
NH 4 + increases due to bivalve excretion. The model is 
not simulating macroalgae or other aquatic plants, 
which if added would significantly reduce the output 
of ammonia. 

The ASSETS score is best at the higher oyster den¬ 
sity, but at a density of 10 oysters m -2 , the shrimp 
culture cycle would yield a harvestable oyster biomass 
of over 500 kg to provide an annualized extra income 
of over 10,000 USD. The removal of algae and detritus 
by the filter-feeding bivalves corresponds to three 
PEQ per year, about 15% of the discharge. At the 
higher densities, the bivalves are performing a 
bioextractive function and do not reach market size 
in a short cycle due to food depletion. A more detailed 
analysis can be made by simulating oyster growth 
continuously over multiple cycles of shrimp culture. 
Such interactions can also be modeled for other com¬ 
binations, for example, tilapia and shrimp or salmon 
and mussels. 

Models for Disease Spread and Control In deter¬ 
mining the carrying capacity of aquaculture opera¬ 
tions, it is important to ensure aquaculture 
production practices and systems within a farm, man¬ 
aged region, or zone are resilient to the effects of dis¬ 
ease. Modeling provides a means of investigating the 
interactions occurring among the four pillars and the 
spread and establishment of pathogens; however, to 
date, most models have ignored the influence of society 
on aquatic disease. Many different models have been 
derived to investigate the spread and impact of partic¬ 
ular pathogens: In aquatic systems, two of the most 
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Analysis of IMTA production and externalities by means of the POND model 


O 

CO 

CD 

=7 

< 
CD 
c n 


common approaches are (1) compartment-based 
models and (2) network models. 

Compartment-based models assume that individ¬ 
uals transition through a series of states from suscepti¬ 
ble (S) to infected (I), and potentially back, or through 
a series of other states such as resistant (R). These 
models are often referred to as SIR models [46] and 
are usually based on continuous time, and therefore use 
differential equations; however, stochastic and discrete 
time approaches are also used. SIR models do not track 
individuals but assume the population follows a set 
of behavior rules, and that at each point in time, 
a proportion of the population leaves one state to 
enter another according to these rules. In aquatic sys¬ 
tems, these models are generally used to track disease 
through a population of animals, but they have also 
been applied to look at spread through a population of 
sites. Simple implementations are often analytically 
tractable, allowing conditions under which thresholds 
and equilibria occur to be found without the need to 
run simulations. 


One of the key pieces of information that can be 
derived from these models is a maximum (susceptible) 
host carrying capacity for which a pathogen cannot 
persist. Such carrying capacities used in the context of 
pathogens are often referred to as a critical threshold 
(N t ) and may be useful to wildlife and farm managers 
when attempting to control or prevent disease. This 
threshold is however largely dependent on the assump¬ 
tions made regarding the way a pathogen is transmit¬ 
ted, and obviously does not apply if transmission is not 
dependent on host density, but is frequency based (as is 
often assumed to be the case when modeling sexually 
transmitted infections). It is therefore important that 
the correct form of transmission is used in a disease 
model to avoid false inference. The most commonly 
used form of model assumes that transmission occurs 
by contact between animals or sites, that the number of 
contacts is density dependent, and that mixing between 
individuals occurs equally and at random. This is often 
referred to as a “density-dependent transmission” or 
“pseudo mass action” model [47, 48]. 
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Under these assumptions, one method by which N T 
can be derived is by determining the conditions under 
which the basic reproductive number, RO, is equal to 1. 
In simple terms, RO is the number of secondary infec¬ 
tions that arise if an infected individual enters a wholly 
susceptible population. If greater than 1, the pathogen 
will establish; if less than 1, it will die out. RO is 
governed by the total population density, the period 
over which an infected animal sheds pathogen, and the 
transmission rate (3 (the rate of contacts between indi¬ 
viduals and that the probability that given a contact, 
infection occurs). Both environmental conditions and 
management/husbandry decisions can influence this 
number, and therefore the critical threshold for estab¬ 
lishment. Understanding the influence of each of the 
four pillars of carrying capacity on RO is therefore 
critical in order to predict and manipulate N T . Omori 
and Adams [49] illustrate the application of compart¬ 
ment-based models to assess the influence of the ecol¬ 
ogy and production pillars on the dynamics of Koi 
herpesvirus in farmed carp populations (Cyprinus 
carpio). Their approach showed that temperature and 
on-farm production processes used in conjunction 
with this could be used to immunize a population, 
preventing clinical disease expression. 

The assumption of random mixing of a population 
is often not reasonable, as in reality some individuals 
will be more social or isolated than others, and 
individuals tend to have discrete groups of contacts 
that may or may not be connected to other groups 
within a population. Under these circumstances, the 
assumption of random mixing can lead to substantial 
overprediction of the epidemic process. Social network 
analysis (SNA) and modeling approaches provide 
a means of incorporating the contacts that occur 
between individuals, and therefore the consequence 
this may have on pathogen spread. In the case of 
agriculture and aquaculture, these models tend to be 
based at the level of the site, rather than the animal 
in question. 

A major advantage of this modeling approach 
is that much epidemiologically useful information 
can be obtained merely by examining the network 
properties, without parameterizing for a particular 
disease. For example, in order to develop generic sur¬ 
veillance and control strategies, it may be useful 
to identify: 


• Clusters of connected individuals within the net¬ 
work and whether it is possible to make them epi¬ 
demiologically distinct through the removal of 
a few connections. 

• Long chains of connection that join the network 
throughout and thus allow disease spread. 

• Super-spreaders, which are individuals that contact 
a high number of other individuals and can thus 
rapidly spread pathogens. 

• Super-sinks, which are individuals that receive from 
a lot of other individuals. Though they may not 
spread a pathogen, they are most likely to receive 
one and may therefore be useful for surveillance 
purposes. 

Many statistics can be generated to summarize dif¬ 
ferent properties of a network. Although RO can also be 
estimated, other statistics such as the degree of central¬ 
ity or clustering coefficient may provide more useful 
means of assessing the likelihood of spread through 
a network. Green et al. [50] demonstrated the applica¬ 
tion of such statistics using SNA applied to the Scottish 
network of trout and salmon farms. The analysis 
showed how much transmission was likely to occur in 
the network as it stood but also used a variety of 
methods to remove the most influential connections 
to reduce transmission. 

In addition to examining the network properties, 
stochastic simulations can be run over the network. In 
such simulations, the network is randomly seeded with 
infected individuals and, at each subsequent time 
point, connected individuals change state from suscep¬ 
tible to infected at a given probability and given the 
likelihood of a contact. This approach is often useful 
when evaluating rates of spread and the effectiveness of 
control strategies but also has the potential to be used 
as a real-time tool during an outbreak, if it can be 
parameterized for the pathogen of interest. Further 
useful information may be gained from these models 
by making network models spatially explicit, as this 
facilitates the designations of control zones. 

Werkman et al. [51] used stochastic network simu¬ 
lations to good effect to determine the efficacy of dif¬ 
ferent fallowing strategies in eradicating disease from 
areas producing different amounts of fish. Thrush and 
Peeler [52] used a simulation of movements over the 
English and Welsh trout industry network to 
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investigate the potential spread of Gyrodactylus salaris. 
This demonstrated that in 95% of the simulations run, 
less than 63 of 193 river catchments would be at risk of 
getting the pathogen in the 12 months following intro¬ 
duction. Jonkers et al. [53] developed this model fur¬ 
ther into a strategic tool that could be used to evaluate 
the effectiveness of different control and surveillance 
efforts on disease spread. 

One of the major limitations of network 
approaches is that they rely on knowledge of the com¬ 
plete network of connections and assume that this 
remains stable over time. Missing connections or 
changes to the network with time could lead to 
misdirected efforts. For many aquatic systems, reliable 
network data are not available or are difficult to com¬ 
pile. Under these circumstances, simpler compart¬ 
ment-based approaches may still be of value in 
informing control policies. One such application was 
demonstrated by Taylor et al. [54] to evaluate the 
effectiveness of different control options in reducing 
the spread of Koi herpesvirus in the UK between sites 
holding carp. 

Most models applied to aquatic systems to date 
tend to be based at either the level of the site or animal, 
with few attempting to combine the two approaches. 
There is, however, substantial scope to develop future 


models that incorporate multiple levels that account 
for transmission between individual animals in a unit, 
the influence this has on the transmission between 
units on a farm, and the subsequent effect on 
between-farm spread (Fig. 11). 

Where attempts to link these two levels together 
have been made, useful results have been obtained. 
Hydrodynamic models have been applied to look at 
the spread of sea lice between Norwegian salmon 
cages in fjord systems [55]. These models track the 
number of lice generated and monitor their dispersal 
and decay over time and space to see whether they 
reach and infect other sites. Green [56] combined com¬ 
partment-based and network approaches to incorpo¬ 
rate the influence of within-site processes on network 
spread and found that differences in site biomass 
influenced the rate of between-site transmission if den¬ 
sity-dependent transmission was assumed. One of the 
major advantages of models that link the within-site 
epidemic with between-site spread is that they allow for 
the consequences of different detection (or action) 
rates to be assessed, allowing more efficient resource 
planning. 

The primary goal of all of the approaches described 
above is often to apply changes to the system to under¬ 
stand their influence on pathogen spread. In 
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Network models working at different scales in time and space 
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application to aquaculture systems, this may be 
conducted to establish how best to maximize N T 
(either in the number of animals produced within 
a site or the number of sites in an area) and thus 
increase carrying capacity. Generally speaking, N T can 
be increased by reducing the amount of connectivity 
between individuals (restrictions in movements), 
changing their susceptibility to disease (e.g., by vacci¬ 
nation) or reducing the period over which an individ¬ 
ual is able to transmit a pathogen (e.g., through 
good surveillance and rapid culling). Other important 
disease management tools that have or could be 
evaluated by the modeling approaches described 
above include: 

• The use of management areas in which aquaculture 
activities such as harvesting or treatment are coor¬ 
dinated between all farms in an area 

• Fallowing of sites between production cycles to allow 
pathogens present to die out and potentially reduce 
the infection pressure other sites are exposed to 

• Year class separation, where only one age class is 
present at a time causing all animals to be harvested 
prior to stocking new animals 

• Removal of high-risk contacts between sites that are 
likely to spread disease widely 

• Biomass limits to reduce the maximum amount of 
pathogen that could be discharged from a site 

• Minimum distances between sites to reduce trans¬ 
mission via hydrographic connectivity 

Building a Framework 

Screening Models and Research Models The models 
reviewed above address different components of aqua¬ 
culture carrying capacity, focusing mainly on produc¬ 
tion and environmental effects, including disease 
aspects. Additionally, they consider different scales in 
time and space and range from statistical models to 
spatially discrete representations, which may (e.g., 
hydrodynamic models) or may not (GIS) be time vary¬ 
ing. It is useful in this context to distinguish between 
screening models and research models. 

Screening models typically use a limited set of 
inputs and provide highly aggregated outputs, such as 
an index of suitability or an environmental scorecard. 
Examples include a comparison of ammonia excretion 
by caged salmon compared to tidal flushing [57] and 


the production of biodeposits compared to tidal 
removal for suspended mussel culture [23]. 

Models of this type (e.g., FARM, [4]; POND; Fig. 9) 
are easy to use by the management community and 
provide a quick diagnosis for a specific site, or a generic 
overview comparison for multiple areas. 

Models that are complex to apply (and usually 
lengthy and complex to develop and implement) may 
be termed research models and are of limited practical 
use in day to day management. Partly, this is due to 
the knowledge required for parameterization, volumes 
of data involved, and substantial requirements in 
processing and interpreting results. This does not 
mean that the results of such models do not have 
a clear application for management, but operating 
them may be beyond the scope of many institutions. 

The concept of a single overarching model, able to 
provide answers across a range of space and time, has 
been shown repeatedly to be unsound. Just as software 
suffers from feature creep [58, 59], stand-alone models 
tend to become increasingly overparameterized, partly 
in an effort to better match reality and partly in 
an attempt to solve per se what should really be 
approached with a combination of models. 

A more promising alternative is to combine GIS, 
dynamic models, network models, and remote sensing 
tools as appropriate to deal with questions that range 
from the impact of a HAB event on a salmon cage at the 
scale of days to the economic success of 10 years of 
mussel farming. 

An increase in model complexity does not neces¬ 
sarily equate to increased accuracy [60], whether in 
physical or biological models. In both cases, the scale 
at which predictions may be made is limited by our 
incapacity to accurately predict the weather. As 
a consequence, model resolution and accuracy are lim¬ 
ited by key drivers such as river flows, salinity patterns, 
and water and air temperature, which impact meta¬ 
bolic rates, algal blooms, turbidity, spawning, or larval 
dispersal. 

Models in the Context of Integrated Coastal Zone 
Management Carrying capacity models for aquacul¬ 
ture are useful in their predictive capability and there¬ 
fore in management of both cultured biomass as well as 
location. Spatially resolved models are particularly 
notable in this context. Within the catchment, this 
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maybe addressed by means of GIS (see, e.g., [61]) and 
can be enhanced through a combination of dynamic 
simulations and spatially resolved models. 

Aguilar-Manjarrez and Nath [89] performed an 
extended analysis of the potential for fish aquaculture 
in Africa, based on GIS models, using a resolution of 
3 arc minutes (25 km 2 at the equator). For small-scale 
operations, suitability was based on water require¬ 
ments, soil and terrain, availability of feed inputs, and 
farmgate sales projections. This analysis was also car¬ 
ried out for larger, commercial farming and concluded 
that for the three species considered (Nile tilapia, 
African catfish, and carp), about 23% of the area of 
continental Africa scored very suitable for both types of 
farming. These authors did not explicitly consider envi¬ 
ronmental effects of fish farming, probably because 
that analysis is best performed regionally or locally, as 
part of detailed site selection, but the modeling tools 
for addressing these impacts are available today. 

GIS models have also been combined with remote 
sensing to assess aquaculture opportunities, for exam¬ 
ple, for crab and shrimp in the Khulna region of 
Bangladesh [62]. In this case, the focus was on gross 
production, economic output, and employment, and 
discussed species suitability with respect to factors such 
as salinity distribution and freshwater availability. Once 
again, the environmental externalities of the different 
types of culture were not included but can be modeled 
to provide a more complete picture for decision sup¬ 
port, including externality costs. 

Figure 12 shows the Jaguaribe estuary, in the state of 
Ceara, northeast Brazil. As part of the application of the 
ASSETS model to determine the eutrophication status 
of the estuary (Eschrique and Braga, personal commu¬ 
nication, 2011), the nitrogen loading from 1,200 ha of 
shrimp farms, located between the city of Aracati and 
the dunes fringing the beach, was determined using 
POND. 

The contribution of shrimp farming to the nitrogen 
load was estimated to be of the order of 60 t year -1 , 
roughly equivalent to 20% of the total loading, or 
about 15,000 PEQ. 

Screening models of this kind are simple to use and 
help water managers in developing countries to address 
the challenge of multiple uses and nutrient sources to 
the coastal zone and to make better decisions with 
respect to site selection and waste treatment. They can 


also be included in a more general catchment modeling 
approach, combining their outputs with hydrological 
models such as the Soil and Water Assessment Tool 
(SWAT) model (e.g., Nobre et al. [11]). 

In open water, including semi-enclosed systems 
such as estuaries and bays, because spatially resolved 
models incorporate physical circulation, they are 
potentially useful for other aspects of coastal zone 
management. 

There are of course a variety of possibly impactful 
activities in the coastal zone including eutrophication, 
fisheries and fish processing, contaminant input, 
resource extraction, and transportation. There are 
inevitably competing uses of the water and bottom, 
and making predictions about aquaculture in isolation 
is insufficient. Water and sediment quality standards 
applied over large spatial scales help to apply objectives 
that may be seen as ecosystem-based management. The 
implementation of these objectives is achieved through 
marine spatial planning. The implication of planning is 
that predictions can be made to anticipate overlaps, 
conflicts, and cumulative impacts of various activities. 

Although models exist for these other activities, 
particularly those acting through eutrophication, the 
models need to interact coherently, and their outputs 
should be tied together in a way that is useful for 
management decisions. Two essential features of this 
integration are physical models that unify the transport 
of materials common to almost every process in the 
ocean and GIS to maintain data layers. The physical 
model establishes the spatial domain which can be 
either conserved, collapsed, or expanded within the 
GIS. At present, models run within GIS are necessarily 
simple and based on averaged values. For example, the 
AKVAVIS tool developed in Norway [63] utilizes a GIS 
with calculation of shellfish growth as well as fish farm 
effects on water quality at any location based on an 
inline model using depth, temperature, and other spa¬ 
tially located characteristics within the GIS [63]. 

Tironi et al. [24] utilized an open source circulation 
model to predict far-field deposition for Chilean 
salmon culture. An essential feature of their work was 
a GIS interface used to depict other coastal activities in 
light of hydrodynamics and dispersal of farm waste. 
Nobre et al. [11]) combined models of watersheds, 
aquaculture, and eutrophication to look at mitigation 
strategies for improving nutrification in Chinese bays. 
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Carrying Capacity for Aquaculture, Modeling Frameworks for Determination of. Figure 12 

The Jaguaribe estuary, in Ceara, NE Brazil. Shrimp farms occupy an area of about 12 km 2 and discharge the effluent into 
the estuary 


In marine systems with aquaculture but lacking 
other activities (e.g., some Norwegian fjords), the cul¬ 
ture model may form the basis of the decision support 
system. In Lysefjord (southern Norway), a pump was 
used to transport water to depth where it enhanced 
diffusion of nutrient from below the thermocline to 
the euphotic zone [90]. The increase in primary pro¬ 
duction was used as a food source for cultured mussels. 
A model was employed to determine the best location 
for the upweller as well as the mussels in order to 
benefit from increased primary production, balancing 
the increase in new production with mussel removal to 
maintain chlorophyll within its natural limits. This 


example is of interest because the ecosystem was truly 
managed in terms of bottom-up nutrient supply, new 
production, grazing in terms of mussel culture, and 
marine spatial planning optimized for the extent and 
level of shellfish culture. 

As more types of activities are added to decision 
support tools, one approach is to use GIS as a wrapper 
for model outputs produced as layers. Decision sup¬ 
port comes from ancillary software with features such 
as portrayal of alternative land uses, exclusion of 
protected areas, weighting of valued features and hab¬ 
itats, and economic analyses. Several initiatives have 
been undertaken in this vein, with the incorporation 
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of simulation models mostly in the initial stages of 
progress. Examples which have freely available ArcGIS 
extensions include NatureServe Vista (natureserve.org) 
and Marine InVEST (naturalcapitalproject.org). There 
are different emphases in the various projects, includ¬ 
ing conservation, community engagement, ecosystem 
services, and land use, in addition to water quality 
issues. 

Among these, Marine InVEST seeks to maintain 
ecosystem services in the context of activities such as 
fisheries, aquaculture, renewable energy, and recreation 
[64]. Input information ranging from oceanographic 
data to species distribution is used in various models to 
consider outputs in terms of ecosystem services pro¬ 
vided, traditional model results (e.g., water quality), 
and socioeconomic valuation (Fig. 13). 

We note the importance of models other than those 
based on eutrophication and water quality. For exam¬ 
ple, some of the same spatial data used to model aqua¬ 
culture impacts can be used to predict the location of 
sea grasses or species at risk [65, 66]. Protected areas or 
buffer zones may then become part of the plan. In this 


way, critical habitat and biodiversity are also consid¬ 
ered along with aquaculture siting. Moreover, these 
decision-support frameworks can be used in public 
forums to incorporate community input, as has been 
done with protected area planning via MARXAN GIS 
[67] . In practice, these are few examples of models used 
to this extent, but it provides a very useful management 
shell in which to consider aquaculture submodels. 

The Other 50% of the Problem At present, two 
major components of carrying capacity, the social and 
governance aspects, are not amenable to mathematical 
modeling (Fig. 5); they are nevertheless fundamental 
areas for aquaculture management. 

The social pillar, from the perspective of social 
acceptance of aquaculture and, in particular, of the 
definition of limits to industry growth, has been 
discussed, for example, by Byron et al. [18]. Stake¬ 
holder dialogue, understanding of terms and concepts, 
and the simple fact that opinions can be voiced during 
the decision-making process are major contributors 
to generate consensus. The tools employed include 
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Models for different purposes are linked through input data layers that are used to build scenarios about management 
actions (e.g., restoration of eelgrass, increase in aquaculture) and climate change (e.g., sea-level rise, water temperature) 
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questionnaires, meetings, and presentations of differ¬ 
ent sides of the issue. Simulation models can help 
inform those discussions by, for example, providing 
quantitative data on development scenarios but social 
positions often have a strong emotional component. 

The governance pillar plays a major role in the assess¬ 
ment of carrying capacity. Table 1 shows key aspects of 
the human interaction in aquaculture and highlights 
the consequences of poor practices and governance. 

The issues identified have consequences that vary in 
severity, from a reduction in revenue and profit to 


major outbreaks of disease and the loss of aquaculture 
resources across large areas. 

Because the effects of human mismanagement can 
be so far reaching, it is important to discuss to what 
extent modeling frameworks can be used to assist the 
farmer and the water manager. The eight topics in 
Table 1 fall under the heading of governance (here 
taken to include self-regulation by farmers and farmers’ 
associations, as well as legal instruments, policy, 
and implementation), but only three (dark blue) 
can benefit directly from recommendations from 
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Carrying Capacity for Aquaculture, Modeling Frameworks for Determination of. Table 1 Key issues in culture practice, 
time frames, and potential consequences (color coding reflects the extent to which models can be applied: black - 
substantial; dark red - reasonable; light red - inapplicable) 


N° 

Topic 

Time 

frame 

Issues/consequences 

Examples 

1 . 

Species 

selection 

Prior to 
cultivation 

Imported exotics, disease, proliferation 

Pacific oyster, now considered invasive in 
the Netherlands; Perkinsus (dermo) and 

MSX, U.S. East Coast 

2. 

Seed 

purchase 

Start of 
cultivation 

Disease from imports, stable broodstock 

Herpes virus spreading in oysters across 
Europe 

3. 

Relaying 

Variable 

Disease spread 

Transmission of Herpes in oysters across 
Europe, ISA in Chilean salmon 

4. 

Stocking 
(farm or 
pond scale) 

Critical 

periods 

Overstocking can lead to: mass mortalities 
due to dissolved oxygen depletion, other 
environmental factors, and stress-related 
disease outbreaks 

Whitespot Syndrome Virus (WSSV) in 
Penaeid shrimp, Perkinsus in clams 

5. 

System- 

wide 

carrying 

capacity 

Months to 
years 

If carrying capacity is significantly 
exceeded: harvest reductions, disease 
outbreaks, economic hardship, system 
collapse, long recovery cycles, long-term 
loss of resource 

Marennes-Oleron, France, longer oyster 
culture cycles; Sacca di Goro, Italy, mass 
mortality of Manila clam; Qinshan, Fujian 
province, China, 50,000 fish cages (yellow 
croaker), mass mortality 

6. 

Feeding 

practice 

Culture 

cycle 

Ecologically damaging practices with 
ecosystem consequences, e.g., overfeeding 
of caged fish and benthic hypoxia 

Fertilization of intertidal/subtidal areas in 
China to promote seaweed growth, 
increasing yields of commercial products 
(seaweed, gastropods); use of juvenile fish 
as fish meal for cage culture in parts of Asia 

7. 

Spatial 
distribution 
of culture 

Culture 

cycle 

If inappropriate, e.g., by combining year 
classes in shellfish, more labor-intensive, 
greater impacts on the ecosystem, e.g., 
through sediment reworking 

Clam culture in southern Portugal 

8. 

Lease 

structure 


Fragmented/inexistent lease structure, 
smallholdings governance issues, due to 
multiple stakeholders 

Pond culture in Thailand (average 
freshwater pond: 0.28 ha); Clam culture in 
southern Portugal (average lease: 0.15- 
0.5 ha) 
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simulation models, addressing stocking density and 
feeding practices. 

Five of the issues identified introduce disease- 
related problems, and whereas procedures such as 
relaying, or seed import from contaminated sources, 
are strictly within the remit of good governance, 
models can to some extent assist in predicting the 
likelihood (i.e., risk), spread, and establishment of dis¬ 
eases, if present within an aquaculture area. 

Regulations enacted to control spread of diseases 
can directly mandate what species and culture prac¬ 
tices, stocking densities, etc., are permitted within 
a zone/region. Firstly, this would include planning 
applications assessments that would consider, among 
other factors, how siting a farm may potentially 
adversely affect other aquaculture operations located 
in that area, as well as potential effects on wild animal 
populations. In some cases, e.g., in Chile, regulations 
have been enacted that specify minimum distances 
between farms. 

There is also an increasing move by some large-scale 
industries, as part of developing common codes of 
practice, to manage farms to minimize the effects of 
disease on production. These codes can include 
restricting stocking densities, production on farms, 
specifying minimum allowable distances between 
farms, and introduction of mandatory fallowing 
periods between production cycles. All these are mea¬ 
sures that would theoretically constrain carrying capac¬ 
ity, at least in the short term, but might well increase 
the overall sustainability of the industry in the zone or 
region in the longer term. Although these may not be 
enforced by government regulations when first devel¬ 
oped (e.g., they are often “voluntary codes”), those 
signing up may be entering legally binding agreements. 
These codes may then be used as the basis to develop 
more formal regulatory frameworks in the future at the 
local, national, and even supranational level. 

Regulation can also influence the carrying capacity 
of a zone or region both directly and indirectly. In 
particular, to help control the spread of diseases 
between countries, the OIE aquatic animal code [91] 
lists a number of infectious aquatic animal diseases that 
are generally considered untreatable, pose a significant 
threat to aquaculture and/or wild fish populations, and 
are not widely distributed (e.g., are exotic to most 
countries where the species they affect are farmed). 


Countries and regions also have laws and regulations 
to prevent the entry and, where pathogens do gain 
entry, specify measures for their control and eradica¬ 
tion. For instance, Directive 2006/88/EC lays down, for 
European Member States, the required minimum ani¬ 
mal health requirements, disease prevention, and con¬ 
trol measures for aquaculture animals and products. 

Eradication of a disease from a country or zone may 
involve extensive depopulation of affected farms and 
not restocking them until it is confirmed that the risk 
from disease is significantly reduced. This has obvious 
short- and long-term consequences on the carrying 
capacity of the affected species in the affected areas. 
For example, following detection of the notifiable viral 
disease Infectious Salmon Anemia (ISA) on an Atlantic 
salmon farm in Chile in 2007, the disease spread 
throughout the salmon industry [92]. The effects were 
dramatic. It is estimated that salmon production in 
2010 was little more than 98,000 t, down from 
386,000 t in 2006 [93]. With the 1.5-2.5-year produc¬ 
tion cycle for salmon, these effects will be felt for years 
to come, with estimated smolt release in 2009 only 
10% what it was in 2007 [94]. It is estimated that, 
compared to production in 2007, levels will be reduced 
cumulatively by at least 700,000 tons during the period 
2009-2011, with their value reduced by more than two 
billion USD. 

Disease controls can restrict the types of aquacul¬ 
ture activity that are permitted, or are practically pos¬ 
sible, within a zone or region. Here, the constraints to 
carrying capacity are often opportunity costs, with fish 
not reared in a particular area because of potential 
disease risks. The effects can be quite subtle. For 
instance, in the UK, controls in place to restrict the 
spread of bacterial kidney disease are considered by 
some sectors of the rainbow trout farming industry to 
adversely affect their operations at the expense of the 
larger Atlantic salmon industry that has generally 
supported the maintenance of these controls. This 
would also include the effects of restrictions placed on 
the movements of live fish and ova between countries 
that limit the opportunities to farm those species in 
other countries. 

Finally, societal acceptance of aquaculture activities 
may be influenced by disease risk concerns, particularly 
to wild fish stocks. This is perhaps best illustrated by sea 
lice spread to wild fish. There is additional public 
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concern as to the effects on the environment of use of 
chemical treatments. In particular, there is strong resis¬ 
tance to fish farming in many remote and pristine 
environments due to fears they may adversely affect 
local ecosystems. Increased public awareness of welfare 
issues in aquaculture may also lead to practices that 
reduce the incidence of disease as a consequence. 

Blueprint for a Model System 

There are considerable challenges in selecting and com¬ 
bining different types of models, each of which plays 
a particular role, for carrying capacity assessment. 
A number of models are available, many of which 
reviewed above, but the level of usability varies widely, 
as does the capacity of one model to “speak” to another. 
This comes about through the “one tool” syndrome: If 
your only tool is a hammer, all your problems are nails. 
One tool, i.e., one model, may be fine for a particular 
level of analysis, but it will be incapable of doing all 
the work. 

For instance, a local-scale model such as 
DEPOMOD [36], MOM [68], or FARM [4] will need 
current velocity fields supplied either by field measure¬ 
ments or by other types of models. Equally, it will not 
provide a robust answer with respect to system-scale 
carrying capacity. Even at the farm scale, such models 
do not consider the interactions among farms, 


although the siting of a new farm in a region will 
inevitably affect environmental conditions and pro¬ 
duction in existing farms. These changes will then 
feed back, so that the original farm-scale assessment 
will change because the model input data will change. 

Figure 14 illustrates this difference for mussel cul¬ 
ture in Killary Harbour, Ireland, by comparing two 
models at differing scales [35]. EcoWin2000 (E2K) is 
an ecological model applied at the system scale, 
whereas FARM simulates production and environmen¬ 
tal carrying capacity at the local scale. Both models can 
be used to perform a marginal analysis [4] to determine 
stocking densities that lead to optimal profitability. 
This is extremely useful for licensing purposes since 
farms often maximize income rather than profit (i.e., 
aim for the highest TPP). It can be seen that for 
a coastal or semi-enclosed system, there appear to be 
significant differences between the results for the sys¬ 
tem-scale model, where the dotted line indicating 
highest TPP (which exceeds the seeding density of 
maximum profit) occurs at a density 7-8 times greater 
than the present situation. FARM, however, which 
deals only with the local scale, determines the end of 
stage 2, i.e., highest TPP, as around XI5 density. This is 
because (1) E2K runs multiple production cycles, typ¬ 
ically for periods of 10-20 years, i.e., the ecosystem 
model reports what is actually harvested, whereas 
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Carrying Capacity for Aquaculture, Modeling Frameworks for Determination of. Figure 14 

Impacts of different mussel-stocking densities on total physical product (TPP, harvestable biomass) and average 
physical product (APP, the ratio between seeded and harvested biomass) as simulated by EcoWin2000 (system scale, 
per unit of cultivated area) and FARM (single farm) for Killary Harbour, Ireland (Adapted from [35]) 
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FARM reports what is harvestable over one cycle, and 
(2) FARM does not account for interactions among 
farms, whereas E2K considers the farms in the whole 
water body. 

The following set of properties is desirable when 
assembling a modeling system to address aquaculture 
carrying capacity: 

1. No single model solves all problems. Overparame¬ 
terization and code bloat only make matters worse. 

2. Models should only be as complex as the problem 
requires. In other words, as simple as possible. 
Increased computational power is no excuse for 
unnecessary complexity. 

3. Models should be able to work independently, pre¬ 
sent value in doing so, and add further value when 
working in conjunction with other models. 

4. Models should define exactly what problems they 
can address, as part of the overall questions for an 
ecosystem, rather than the opposite. 

5. Any model in the system must be able to receive 
input from data or from other models and be able 


to supply outputs in a form that can be easily used 
by other models. 

6. Different models are appropriate for different scales 
in space and time. Carrying capacity assessment 
may require scales as short as a tidal cycle (e.g., for 
intertidal culture of clams) and as long as a decade 
(e.g., for coupling ecological models with economic 
models). 

7. Models share a challenge with field sampling with 
respect to the conversion of data (measured or 
modeled) into information that is useful for man¬ 
agers; the use of screening models, or other 
approaches that help to distil data into mean¬ 
ingful information, is a vital component of 
any system. 

Figure 15 shows an example of such a multi-model 
framework for investigating the role of bioextraction by 
oysters, clams, and ribbed mussels as part of a nutrient 
control strategy for Long Island Sound. The various 
models chosen conform to the seven properties 
described above. 
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Modeling framework: the REServ shellfish bioextraction project. Long Island Sound, USA (NOAA/EPA REServ Project) 
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Previous applications of this kind of framework 
include the SMILE project in Northern Ireland 
[19] and the SPEAR project in China ([69], Nobre 
et al. [11]). 

A different kind of multi-model framework has 
been developed by Kapetsky et al. [70], through the 
application of various types of remotely sensed 
data in a GIS system, incorporating variables such 
as sea surface temperature, current speed, and chloro¬ 
phyll a. In the example shown in Fig. 16, current 
speeds of 0.1-1 m s -1 are combined with suitable 
depth ranges for fish cages and shellfish longlines. 
High current speeds are problematic for both culture 
structures, due to excessive hydrodynamism, and fin- 
fish production, due to higher metabolic costs and 
lower yields; very low current speeds may cause 
sediment organic enrichment and quality loss for cer¬ 
tain species. 

Quantification of suitable areas indicates that 
123 countries have at least 100 km 2 within their exclu¬ 
sive economic zone that meet these criteria. 


For an annual production of 1 kg m -2 year -1 , this 
corresponds to 10 7 t year -1 of aquatic products. This 
analysis has been further refined by considering dis¬ 
tance to port, a key economic consideration for off¬ 
shore aquaculture, which will considerably reduce this 
estimate. The same applies to other factors, such as 
marine protected areas. An assessment of spatial poten¬ 
tial (a component of production carrying capacity) was 
executed at a finer scale for selected areas in Canada, 
Chile, Ireland, and Norway (Fig. 17). For a number of 
specific locations (farms) in these four countries, the 
time required for growing an adult salmon was then 
determined using a dynamic model [71], forced by 
remotely sensed sea surface temperature. In this case, 
the model end point was a particular fish biomass, 
rather than a specific cultivation period. 

It is easy to see how this kind of coupling can be 
leveraged, for example, by introducing the effect of 
current speeds on metabolic requirements, using eco¬ 
nomic models to analyze the trade-offs between bio¬ 
mass and market price (lower yield, higher product 
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World ocean areas suitable for offshore aquaculture (From [70]) 
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Areas with Potential for Atlantic Salmon and Locations of Atlantic Salmon Farms in the Gulf of Corcovado, Chile 
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Areas in Chiloe Island, Chile, with temperature apt for Atlantic salmon mariculture (22-32°C) and depths (25-100 m) and 
current speeds (10-100 cm/s) apt for submerged cages (From [70]) 


quality), and using models such as FARM to quantify 
environmental effects, considering factors such as 
current speed and depth of site, both of which are 
readily available. 

It is clear from this discussion that different model¬ 
ing products and services are required for different 
markets. Water managers or planners may for instance 
be concerned with bay-scale carrying capacity, taking 
into account the multi-sectorial context of aquaculture, 
i.e., integrating ecological models with marine spatial 
planning. Those types of models will help inform deci¬ 
sions on the conservation of wild populations and 
biodiversity and also provide support for licensing, 
legislative compliance, and source apportionment for 
managing pressures. 

On the other hand, a farmer will probably be more 
focused on optimization of production, improvement 
of feed conversion ratios, and business profits. But 
increasingly, the environmental footprint of the farm, 
together with potential positive impacts, such as the 
role of shellfish in reducing eutrophication symptoms, 
will be of interest. In the European Union, with legis¬ 
lative requirements for Good Ecological Status by 2015 


in all EU water bodies and Good Environmental Status 
in all marine waters by 2020, some of the modeling 
approaches and tools described above will undoubtedly 
be in demand to perform self-assessments and to 
resolve conflicts. 

As models of disease risks evolve, and become better 
integrated with other types of models, these too will be 
in demand for siting decisions. 

Some of these models will be products, some will 
be services, and some will be a combination of both. 
The ways in which those markets develop and the 
future directions in which the science and technol¬ 
ogy may evolve will be discussed in the final section of 
this entry. 

Future Directions 

Support for an Ecosystem Approach to Aquaculture 

Application of the EAA on a worldwide scale requires 
the harmonization of (1) environmental, (2) social, 
and (3) multi-sectorial planning objectives [8]. These 
three principles and their relative weights differ sub¬ 
stantially across world regions, and it is not feasible 
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from a social and political standpoint to establish uni¬ 
form compliance with respect to limits and thresholds. 

The only solution is to define appropriate 
approaches, which within particular world regions 
define a gradient in relative terms, assessing EAA in 
terms of the principles stated above. The three princi¬ 
ples of EAA can be mapped onto the four pillars of 
carrying capacity and illustrated as the overlap of these 
[16]. The importance of each theme represented will 
vary regionally and will evolve through time based on 
societal cues. 

It is clear that aquaculture in Europe and North 
America is more closely aligned with EAA, and the 
present effort in marine spatial planning will only rein¬ 
force that. It is also clear that aquaculture production 
in developed countries will grow little, and that con¬ 
sumer demand in these regions is satisfied in large part 
through imports (Fig. 18). 

In the United States, 84% of aquatic products are 
presently imported, of which 50% are from aquacul¬ 
ture. This has resulted in a nine-billion-USD seafood 
trade deficit [82]. Table 2 presents a summary of the 
main issues that are presently considered in aquacul¬ 
ture carrying capacity and site selection, together with 


what may constitute future components for assess¬ 
ment. Models play a key role. 

Since developing nations supply the bulk of farmed 
aquatic products, will continue to do so, and will prob¬ 
ably increase their contribution, it is critical that state- 
of-the-art tools are available to ensure sustainability in 
the countries where they are most needed. 

Which developing countries and which types of 
culture should be priorities for the application of 
such models? From an EAA perspective, those that 
have the highest impact on the environment are the 
most promising candidates. Kapetsky et al. [73] used 
FAO production statistics at country environment level 
(freshwater, brackish water and marine) to estimate the 
intensity at which aquaculture was practised in each of 
those environments. A knowledge of the species being 
cultured can reveal the production systems and their 
associated impacts in a very general way. The approach 
outlined above can be refined to focus more closely on 
modeling needs by considering the potential impacts 
by species and culture systems in countries in which 
production data by species are reported. 

Dissemination of models can be passive (e.g., 
packages freely accessible via the Internet) or active 
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Trade flows of aquatic products into Europe [72] 
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Carrying Capacity for Aquaculture, Modeling Frameworks for Determination of. Table 2 Novel management 
approaches (Adapted from [10]) 


Topic 

Now 

Tomorrow 

Feed based (cage, pond) 

Site selection based on holding 
capacity (cages), wastewater 

Integrated model systems, risks, welfare, 
and disease. Holistic indicators 


minimization (ponds) 

LCA: inefficiencies and eco-labeling 



Mechanistic and statistical models 



Data assimilation models 

Shellfish farming 

Large areas 

Economic sustainability, ecology and 
economics, disease 


Focus on production and social 
carrying capacity NIMBY, (Not in my 

Coupled GIS expert systems including 
xenobiotics, HAB, etc. 


backyard) NIMTO (Not in my term in 
office) 

Model uncertainties in yield 


Early warning 

Integrated multi-trophic aquaculture 

Optimize production 

Integrated coastal zone management 


Reduce negative externalities 

Simulate species combinations 



Full economic assessment 



Combine GIS, remote sensing, and 
modeling 


(training courses and workshops by region or by coun¬ 
try). In both cases, it is essential to establish the 
technical capacity, level of interest, and financial com¬ 
mitment of the audience and the status of the Internet 
as a communications and data pipeline for technical 
support in each country. The focus should not be on 
developing countries alone because (1) virtual technol¬ 
ogy specialists in developed countries may be in a 
position to aid dissemination and (2) companies 
established in developed countries often have aquacul¬ 
ture operations in developing countries, and could 
therefore also help to bridge the gap. 

Looking into the Crystal Ball 

The Thematic Review “Virtual Tools for Aquaculture,” 
presented at the FAO Global Aquaculture Conference 
in 2010 [10], encapsulates much of our thought on the 
future of models in aquaculture. The final points in this 
entry are drawn from that review. 

The aquaculture industry is going to be affected by 
many different issues and trends over the coming years, 
often operating concurrently, sometimes in unexpected 
ways, and producing changes in the industry that may 


be very rapid indeed. Mathematical models will play an 
important role in addressing many of these, particu¬ 
larly in the following areas: 

1. Information exchange and networking are going 
to accelerate the use of virtual technology and 
decision-making for problem solving to support 
industry growth. Web-based access to real-time 
information will further accelerate this growth. 

2. Links between industry and research centers will 
become more effective responding to objective-led 
demand for virtual technology-driven RTD. 

3. Strategic alliances will need to be reinforced or 
created for the implementation of virtual technol¬ 
ogy for aquaculture in developing countries, for 
example, FAO and WorldFish Center are working 
in many of the same target countries, and this could 
facilitate the transfer of research outcomes on vir¬ 
tual technology to end users. The same applies to 
collaborative research with third countries medi¬ 
ated, for example, by the European Union, the 
United States of America, and Canada. 

4. Many virtual technology tools will need to be more 
production and management oriented. And even if 
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attractive and promising, these tools will have to 
be adapted to local realities and conditions to 
really become useful (and used) in the future. 
This requires a compromise with respect to ease of 
use, data requirements, and scientific complexity. 
Many such tools will evolve from service to prod¬ 
uct, requiring academic developers to accept a loss 
of control in conditions of application, as a natural 
trade-off (and inherent risk) of product maturity. 

A number of key thematic and technical areas 
where models for aquaculture are currently incipient, 
and expected to develop strongly in the next decade or 
so, have been identified below. In all the examples, such 
models will contribute to an overall modeling frame¬ 
work, by integrating and complementing existing tools. 

Disease Disease in cultivated aquatic species is 
a major source of concern, yet disease modeling is still 
relatively underutilized in these systems though its 
worth has clearly been demonstrated in several studies. 
However, while epidemiological theory predicts that 
host density thresholds may be an important part of 
host-parasite dynamics, clear empirical examples are 
rare, as much in aquaculture as in other studied agri¬ 
cultural systems. It is suggested that this is not evidence 
that host thresholds do not exist, rather that statistical 
difficulties arise with confounding factors or inade¬ 
quate data. Evidence here is afforded from much bet¬ 
ter-understood and more data-rich systems, for 
example, human measles [95-97], where there is 
compelling evidence on the effects of host density on 
the likelihood of disease outbreaks taking place within 
populations. 

One of the major limitations in aquatic systems, 
however, is that the routes by which pathogens are 
transmitted and their relative contributions to an epi¬ 
demic are often not understood. If transmission is 
incorrectly specified, there can be substantial conse¬ 
quences to estimates of N T . Additional experimental 
and field data are required to identify and quantify 
transmission routes for many aquatic animal patho¬ 
gens, if prediction of the effects of host density on 
disease incidence is to be made through modeling. 
To facilitate this, increasing emphasis is required on 
the use of real-time data acquisition combined 
with models for real-time analysis and short-term 


prediction of animal welfare. It is expected that such 
systems will become cheaper and more generalized, 
and that some of the indicators and trends will find 
application at longer timescales, albeit by means of 
a probabilistic approach. 

A further potential limitation of the models cur¬ 
rently used is their inability to predict across and link 
scales from the processes occurring within a tank to 
the transmission across and then between sites. 
Methods of investigating disease processes through 
these metapopulation approaches should be investi¬ 
gated further. 

To date, only a few models have been developed to 
simulate pathogenic infections of shellfish with respect 
to physiology, for example, Powell et al. (1996) for the 
American oyster C. virginica , but with widespread con¬ 
cerns about relaying, susceptibility, and mortality, 
models focusing on a more mechanistic approach will 
undoubtedly appear over the next decade. 

Harmful Algal Blooms This is another area where 
little predictive capacity exists, except in the short term 
through the use of operational oceanography, relying 
on bloom identification and tracking. Management is 
at present reactive, and modeling of appearance and 
development of such blooms is in its infancy, due to the 
lack of an appropriate paradigm. Sensors such as 
targeted RNA probes (e.g., [74]), integrating handheld 
devices, or potentially deployed in situ and used in 
a networked framework will both help in early detec¬ 
tion and management and contribute to the under¬ 
standing of the underlying triggers. Considerable 
developments are also expected in remote sensing algo¬ 
rithms able to discriminate (at least) between HAB and 
nontoxic blooms (S. Bernard, personal communica¬ 
tion, 2010). 

Certification and Traceability The arrays of sensors 
that can be deployed at the farm scale to enable coupled 
monitoring and modeling are important for both 
product certification and traceability. The number, 
reliability, and accuracy of underwater sensors will 
increase, and the cost will decrease, both with techno¬ 
logical developments and market growth. 

Real-time data acquisition and interpretation will 
make it possible for consumers to visualize the whole 
“cradle to grave” cycle of an aquaculture product. For 
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instance, a batch of oysters may be “bar coded” to 
reveal the origin of seed and the entire environmental 
interaction over the culture period, including metadata 
and measured data on water quality, HAB events, con¬ 
dition (meat ratio) of the animals, and impact on their 
environment, for example, in terms of reduction of 
eutrophication symptoms through the indirect 
removal of nitrogen and phosphorus, or conversely 
the addition of particulate organic material due to 
biodeposition. Such sensors will typically be queried 
at a sub-hourly frequency, particularly if they are also 
used for welfare monitoring; this will easily allow 
importers, health inspectors, or consumers to perform 
verification and certification, and will provide an 
important contribution to both food safety and envi¬ 
ronmental awareness. 

For the farmer, the existence of this kind of inte¬ 
grated coding will also help improve various aspects of 
culture practice and increase attractiveness of the busi¬ 
ness model to the key sector of insurance. For the 
consumer, such data will need to be presented in 
a comprehensible format, for example, in the form of 
a few indicators (Fig. 19). 



Carrying Capacity for Aquaculture, Modeling 
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QR code for farmed shellfish, readable on any smart phone 


Modeling with Data Scarcity Good data are required 
to support acceptable model predictions. The acquisi¬ 
tion of high-quality data, with appropriate spatial and 
temporal resolution, is expensive and often beyond 
the scope of developing countries, except on a fairly 
limited scale. This, together with an often fragmented 
approach to the study of interacting ecosystems, in 
many cases driven by institutional barriers, presents 
a challenge to the application of models. 

Improved mechanisms for data access, particularly 
for remotely sensed data, together with models that 
deal with uncertainty and risk, will contribute to con¬ 
version of sparse data into more meaningful informa¬ 
tion - although such an approach may be considered 
unsuitable in parts of the developed world, in many 
countries it will be a much better basis for decisions 
than the options that are presently used. In addition, it 
will promote a “virtuous cycle” toward more informed 
decision support and promote the use of better data 
and more sophisticated models, as the data become 
available to drive them. 

A wide variety of models will benefit from this 
development because many models developed and 
tested for temperate systems need adjustment with 
respect to parameters, thresholds, and equations, 
when applied to tropical and subtropical systems. 

Information Technology The last 5 years have seen 
a huge leap in various areas of distributed computing, 
all of which are expected to develop significantly in the 
coming years. 

Three examples are presented here: 

1. Web 2.0 now provides a large diversity of commu¬ 
nity- and corporation-based resources. Currently, 
over 7,000 items exist for aquaculture on YouTube - 
at the time of presentation of the VITAL Thematic 
Review [10] there were 1,800. For aquaculture 
modeling, that number has also quadrupled since 
2010 and now stands at 75. 

2. There is a strong trend toward the development and 
use of Software as a Service (SAAS), deployed on the 
web and competing with traditional desktop appli¬ 
cations. This is incipient in the aquaculture world 
but can be seen, for example, in the WinShell 
application (http://longline.co.uk/winshell), which 
allows users to simulate individual shellfish growth 
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Tidal prediction for the port of Mumbai using the Oceanus21 smart phone app (http://longline.co.uk/oceanus21). Ports 
can be automatically selected based on GPS coordinates and phone location 


on line. Central to the development of this kind of 
application are Rich Internet Applications (RIA), 
which provide a full user experience and are an area 
of rapid growth [75] 

3. Mobile computing is increasingly ubiquitous, and it 
is now possible to use models on many handheld 
devices, as illustrated in Fig. 20, which shows a tide 
prediction model for Mumbai, India. This kind of 
model might be used to help predict the yield in 
intertidal culture of bivalve shellfish. 

The trend toward the increasing use of such devices, 
including for various real-time applications in aqua¬ 
culture management, will increase. In parallel, the 
stand-alone server is rapidly being replaced by cloud 
computing, which will tend to make the circulation of 
data both easier and cheaper. Both elements will con¬ 
tribute to bridge the information divide between richer 
and poorer nations. 

Computer access, literacy, and internet connectivity 
are significant barriers to entry for the population in 
the rural areas in developing countries, where aquacul¬ 
ture takes place. Over the coming decade, many aqua¬ 
culture farmers will have their first contact with the 
World Wide Web by means of smart phones. Cell 
phones are ubiquitous even in remote areas and are 
much better adapted to local language and alphabet, 
which constitute other barriers in accessing technology. 


The future of aquaculture is promising. It needs to 
be, given the world population growth expected in the 
next decades. Simulation models, particularly when 
used within appropriate frameworks, show enormous 
potential to inform and guide the future development 
of aquaculture, toward a world which is more socially 
responsible, more equitable, and more sustainable. 
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Glossary 

Benthic Pertaining to the sea floor. 

Carrying capacity The intensity of a practice that 
a given environment can sustain indefinitely given 
the availability of various necessities in that envi¬ 
ronment and the various pressures on them. 
Ecosystem approach to aquaculture (EAA) A strategy 
to integrate aquaculture into context of the wider 
ecosystem such that it promotes sustainable devel¬ 
opment, equity, and resilience of interlinked social 
and ecological systems. 

Ecosystem-based management (EBM) A process that 
highlights the need to use the best available knowl¬ 
edge about the marine ecosystem to manage marine 
resources, with an emphasis on maintaining eco¬ 
system service functions. 

Integrated coastal zone management (ICZM) A pro¬ 
cess for managing coastal zones that uses an inte¬ 
grated approach and considers all aspects of the 
coastal zones, including biological, geographical, 
and political boundaries, to achieve sustainability. 
Model A simplified description, conceptual or mathe¬ 
matical, of a system or process, to assist in calcula¬ 
tions, predictions, and understanding. 

Pelagic Pertaining to the water column. 

Definition of the Subject 

Bivalve aquaculture is one of the fastest growing sectors 
of the food industry, raising concerns about the influ¬ 
ence of the activity on the environment. This is true at 
two levels: First, farmers must make sure that the 
bivalves that they raise do not deplete resources in 
a given area to such an extent that bivalve production 
is decreased. Second, society in general wishes that such 
activities have an acceptable impact on the environ¬ 
ment and are sustainable. There is also intense compe¬ 
tition for space and its use in many coastal zones, 
making siting of many farms contentious. Thus, many 
organizations have stressed the importance of deter¬ 
mining the carrying capacity of different areas for 
bivalve culture. There are a number of ways that “car¬ 
rying capacity” may be defined, including physical, 
production, ecological, and social, and the first three 
categories are to lesser or greater degrees related to 
social expectations and standards. A number of 
methods have been developed to calculate these 
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different categories of carrying capacity for bivalve 
culture. This entry outlines some advances to estimate 
the different categories of carrying capacity and sug¬ 
gests a framework that may be followed to encourage 
the development of a sustainable bivalve aquaculture 
industry. An emphasis is placed on the latter two cat¬ 
egories as these are the ones for which knowledge is the 
most lacking, are arguably the most complex, and for 
which advances are the most pressing. 

Introduction 

Aquaculture is the fastest growing sector of the food 
industry. Since the 1970s, production in the sector has 
increased at a rate of about 7% per year and by 2008 
accounted for 43% of the total annual fisheries produc¬ 
tion of 160 million tons and projections by the UN [1] 
suggest that this production will increase greatly in the 
future. This increase in production has raised concerns 
about the impacts of the activity on local environ¬ 
ments, (e.g., [2]) and much work has been focused on 
understanding the interactions between aquaculture 
and the ecosystem, (e.g., [3, 4]). Although concerns 
were initially largely directed at the influence of finfish 
cage culture on the environment (e.g., nutrient load¬ 
ing, disease issues, and interactions due to escapes), 
concerns have also been raised about the influence of 
farmed bivalves. 

As for aquaculture in general, production of farmed 
bivalves has increased greatly over the past few years 
and as of 2008, 11.7 million tons of the total 13.5 mil¬ 
lion tons of worldwide bivalve captures was from aqua¬ 
culture production. This includes the culture of over 
4 million tons each of clams and oysters and about 
1.5 million tons each of scallops and mussels. The 
greatest concerns relating to farmed bivalves include 
enhanced localized biodeposition [5], food depletion 
in the water column due to bivalve grazing [6], alter¬ 
ation of nutrient and oxygen fluxes [7] , and the transfer 
of disease and hitchhiking species [8]. Recent reviews 
on the environmental interactions associated with 
bivalve culture have been done for oysters [9], clams 
[10], and mussels [11]. 

Increasingly, regulators and other groups are 
looking to science to determine whether sites have 
reached their carrying capacity for bivalve culture. 
Moreover, a number of standards - performance 


standards, best management practices, certification 
standards, etc. - have been or are being developed to 
ensure that bivalve production is being done in 
a manner that is consistent with a global environmental 
ethic [12-14]. Some of these codes have been devel¬ 
oped by industry groups and are voluntary in nature, 
some have been developed and are used by public 
authorities and regulatory agencies, and others are 
being developed by a variety of organizations (e.g., 
buyers, nongovernmental organizations, marketing 
groups) as a means of informing consumers about 
a product with the goal of influencing farm practices 
through consumer choice and market forces [15]. 
In all cases, the idea is that such criteria will encour¬ 
age the development of a sustainable bivalve aqua¬ 
culture industry such that the carrying capacity of 
a given area has not been exceeded and impacts are 
minimized. Recently, the National Research Council 
[15] suggested that performance standards based on 
the carrying capacity of sites be developed and 
implemented at the ecosystem level. Similarly, the 
FAO [16] has suggested that an Ecosystem Approach 
to Aquaculture (EAA) be followed to ensure the “inte¬ 
gration of the activity within the wider ecosystem in 
such a way that it promotes sustainable development, 
equity, and resilience of interlinked social and ecolog¬ 
ical systems.” Many advances have been made in the 
estimation of carrying capacity for bivalve culture over 
the past few years. The broad aim of this entry is thus to 
update an earlier paper on the subject [17] and to 
discuss a number of ideas that have been developed in 
the interim. It focuses on the outgrowing stage of 
bivalve culture as this stage is arguably the most impor¬ 
tant in terms of its interactions with the environment. 
Many other activities related to bivalve culture exist 
(see list below) and these are discussed by McKindsey 
et al. [17]. 

Abbreviated selection of activities related to bivalve 
culture that may influence the ecological carrying 
capacity of coastal areas. (Modified from McKindsey 
et al. [17].) 

1. Seed collection 

(a) Dredging 

(i) Disturbance of benthic communities, 
especially the removal of long-living 
species 



Carrying Capacity for Sustainable Bivalve Aquaculture 


C 


1961 


(ii) Removal of juveniles from wild 

populations of target species 

(iii) Collection of non-target species 

(iv) Suspension of sediments 

(v) Depletion of food resources for other 
species 

(vi) Release of H 2 S and reduction of dissolved 

oxygen in the water due to oxygen¬ 
consuming substances, release of 

nutrients 

(b) Artificial collectors 

(i) Removal of juveniles from wild popula¬ 
tion of target species 

(ii) Increasing target and non-target species 
recruitment success 

(iii) Alteration of the hydrodynamic regimes 

(iv) Acting as FAD 

(v) Risk of entanglement for large vertebrates 
(e.g., marine mammals, sea birds, turtles, 
sharks) 

(vi) Foci for nuisance species 

(c) Hatcheries 

(i) Chemical pollution (e.g., pharmaceuticals) 

(ii) Genetic selection 

(iii) Spread of diseases 

(d) Importation 

(i) Introduction of alien species 

(ii) Genetic pollution 

(iii) Spread of diseases 

2. Ongrowing 

(a) Effects common to all techniques 

(i) Organic enrichment of seafloor 

(ii) Providing reef-like structures 

(iii) Alteration of hydrodynamic regime (cur¬ 
rent speed, turbulence) 

(iv) Food web effects: competition with other 
filter feeders, increasing recycling speed 
of nutrients, removal of eggs and larvae 
of fish and benthic organisms 

(v) Spawning: release of mussel larvae 

(vi) Providing food for predators of bivalves 

(vii) Control of predators and pests 

(b) Bottom culture 

(i) Activities to prepare the culture plots, 
e.g., dredging for predator removal 


(ii) Placement of protective structures (net¬ 
ting, pipes) 

(iii) Removal of associated organisms by 
dredging and relaying 

(iv) Competition for space with wild benthos 
organisms 

(c) Artificial structures for suspended and off- 
bottom culture (trestles, poles, rafts, longlines) 

(i) Acting as artificial reef or FAD (attraction/ 
displacement or enhancement of animals) 

(ii) Risk of entanglement for large vertebrates 
(e.g., marine mammals, sea birds, turtles, 
sharks) 

(iii) Foci for nuisance species 

3. Harvesting 

(a) Effects common to all techniques 

(i) Removal of biomass, nutrients 

(ii) Removal of filtration capacity 

(iii) Removal of non-target species 

(iv) Competition with predators 

(b) Dredging 

(i) Disturbance of benthic communities, 
especially removal of long-living species 

(ii) Suspension of sediments 

(iii) Release of H 2 S and decrease of dissolved 
oxygen in the water due to oxygen¬ 
consuming substances, release of 
nutrients 

(c) Collection of off-bottom structures 

4. Processing 

(a) Dumping of by-catch 

(b) Relaying near auction houses 

(c) Depurating 

(d) Dumping of shells 

(e) Effluents from processing plant 

(f) Spread of alien species or diseases 

As a first step, it is important to understand what is 
meant by “carrying capacity.” A number of definitions 
have been suggested that consider a variety of physical, 
biological, and social criteria. In this entry, the four 
functional categories of “carrying capacity” as outlined 
by Inglis et al. [18] and McKindsey et al. [17] are used: 

1. Physical carrying capacity - the total area of marine 
farms that can be accommodated in the available 
physical space. 
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2. Production carrying capacity - the stocking density 
of bivalves at which harvests are maximized. 

3. Ecological carrying capacity - the stocking or farm 
density above which unacceptable ecological 
impacts become apparent. 

4. Social carrying capacity - the level of farm develop¬ 
ment above which unacceptable social impacts are 
manifested. 

The specific aims of this entry are to (1) provide an 
overview and update of different categories of carrying 
capacity, (2) give a more in-depth review of factors 
that could be considered for the determination of eco¬ 
logical and social carrying capacity as these categories 
are the least developed conceptually and in terms of 
modeling, (3) outline a decision framework for incor¬ 
porating all four categories of bivalve carrying capacity 
into the determination of the overall carrying capacity 
of a given area for bivalve culture, and (4) outline 
research to address knowledge gaps for carrying capac¬ 
ity studies. 

Functional Categories of "Carrying Capacity" 

Physical Carrying Capacity 

The physical carrying capacity of a site (embayment, 
inlet, offshore area, etc.) is simply the geographic area 
in which conditions are suitable for the production of 
a given species using a given method. It is a function of 
the overlap of the requirements of the species being 
harvested and the physical resources (e.g., depth, sub¬ 
strate type, salinity, hydrodynamics) in the site. The 
physical carrying capacity of a site may differ greatly 
between species and culture methods being employed. 
For example, an embayment may have an extensive 
area that is suitable for on-bottom clam culture but 
relatively little area that is appropriate for suspended 
oyster culture. Likewise, modification of an area (e.g., 
by the addition of a species-specific appropriate sub¬ 
strate) may extend the physical carrying capacity of 
an area. 

Traditionally, the physical carrying capacity of sites 
was determined by building knowledge of an area using 
hydrographic charts as the base layer, adding other 
appropriate layers (e.g., temperature and salinity) 
as available, and analyzing the layers using formal 
or ad hoc Geographic Information Systems (GIS), 


(e.g., [19, 20]). With the expansion of culture sites in 
more remote regions where such information may not 
be readily available, the use of remote sensing methods 
to estimate various physical parameters (e.g., depth, 
salinity, temperature, as well as chlorophyll levels and 
related biological parameters that are needed to esti¬ 
mate production carrying capacity - see further) may 
be used in combination with GIS to facilitate site selec¬ 
tion [21]. Although more of a social factor (see fur¬ 
ther), an additional advantage of using GIS for 
aquaculture site selection is that it can be used within 
a coastal management framework to include other 
activities so as to avoid user conflicts [22, 23]. 

Production Carrying Capacity 

The production carrying capacity of a site is the stock¬ 
ing density at which harvests are maximized. Given that 
trade-offs between bivalve growth rates, market tastes, 
and economic returns, etc., would likely encourage 
selection of a specific bivalve size and type (see further), 
production carrying capacity is often not necessarily 
the greatest biomass. This category of carrying capacity 
is what most people think about when they consider 
“carrying capacity” and is the best studied. The pro¬ 
duction carrying capacity of a given site is strongly 
related to hydrodynamic and food regimes and its 
physical carrying capacity. Indeed, various “habitat 
suitability models” have been developed that combine 
both categories of data to predict the best areas 
for bivalve production, (e.g., [24]). Models to predict 
production carrying capacity have three main compo¬ 
nents: (1) a hydrodynamic model that transports food, 
nutrients, and other wastes; (2) a biogeochemical com¬ 
ponent that describes processes that influence food 
production and consumption; and (3) a physiological 
component that determines the rate of food consump¬ 
tion and growth of the farmed bivalves [25]. 

Bivalve culture systems are hierarchical, with indi¬ 
vidual bivalves and their associated fouling organisms 
nested within culture units (socks, cages and stacks of 
cages, pearl nets and strings of pearl nets, etc.), these 
being nested within culture gear (longlines or rafts), 
which are nested within farms, and so on [26-28]. An 
understanding of processes operating at each scale as 
well as the relations between these scales is needed to 
predict hydrodynamics and how this influences 
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biogeochemical and physiological processes within cul¬ 
ture systems and to understand cascading effects on the 
greater ecosystem. 

In its simplest form, the production carrying capac¬ 
ity of a given location is a function of the available food 
resources and the rate at which they are renewed via 
in situ production and/or flushing relative to their rate 
of removal (filtration) by the farmed bivalves. This type 
of approach has been developed for mussel culture in 
eastern Canada [29]. However, such models commonly 
under- or overestimate flushing and a number of 
authors have suggested that their use be avoided for 
production carrying capacity studies, (e.g., [30]). 
Despite this, a number of groups have advocated cal¬ 
culating the ratio of clearance time by farmed bivalves 
to flushing rate or water residence times (as calculated 
by such methods) to evaluate the sustainability of sites 
for bivalve culture [31] and as a rough first step in 
evaluating whether carrying capacity is potentially 
exceeded for ecological certification standards [32]. In 
the latter case, if the ratio between clearance and 


renewal times is less than unity, then further analyses 
based on primary productivity are required. 

More encompassing approaches to calculating pro¬ 
duction carrying capacity based on hydrodynamic and 
mathematical models that include spatially and tem¬ 
porally explicit feedbacks between the farmed bivalves, 
phytoplankton, zooplankton, detritus, and physical 
(e.g., temperature, nutrients, see Fig. 1) parameters 
are becoming the standard. Models vary greatly in 
complexity. Although a simple box model for a 
location may be sufficient for some situations, it is 
likely unsuitable for embayments with some degree of 
environmental heterogeneity [33]. The next level of 
complexity is a simple 1-D hydrodynamic transport 
model, (e.g., [34]), which may be expanded upon to 
include vertical transport for suspended culture, (e.g., 
[35]). More complex models include multiple coupled 
boxes within an area driven by a 2-D hydrodynamic 
model (with tidal and meteorological forcings). The 
most complex models, fully refined 3-D finite-element 
hydrodynamic models, generally provide a much more 
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Conceptual diagram of bivalve aquaculture interactions in coastal ecosystems considered in production carrying 
capacity estimations. Only main pathways are indicated 
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accurate estimation of hydrodynamics and maybe used 
to drive the biological models. Historically, computa¬ 
tional power has limited the broad application of com¬ 
plex models [36] but this is no longer the case as 
powerful computers are now readily available [25]. 
However, increased development time and sampling 
requirements to validate more complex models make 
the use of simpler box models a good approximation 
for understanding the hydrodynamics of an area and 
slow ecological processes [37] and for carrying capacity 
modeling [38]. 

Production carrying capacity models require data 
on biogeochemical components which include a large 
number of variables related to the flux of nutrients and 
primary production and interactions between different 
food sources for the farmed bivalves. This includes 
nutrient fluxes due to excretion by the farmed bivalves 
and associated organisms and organic matter in 
trapped sediments in culture structures [39-41], nutri¬ 
ent uptake by and production of phytoplankton 
[42-44], zooplankton and detritus components, and 
a benthic component that includes biological and 
chemical processes [45-47]. This latter submodel is of 
particular importance in shallow areas [33]. Figure 1 
gives an example of the types of processes that are 
included within such models. 

Bivalves graze on and may impact plankton and 
other suspended components in the water column 
[48]. These components and environmental factors 
(e.g., temperature) in turn greatly influence the growth 
of the farmed bivalves. Models to predict the physio¬ 
logical response and growth of bivalves vary greatly in 
their complexity. Models may be divided into statistical 
and more mechanistic approaches that include bioen¬ 
ergetics [49, 50]. Today, many studies use an approach 
based on dynamic energy budget (DEB) models that 
were developed by Kooijman, (e.g., [51-53]). In these 
models, energy budgets are partitioned into core pro¬ 
cesses: structural volume, reserves, and a reproductive 
buffer, and forced by food and physical parameters. 
Examples include work on a variety of bivalves, (e.g., 
[25, 54-56]) and have been incorporated into general 
models for bivalve production carrying capacity [35, 
57]. A generalized set of DEB models for a few of the 
most commonly farmed bivalve species has been 
assembled and is available (http://www.shellsim.com). 
Production carrying capacity models based on this 


latter suite of components have been successfully used 
in a number of studies, (e.g., [35]). Although these 
types of models are attractive in that they include 
logical mechanistic processes, their complexity makes 
them difficult to apply at times and requires substantial 
ground truthing and simpler models will often yield 
acceptable results. 

Ultimately, the different sub-models must be put 
together to estimate production carrying capacity. An 
important advance in modeling production carrying 
capacity is the development of Graphical User Interface 
(GUI) modeling environments that allow users to link 
different sub-models fairly easily through a visual envi¬ 
ronment rather than with text commands. An example 
of this is the use of object-oriented modeling environ¬ 
ment, as promoted for the FARM model developed by 
Ferreira et al. [35, 57]. This model links together vari¬ 
ous components forced by a 1-D hydrodynamic model, 
or 1-D with horizontal mixing for off-bottom bivalve 
culture, to estimate production carrying capacity 
and eutrophication assessment (see further) at the 
farm-scale based on a limited number of parameters. 
It may also be used to guide the selection of growing 
sites, culture layouts, production densities, farmed 
species, and to maximize production or economic 
returns. The model has been shown to be useful in 
a number of locations and for a number of species 
[57] and is available as a client-server application 
(www.farmscale.org; www.longline.co.uk/winshell) . 
Simile [58] is an object-based GUI modeling environ¬ 
ment that has been used extensively by Grant and 
colleagues [25, 33, 38, 54, 59]. This environment is 
well suited to constructing carrying capacity models 
for bivalve culture because of its inherent ability to 
represent spatial elements and specify hydrodynamic 
connections between them and, because of its GUI 
environment, it is transferable to nonexperts [38]. 

Models for production carrying capacity usually 
consider only the farmed bivalves as consumers of 
plankton. However, in some situations, particularly 
shallow sites with large densities of natural bivalves, 
grazing by natural bivalves may also exert a large pres¬ 
sure on plankton communities [60-62]. Thus Sequeira 
et al. [63] have recently developed a model that 
includes natural benthic communities as a forcing 
function on plankton communities and suggest that 
these populations may significantly reduce production 
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carrying capacity. Similarly, Cugier et al. [64] 
found that wild native and exotic filter-feeding organ¬ 
ism had a greater effect on the control of primary 
productivity than did the farmed bivalves in Mont 
Saint Michel Bay, France (Fig. 2). The biomass of 
filter-feeding fouling organisms, particularly tuni- 
cates, on farmed bivalves and infrastructure may be 
considerable, at times greater than that of the farmed 
bivalves [65]. Given that they may have similar 
grazing rates to that of farmed bivalves [66], these 
should be included within production carrying 
capacity models where fouling is great. This has been 
attempted for the Thau Lagoon [67] and the 
Oosterschelde Estuary [68]. 

Ecological Carrying Capacity 

The definition the “ecological carrying capacity” of 
a system is greatly dependent on social values as what 
is considered to be an “unacceptable” impact is depen¬ 
dent on the values of a given society. A society or 
their representatives must select which components 
(e.g., species or habitats) of a given area are important 
and set acceptable limits of change for each. Often, 
information on specific parameters may not be avail¬ 
able and managers may choose to consider components 
for which information is available or more easily 
obtained. As pointed out by McKindsey et al. [17], 
this may be a logical choice given that components 
that have a high societal value are also likely to have 
been studied. 

While production carrying capacity focuses on the 
farmed bivalves themselves and the organisms that 
support their production, ecological carrying capacity 
also includes other organisms and habitats in the eco¬ 
system. There are three main categories of ecological 
carrying capacity: (1) that related to the pelagic habitat; 
(2) that related to the benthic habitat; and (3) that 
which employs ecosystem function approaches. The 
first category is largely related to plankton depletion 
due to grazing by farmed bivalves. The second category 
is related to increased sedimentation within culture 
sites due to biodeposition by the farmed bivalves. The 
third category considers both these issues and changes 
in biomass and energy flow in a system. Ecological 
carrying capacity is strongly related to both the physical 
carrying capacity and the production carrying capacity 


of a site, particularly with respect to that related to the 
pelagic habitat. 

Models to estimate the production carrying capac¬ 
ity of a site contribute to determining the ecological 
carrying capacity of a site for organisms that are depen¬ 
dent on the delivery of food from the water column. 
Presumably, if the farmed filter-feeding bivalves impact 
themselves by overgrazing the available resources, then 
they are also likely impacting other organisms that feed 
in a similar manner [69]. Indeed, some recent produc¬ 
tion carrying capacity models explicitly included ben¬ 
thic filter-feeding organisms [63, 64] and some work 
has suggested that overgrazing by farmed bivalves may 
impact other suspension feeders in the surrounding 
ecosystem [70]. The calculation of carrying capacity 
for other planktivores in the water column may be 
similarly evaluated. Sedimentation rates to the bottom 
are a function of the plankton communities in an area 
(e.g., [71]). Thus, as has been shown for natural sys¬ 
tems with bivalves [72, 73], depletion of the water 
column via grazing by farmed bivalves may influence 
sedimentation rates within a given area and thereby 
benthic communities, potentially enhancing differ¬ 
ences between benthic communities within and outside 
of culture sites [11]. 

With respect to the benthic habitat, research has 
largely focused on the impact of increased sedimenta¬ 
tion due to biodeposit production by farmed bivalves 
on infaunal communities [9-11]. Research on the eco¬ 
logical carrying capacity of benthic habitats has like¬ 
wise been focused on predicting how increased 
sedimentation due to biodeposit production by farmed 
bivalves influences infaunal communities. Grant et al. 
[74] predicted benthic loading rates from biodeposits 
from farmed mussels (Mytilus edulis ) at a bay scale as 
a balance between biodeposit production, as calculated 
using a simple physiological model, and flushing, as 
calculated based on a simple tidal prism model (vali¬ 
dated using a finite-element 2-D hydrodynamic model 
and field work). Although there was some variation 
between modeled and observed biodeposition for 
both studied bays, the authors suggest that the 
approach is valuable as a screening tool to develop 
relative ranking of different systems and identify poten¬ 
tial issues. A number of approaches have also been 
evaluated that couple spatially explicit hydrodynamic- 
dependent particle tracking models to predict flux of 
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Carrying Capacity for Sustainable Bivalve Aquaculture. Figure 2 

Simulated annual maximum chlorophyll a for the reference case (A - current situation with all filter feeders included) and 
differences with the reference case (in%) for five scenarios of with different filter feeders removed, (B - without exotic 
slipper-limpets, C - without farmed mussels, D - without farmed oysters, E - without native filter feeders, F - with no filter 
feeders in the bay). Because of the wide range of differences between the scenarios, the scale bar is not the same for each 
sub-figure (From Cugier et al. [64]) 
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biodeposits from farmed bivalves to the bottom to 
predict benthic loading footprints and associated ben¬ 
thic community changes. A simple approach is to 
modify the existing DEPOMOD model, which was 
developed to this end for finfish cage culture but for 
which individual bivalve culture structures (e.g., mus¬ 
sel longlines, see Fig. 3) or groups of structures in 
a system maybe modeled [75]. However, this approach 
has a number of limitations. First, the model assumes a 
homogenous flow field, an assumption that is unlikely 
to be true for extensive culture sites. Second, the mod¬ 
ule for resuspension of sedimented biodeposits is not 
fully developed and remains one of the greatest sources 
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Carrying Capacity for Sustainable Bivalve Aquaculture. 
Figure 3 

Modeled biodeposition footprint (g m -2 d -1 ) within an 
idealized suspended mussel farm with (a) mussels only and 
(b) mussels fouled with Ciona intestinolis. Biodeposition 
rates were modeled for five backlines (white lines in figure) 
measuring 100 m each based on currents measured during 
a 24 h period in St. Marys Bay, eastern Canada (Modified 
from McKindsey et al. [65]) 


of uncertainty for biodeposition/impact models [74]. 
Although resuspension may be negligible in areas with 
weak currents [75], Giles [76] used a more advanced 
hydrodynamic model to drive a biodeposit dispersion 
model and stressed the importance of this aspect for 
predicting the dispersal of biodeposits in areas with 
strong currents. A number of recent studies have 
started to address this issue and supply data to better 
parameterize this module, (e.g., [75, 77, 78]). Although 
general relationships exist for benthic organic loading 
and infaunal community structure [79], predictive 
dose-dependent relationships between organic loading 
from bivalve biodeposits and faunal responses are 
largely lacking, making predictions difficult [80]. Sim¬ 
ilarly, there is no universal metric available to describe 
benthic responses although a large number of indices 
have been developed, some showing good relationships 
with aquaculture [81]. As noted above, the biomass of 
tunicates and other fouling organisms on farmed 
bivalves and infrastructure may be considerable, and 
these likely contribute greatly to organic loading to the 
sea bottom in some cases [65, see Fig. 3] and thus must 
be considered when determining the biodeposition- 
related ecological carrying capacity of an area for the 
benthic habitat. 

Tenore et al. [82] used a mass-balance approach to 
examine the influence of bivalve culture as a part of 
a coastal ecosystem but specifically to predict ecological 
carrying capacity. Gibbs [83] used food web analysis to 
estimate the level of bivalve culture that could develop 
before it dominated the energy flow in a marine system 
and impacted fisheries resources. More recent work has 
used Ecopath with Ecosim [84] to determine the tro¬ 
phic functioning of various areas that include bivalve 
culture [85-88]. These models differ considerably in 
their complexity (i.e., number of trophic groups 
included) and have generally found that bivalve culture 
promotes short energy pathways with high trophic 
efficiency and energy cycling. However, the aim of 
these studies was not specifically to determine the eco¬ 
logical carrying capacities of the studied areas. Research 
[69] to evaluate the carrying capacity of an area for 
mussel culture in New Zealand using Ecopath found 
that the ecological carrying capacity of the area 
(defined as the level of culture that would not signifi¬ 
cantly change the major energy fluxes or structure of 
the food web) was much less than the production 
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carrying capacity (defined as the level of production at 
which the ecosystem collapses down to a nutrient- 
phytoplankton-culture-detritus-dominated system) 
of the area. More recent research in the eastern United 
States based on the same approach but for coastal 
lagoons with oysters also found that the stocking den¬ 
sity calculated for ecological carrying capacity was less 
than that calculated for production carrying capacity 
[89]. Although the original version of Ecopath was 
limited in its applicability because it could not be 
used to simulate changes to flows with time, the new 
version does not assume a steady state. Rather, it bases 
the parameterization on an assumption of mass bal¬ 
ance over an arbitrary period, often a whole year, and 
biomasses of any given trophic group need not be at 
equilibrium. A component of Ecopath, Ecosim, accepts 
time series data for different trophic groups and may be 
appropriate to evaluate different management (i.e., 
stocking and harvesting) options. These approaches 
have some limitations. Perhaps the most important 
for bivalve culture is that the models are not spatially 
explicit. Thus, the model may not be used to identify 
near-field and far-field effects and exchanges between 
areas are assumed to be instantaneous. This is not 
logical in complex coastal areas where most bivalve 
culture is done. This issue has been addressed to 
some extent with the development of Ecospace, 
a dynamic, spatial version of Ecopath that incorpo¬ 
rates the key elements of Ecosim [84]. This review 
found no studies that use either Ecosim or Ecospace 
to estimate ecological carrying capacity for bivalve 
culture; information to do this is usually not available 
and when such spatially explicit temporal data are 
not available, Ecopath provides a standardized meth¬ 
odology for developing a model [89]. As it is, data on 
many specific biological variables (life history values, 
interactions, etc.) are lacking for most systems. If 
default values are used, these must not be simply 
accepted without critical evaluation [90]. Even when 
data is available to guide the determination of variable 
values, many must be adjusted to make the model 
balance [69]. Perhaps most importantly, Ecopath 
uses a largely top-down mass-balance approach and 
thus poorly represents bottom-up effects [91], a 
situation that will be problematic for bivalve 
culture given that it largely impacts nutrients and 
lower trophic levels. 


In sum, most potential measures of ecological car¬ 
rying capacity consider only a single or a restrained 
number of ecosystem components [92]. Little research 
has been directed at understanding the impact of 
biodeposition (of biodeposits and of farmed bivalves 
and associated organisms) on the productivity or sus¬ 
tainability of benthic infaunal communities or the 
communities of larger invertebrates and fishes that 
may prefer to associate with culture sites to profit 
from the biodeposition. Likewise, little research has 
addressed the impact of the modification/addition of 
physical structure associated with bivalve culture (i.e., 
both the infrastructure and the farmed product, as well 
as removal of seagrass, etc.) [11]. The addition of 
structure may attract and create suitable habitat for 
a large variety of organisms, such as fish and decapods, 
but also of fouling organisms that may otherwise not 
have appropriate habitat in a given area. In contrast, the 
removal of some features for bivalve culture may repel 
other organisms. Thus, flexible approaches to evaluat¬ 
ing ecological carrying capacity need to be developed to 
incorporate our evolving understanding of the func¬ 
tioning of marine ecosystems and their interactions 
with bivalve culture [17, 89]. 

Social Carrying Capacity 

The social carrying capacity of a site is the most com¬ 
plex of all types of carrying capacity to determine. It 
depends on not only the three above categories of 
carrying capacity but also on trade-offs between stake¬ 
holders to meet the demands of both the population 
(socioeconomic factors such as traditional fisheries, 
employment in other sectors, and recreational use) 
and the environment (protected habitats, species, 
etc.) [22, 31, 83, 93-95]. Moreover, social issues often 
inform other categories of carrying capacity (see fur¬ 
ther in “Decision Framework” section). This category 
is at the heart of Marine Ecosystem-Based Management 
(EBM) and Integrated Coastal Zone Management 
(ICZM) and must be fully developed so that responsi¬ 
ble management decisions may be made [96]. That 
being said, the techniques for determining the social 
carrying capacity of a site are the least developed of all 
carrying capacity categories [97]. 

Despite these limitations, a number of criteria 
are common to each of the different methods: 
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representativeness, independence, and involvement/ 
buy-in of the groups involved in the process, and trans¬ 
parency [98]. 

One method that has been developed to manage 
natural resources is the concept of “limits of acceptable 
change,” or LAC. Although originally developed to 
define the limit of recreational activities in terrestrial 
wilderness areas, it may also be applicable to a wider 
range of natural resource management issues [99, 100]. 
More recently, it has been applied to bivalve aquacul¬ 
ture in New Zealand [ 101] . It was recognized that LAC 
was not a tool to determine the level of bivalve culture 
that was ecologically sustainable. Rather, it provided an 
adaptive management framework to prevent signifi¬ 
cant negative effects due to the activity. Through 
a collaborative process, LAC provides a framework to 
identify indicators of change, setting levels of changes 
of indicators that are acceptable, and identifying man¬ 
agement responses that are to be undertaken if these 
levels are exceeded. The selected indicators must be 
relevant and practical to measure. Although in the 
New Zealand example ecological indicators were 
selected (spatial extent of phytoplankton depletion, 
also done recently in eastern Canada [25]), the 
approach could also be used for other types of indica¬ 
tors derived from other categories of carrying capacity 
evaluations. The important aspect is that the selected 
indicators are chosen in a collaborative way [102], 
including input and discussions with members of the 
public, environmental managers, scientists, members 
of the industry, groups with conflicting interests, etc. In 
addition to the process being representative, it must 
also be transparent and adaptive so that new informa¬ 
tion may be easily included in the decision-making 
process. Indeed, much work has shown that collabora¬ 
tive efforts between all stakeholders are essential to 
ensuring satisfaction in the consultation process and 
developing policy [103]. 

A recent example of this is the evaluation of the 
ecological carrying capacity of lagoons in Rhode Island, 
eastern United States, for oyster culture [97]. This 
involved the development of the Working Group on 
Aquaculture Regulations (WGAR), which guided and 
oversaw the development of a mass-balance modeling 
approach to calculate both the ecological and social 
carrying capacities of the area for oyster culture. 
Whereas it is usually left up to the modelers to 


determine what constitutes an acceptable or an unac¬ 
ceptable impact [97], it was considered that the process 
would be much more transparent and inclusive by 
including input from all stakeholders, following the 
criteria outlined by Soto et al. [104] stating that an 
ecosystem approach to aquaculture (EAA) should: 

1. Be developed in the context of ecosystem functions 
and services with no degradation of these beyond 
their resilience capacity 

2. Improve human well-being and equity for all rele¬ 
vant stakeholders 

3. Be developed in the context of (and integrated to) 
other relevant sectors 

The above strategies to evaluate the social carrying 
capacity of locations may be considered as social- 
ecological systems (SESs) where all of the potential 
stakeholders participate. Walker et al. [105] outline 
four general steps in the process: 

1. Stakeholder-led development of a conceptual 
model of the system, including its history and pre¬ 
liminary assessments of the drivers of key ecosystem 
goods and services. 

2. Identification of the range of drivers of the system, 
stakeholder visions for the future, and contrasting 
possible future policies, to identify a limited set of 
future scenarios. 

3. Identify the resilience of the system by examining 
the results from i and ii. This is generally done 
through the development of models of the system’s 
dynamics to identify important attributes that 
affect resilience. 

4. Stakeholder evaluation of the process and outcomes 
in terms of policy and management implications. 

Ostrom [106] recently developed a framework to 
evaluate the sustainability of management strategies for 
SESs. It is a nested approach with four first-level core 
subsystems. In bivalve culture systems, these may 
include (1) resources systems - a given embayment or 
other logical management area; (2) resource units - 
such as phytoplankton biomass or physical space; 

(3) governance systems - the specific rules related to 
culture activities that manage resource use; and 

(4) users - the various groups that use the resource 
for sustenance, recreation, etc. Each core subsystem is 
made up of a series of secondary subsystems and 
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interacts with other core subsystems. One of the main 
findings was that, although many researchers have 
predicted that users of a system will not self-organize 
to create a sustainable SES and thus regulation by 
governments is needed, this prediction was not 
supported when stakeholders were enabled to discuss 
management options. Monitoring and enforcement 
were other key components that determined the suc¬ 
cess of the studied SESs. 

A further method to determine the social carrying 
capacity of a site is by attaching a monetary value to the 
different categories of impact due to aquaculture [15]. 
However, this is a complex undertaking and must 
include both positive and negative effects of the 
operations [107]. Moreover, acceptability of different 
effects is quite variable among locations and groups, 
further complicating the goal of attaining consensus for 
different variables. 

The use of fuzzy expert systems has also been advo¬ 
cated to determine the social carrying capacity of an 
area for bivalve culture [17]. This approach has been 
shown to be useful when data types are disparate and 
uncertainties are great or simply unknown. In short, 


instead of stating that a given level of production is 
acceptable or unacceptable, a given level is treated as 
being, say, 50% acceptable. It is rather more like dealing 
with varying shades of gray rather than cases that are 
black and white. This has the important effect of induc¬ 
ing less conflict between stakeholders on issues that 
may be contentious. 

In sum, methods to determine social carrying 
capacity for bivalve culture remain poorly developed. 
In general, few methods have been developed specifi¬ 
cally for aquaculture although methods developed 
for other sectors may be adapted. Analysis of the 
attempts to use SESs in other systems has shown that 
some approaches work; such approaches may be 
looked to for determining social carrying capacity for 
bivalve culture. 

Decision Framework to Determine Carrying 
Capacity 

A hierarchical approach to determine the carrying 
capacity of an area for bivalve culture is proposed 
(Fig. 4). The first level, the physical carrying capacity, 
relates to the available natural conditions and the needs 
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Carrying Capacity for Sustainable Bivalve Aquaculture. Figure 4 

Hierarchical structure to determine carrying capacity of a given area. Note that social carrying capacity provides 
guidance to choosing pertinent response variables and on establishing limits for these. Superscripts indicate examples 
of the type of information that informs the selection of response variables for other carrying capacity categories 
(Modified from McKindsey et al. [17]) 
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of the operation and bivalves to be cultured. The 
second level, the production carrying capacity, is 
a function of the supply of food to the farmed bivalve 
in an area and is determined using modeling efforts. 
At the third level, the ecological carrying capacity of an 
area is estimated using modeling or by following 
a logical framework to evaluate the range of possible 
outcomes for production estimates varying between 
none (and/or the current level) and the maximum 
calculated production carrying capacity; there is 
little use to go beyond this point as it is assumed that 
this level will not be surpassed knowingly. Finally, 
managers weigh and balance the different scenarios 
based on the outcomes from each of the preceding 
categories of carrying capacity and make a decision as 
to what level of productivity is acceptable - the social 
carrying capacity. 

It must be understood that the three first categories 
of carrying capacity are a function of social values. For 
example, a given coastal area in which bivalve culture 
may be done is often valued for myriad competing 
activities, such as water quality, visual aesthetics, com¬ 
petition for limited space, other species, etc. For the 
first category, physical carrying capacity, competition 
for physical space may be direct or indirect. For exam¬ 
ple, direct competition may result as there may be 
multiple demands to use the same area for different 
purposes, such as bivalve culture as well as fishing or 
recreational boating. Indirect competition may arise 
because of visual aesthetics or the not-in-my-backyard 
factor where waterfront owners or users do not want 
their views affected by aquaculture installations. Pro¬ 
duction carrying capacity is likely rarely maximized 
because of economic costs related to production. 
Thus, Nobre et al. [108] developed a modeling 
approach that considers both ecological and economic 
aspects and their interactions to maximize profit and 
EAA as focusing aquaculture management on maxi¬ 
mizing output is likely economically inefficient and 
carries unnecessary ecological risks. In fact, this 
approach addresses all four categories of carrying 
capacity with an emphasis on maximizing economic 
returns. The ecological carrying capacity of an area 
is also clearly a function of social values as what is 
valued varies among populations (e.g., specific bird 
and fish populations, water clarity, eelgrass, or specific 
rare habitats). 


Consultation with stakeholders throughout the 
process of defining the carrying capacity for a given 
area will identify the appropriate (given the social 
values of the population) response variables or indices 
to be examined [97]. Ideally, the scientists should then 
be able to select suitable tools from a toolbox (e.g., 
models, GIS, and comparisons with previous studies) 
to predict the form of the response curves of the 
selected response variables to a range of production- 
level scenarios. It is important to note that although it 
is the role of scientists to describe the form of the 
responses to a range of production-level scenarios 
(see Fig. 5), their role ends once they have done this. 
It is then up to managers, in consultation with the 
various stakeholders, to use all available scientific infor¬ 
mation to inform regulations and policy [17, 97]. 
Again, this should be done within the context of EBM 
and ICZM, given that there is likely a paucity of infor¬ 
mation about a number of factors that are needed to 
make informed judgments [96]. Thus managers will 
likely have to rely on instinct, local knowledge, extrap¬ 
olation from studies done elsewhere, contributions 
from various stakeholders, etc., to make management 
and policy decisions. Even faced with an absence of 
some types of information, this does not remove the 
logic of the hierarchical nature of the decision 



Carrying Capacity for Sustainable Bivalve Aquaculture. 
Figure 5 

Hypothetical response curve of an environmental variable 
under the influence of varying levels of bivalve culture 
production. The dotted line indicates the level of the 
indicator that has been determined to be acceptable and 
the dashed line the corresponding level of production 
(Modified from McKindsey et al. [17]) 
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framework outlined above. Indeed, it could be argued 
that this is precisely when the process is most useful and 
should be followed using all available information - so 
that an unbiased view of the situation may be formed 
and thus promoting appropriate management deci¬ 
sions. As pointed out by McKindsey et al. [17], failure 
to follow the process (e.g., by stating at the outset of the 
process that certain types of development or develop¬ 
ments in certain areas are not permitted) will likely 
result in otherwise feasible bivalve culture installations 
not being initiated. It is also contrary to the notion of 
effective and transparent processes in the establishment 
ofICZM or EBM. 

Knowledge Gaps and Future Directions 

This entry concludes with a brief discussion of knowl¬ 
edge gaps and directions for future research to better 
estimate the carrying capacity of areas for sustainable 
bivalve culture, except for the estimation of physical 
carrying capacity for which methods are fairly well 
developed and are currently used extensively to this end. 

While it is generally true that “existing models must 
be made spatially explicit” [17], this is currently being 
done. However, models to this end require much infor¬ 
mation that is often not easily available. So, perhaps 
a better knowledge gap to address is to determine when 
such models are needed and when can general models 
be used. 

Similarly to the first point, although temporal var¬ 
iation must be included in models to consider seeding 
and harvesting activities, some generalized models to 
this end have already been developed [35, 57,108, 109]. 
These should be expanded upon and evaluated in sev¬ 
eral locations to fully assess their generality. 

Far-field impacts for the benthic environment 
need to be evaluated. For example, is sedimentation 
outside of culture sites decreased due to grazing and, if 
so, what is the influence of this on the benthic environ¬ 
ment? How does this influence our interpretation of 
“impacts” on the benthic environment within farm 
sites? 

Work is needed on how nonlinear or unexpected 
results may be incorporated into carrying capacity 
models. For example, how can the provision of bivalve 
culture-related structure in the environment - which 
may increase the abundance of fouling organisms and 


thus filtration capacity or attract various other organ¬ 
isms - be included in carrying capacity estimates? 

Methods need to be developed to include both 
“positive” and “negative” effects into the decision¬ 
making process. For example, while suspended bivalve 
culture may increase organic loading locally, it may also 
mitigate the effects of eutrophication (and models 
have been developed to assess this, e.g., 35) or increase 
the abundance of commercially important species 
locally [11]. 

Despite the growing prevalence of certification 
standards and ecolabeling for aquaculture products, it 
is not clear that such processes are improving condi¬ 
tions in the field [14]. Although such standards are 
often elaborated within a multi-stakeholder frame¬ 
work, it is not clear under which conditions they are 
necessary and how the three previous points are con¬ 
sidered in their development or application. 

Given that many groups promote including the 
notions of reversibility or resilience in the definition 
of carrying capacity, (e.g., [ 16] ), surprisingly little work 
has actually addressed this for the benthic environment 
[but see 110]. Work on evaluating the resilience of 
benthic systems to bivalve culture is needed. 

Appropriate management tools, such as methods 
for combining disparate data types, need to be devel¬ 
oped to incorporate the information needed to esti¬ 
mate various aspects of carrying capacity to aid in 
decision making and policy development. 
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Glossary 

CdTe Inorganic compound semiconductor with high 
absorption coefficient (larger than 10 4 cm -1 ) and 
almost ideal energy bandgap (1.49 eV) for terres¬ 
trial photovoltaic applications. 

Conversion efficiency Ratio between the useful elec¬ 
trical power of the solar cell or module to the 
incoming sunlight power (here called efficiency). 
The conversion efficiency of solar cells and modules 
depends on external parameters such as tempera¬ 
ture, irradiation intensity, and spectrum. For com¬ 
parable results the conversion efficiency has to be 
measured under standard test conditions (STC). If 
not otherwise mentioned the presented efficiency 
values are measured under STC. 

Heterojunction A pn-junction where the p- and 
n-type semiconductors are different materials 
such as a junction in the CdTe/CdS stack. 

Photovoltaic (effect) Direct conversion of the sunlight 
into electricity, first discovered by A. E. Becquerel 
in 1839. 

pn-junction Combination of p- and n-type semicon¬ 
ductors for the separation of charge carriers gener¬ 
ated by the photovoltaic effect. 

Solar cell Electronic device for direct conversion of 
solar energy into electrical energy using the photo¬ 
voltaic effect (also called photovoltaic cell). 

Solar module Interconnection of a number of solar 
cells in series for increasing the output voltage or in 
parallel for increasing the output current. 

Standard test conditions (STC) Definition of the 
external parameters for the device under test 
during determination of the photovoltaic perfor¬ 
mance. These include a defined irradiation inten¬ 
sity of 100 mW/cm 2 , a reference illumination 
spectrum AM 1.5 G, and device temperature of 
25 °C. 

Thin film Layer of material with thickness between 
sub-nanometer and few micrometers. Thin film 
solar cells consist of a stack of several thin films 
with different properties and functions. 


Definition of the Subject 

CdTe thin film solar cells and modules are currently the 
driving force for production cost reduction in the 
whole photovoltaic sector. The device concept is simple 
consisting of multilayer stack of metal and semicon¬ 
ducting thin films. For the deposition of these layers 
a variety of large area and high-speed deposition 
methods are available. The CdTe solar cell as it is 
known today was first described in 1969 by Adirovich 
et al. [1]. In the beginning of the twenty-first century 
CdTe solar modules dominated the PV market in terms 
of production volume and lowest cost. This chapter is 
a review of more than 40 years of research and devel¬ 
opment activities and hot topics in the field of CdTe 
thin film photovoltaics. 

Introduction 

Still today the photovoltaic (PV) market is dominated 
by silicon wafer-based solar modules; however, in 2009 
the annual production volume of CdTe thin film solar 
modules increased to above 10% of the total annual PV 
production [2]. The enormous increase of the CdTe- 
based PV in the market share in the recent history is 
induced by the company First Solar Inc. By the end of 
2009, they reached a production capacity of 1.4 GWp 
and announced production costs as low as 0.76$/Wp as 
well as a plan for further growth of the capacity toward 
2.2 GWp by end of 2012 [3]. This makes the leading 
producer of CdTe PV technology First Solar Inc. to be 
the largest solar cell manufacturer worldwide (2010). 

In CdTe compound semiconductor-based solar 
cells CdTe is the absorbing material with direct energy 
bandgap of 1.49 eV at 300K [4] and an absorption 
coefficient larger than 10 4 cm -1 for photons with 
energy above the bandgap [5]. In the Shockley- 
Queisser model these are almost ideal properties for 
the application of terrestrial solar energy conversion 
[6]. The theoretical efficiency limit of CdTe-based 
solar cells is about 30% under AM 1.5 G irradiation 
(reference spectrum of the sun under an angle of 
48.2°, the spectrum is specified in the international 
standard IEC60904-3 Ed.2 2008) [6, 7]. 

CdTe solar cell research started with the investiga¬ 
tion of pn-homojunctions by RCA Laboratories, USA, 
in the late 1950s; Paul Rappaport at RCA Lab reported 
an efficiency of 2.1% (100 mW/cm 2 , unknown 
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spectrum) for a p-type CdTe single crystal-based cell 
with one-sided n-type doping by indium in 1959 [8]. 
Twenty years later in 1979, active area efficiency of 6% 
(90 mW/cm 2 , tungsten lamp) was reported by French 
National Center for Scientific Research (CNRS) in 
France for a homojunction device with a p-type CdTe 
layer grown by close space vapor transport deposition 
onto a n-type single crystalline CdTe substrate [9]. In 
1982, CNRS reported improved efficiency of 10.7% 
(107 mW/cm 2 , AMI spectrum) with the same deposi¬ 
tion method [10]. The main problems of the single 
crystal CdTe solar cells based on the concept of shallow 
pn-homojunction are linked to the difficulties in 
achieving high doping concentration in the thin 
p-type conducting layer that is deposited onto the 
n-type single crystal and also the diffusion of compen¬ 
sating dopants from the thin layer into the CdTe 
substrate [11]. 

CdTe heteroj unction solar cells were investigated in 
two approaches according to the CdTe conductivity 
type. Work on n-type CdTe was mainly done in com¬ 
bination with p-type Cu 2 Te. This investigation started 
in the 1960s and the best cell efficiency of about 7% 
(60 mW/cm 2 , unknown spectrum) was reported in 
1973 by Justi et al. [12]. In those devices light enters 
through the p-type Cu 2 Te layer and is absorbed in the 
n-type CdTe and a metal mesh is used as front contact. 
A major problem in further increasing the efficiency 
was the lack of p-type transparent conductors. 

The work on p-type CdTe solar cells also started in 
the 1960s by depositing a thin layer of n-type semicon¬ 
ductor on p-type CdTe single crystals [13]. In this 
configuration the highest efficiency of 13.4% 
(100 mW/cm 2 , AMI.5) with remarkable high open 
circuit voltage of 892 mV was obtained with 
a n-In 2 0 3 /p-CdTe junction [14]. 

Thin film CdS/CdTe solar cells as they are known 
today were first demonstrated by Adirovich et al. in 
1969 with efficiency of about 1% (100 mW/cm 2 , 
unknown spectrum) [1], A comprehensive description 
of CdS/CdTe solar cells including fundamental issues 
some of which are still open challenges today was 
presented by Bonnet and Rabenhorst in 1972 [15]. 
They reported an efficiency of up to 6% (50 mW/cm 2 , 
xenon arc lamp using UVand IR filters). The CdS/CdTe 
thin film solar cells have been intensively studied by 
many research groups worldwide. Major previous 


reviews on CdTe thin film photovoltaics include the 
pioneer work of R.H. Bube [16], the book chapters of 
McCandless & Sites [17] and of Burgelman [18], 
a special issue on CdTe in Solar Cells [19], and several 
other contributions such as a comparison between 
CIGS and CdTe technology [20]. In the following sec¬ 
tions a review of the achievements, results, and chal¬ 
lenges of thin film CdS/CdTe solar cell research and 
development is presented. 

State-of-the-Art of CdTe Solar Cells 

State-of-the-art high-efficiency CdTe solar cells are 
multilayer devices with CdS and CdTe thin films as 
n-p heteroj unctions sandwiched between front and 
back electrical contacts. Figure 1 shows the schematic 
cross-section of such a CdS/CdTe thin film solar cell in 
superstrate configuration (layer stacks are always given 
in sequence of the deposition). The superstrate config¬ 
uration means that the light enters the device through 
the substrate which has to be a transparent material for 
optical photons. In this configuration a transparent 
and electrically conducting front contact is deposited 
first on the substrate. Typical materials used for this 
purpose in CdTe solar cells are transparent conductive 
oxide (TCO) layers such as tin oxide doped with fluo¬ 
rine (FTO), indium oxide doped with tin (ITO), zinc 
oxide doped with aluminum (AZO), and cadmium 
stannate Cd 2 Sn0 4 (CTO). In most highly efficient 
CdS/CdTe solar cell configurations the TCO layer is 
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CdTe Solar Cells. Figure 1 

Schematic cross-sections of CdS/CdTe thin film solar cells 
in superstrate [left) and substrate [right) configuration 
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covered by a highly resistive and transparent (HRT) 
layer. The function of that HRT layer is discussed in 
section Highly Resistive and Transparent Layer. 

A thin layer of CdS is deposited as n-type semicon¬ 
ductor onto the TCO followed by the p-type CdTe 
absorber. The TCO/CdS/CdTe layer stack is annealed 
in a chlorine and oxygen containing ambient, which is 
commonly referred to as “activation treatment” or 
“CdCl 2 treatment.” For low resistive ohmic electrical 
back contact formation, a chemical etching process for 
modification of the CdTe surface is commonly applied. 
The necessity of that surface treatment process prior to 
the metal back contact deposition is because of the 
problems associated with high electron affinity and 
energy bandgap of CdTe. The requirements, properties, 
and functionality of each layer and CdTe surface mod¬ 
ifications are described in section Processing and Prop¬ 
erties. The very first CdS/CdTe thin film solar cell in 
superstrate configuration was presented already in 
1969 by Adirovich et al. [1] with efficiency of about 
1% under low illumination intensity. They deposited 
the CdS and subsequently CdTe layers onto conductive 
Sn0 2 coated glass substrates and used Cu as back 
electrical contact. Figure 2 shows the schematic energy 
band diagram for state-of-the-art CdS/CdTe solar cells. 
In this schematic the light is entering the device from 
the left side. 

Broadly, the CdTe solar cell processing technology 
can be categorized based on high or low deposition 


temperatures used for the deposition of the CdTe 
layer. High-temperature methods (above 550° C) 
include close space sublimation (CSS), vapor transport 
deposition (VTD), and modifications of those, while 
low-temperature methods (below 450° C) include 
sputtering, electro deposition (ED), and high vacuum 
evaporation (HVE). Table 1 shows a summary of 
achieved solar cell efficiencies on rigid glass and flexible 
polyimide substrates with CdTe layer deposited by dif¬ 
ferent methods. 

The certified record efficiency for CdS/CdTe solar 
cells on glass is 16.5% on cell level achieved by the 



CdTe Solar Cells. Figure 2 

Schematic energy band diagram of CdS/CdTe solar cells 
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CdTe Solar Cells. Table 1 PV performance of CdS/CdTe solar cells in superstrate configuration deposited by various 
methods on rigid glass and flexible polymer substrate. The given temperature is the maximum temperature of the 
complete process. If not otherwise mentioned, the PV performance was measured under standard test conditions 


Method 

Temp (°C) 

Substrate/TCO/HRT 

Eff. (%) 

Voc (mV) 

J sc (mA/cm 2 ) 

FF (%) 

Area (cm 2 ) 

Lab 

Ref. 

CSS 

660 

BSG/CTO/ZTO 

16.5 

845 

25.9 a 

75.5 

1.03 

NREL 

[21,22] 

VTD 

550 

SLG/Sn0 2 /Ga 2 0 3 

13.6 

810 

23.7 

70.8 

- 

IEC 

[23] 

Sputter 

390 

ASG/AZO 

14.0 

814 

23.6 

73.3 

0.15 

Toledo 

[24] 

HVE 

420 

BSG/AZO/i-ZnO 

15.6 

834 

24.7 a 

75.9 

0.15 

Empa 

[25] 

ED 

400 

Glass/FTO 

14.2 

819 

23.5 b 

74 

0.02 

BP Solar 

[26] 

HVE 

420 

PI/AZO/i-ZnO 

12.4 

823 

19.6 

76.5 

0.15 

Empa 

[27] 

Sputter 

390 

PI/AZO 

10.7 

768 

19.5 

69.8 

- 

Toledo 

[28] 


BSG borosilicate glass, ASG aluminosilicate glass, 5LG soda-lime glass, PI polyimide 

a With antireflection (AR) coating 

b Measured under ELH lamp with 84 mW/cm 2 intensity 































1980 


c 


CdTe Solar Cells 


National Renewable Energy Laboratory (NREL), USA, 
using a complex process to minimize various optical 
and electrical losses and deposition/annealing process 
temperatures to well above 600°C [21]. Efficiency of 
15.6% was achieved with the low-temperature (below 
450°C) HVE process [25]. It can be seen in Table 1 that 
for high- and low-temperature processed devices com¬ 
parable open circuit voltages and fill factors are 
obtained. This indicates that the process temperature 
for the CdTe deposition may not have a major impact 
on the electrical quality of the polycrystalline semicon¬ 
ductor as far as their relevance for the highest reported 
efficiencies are concerned. The largest difference of the 
compared devices is visible in the short circuit current 
density depending on the optical properties of the used 
TCO material, the CdS layer thickness, and the sub¬ 
strate material itself. 

The limited transmittance through the substrate 
material can also impose a major limitation in the 
CdS/CdTe solar cells in superstrate configuration. 
While borosilicate glass (BSG) has 3-5% higher trans¬ 
parency compared to standard soda-lime glass (SLG) 
the low transparency of polyimide (PI) is particularly 
a problem for the development of flexible, lightweight 
solar cells. Polyimide has been currently identified 
as the most promising material for the development 
of flexible CdS/CdTe solar cells in superstrate 
configuration. 

The PI material is chemically stable and processing 
temperatures of 450°C in vacuum can be applied with¬ 
out significant changes of the material properties. With 
flexible and lightweight substrate highest efficiency of 


12.4% was obtained using a low-temperature HVE 
process [27]. The rather low J S c of that device results 
from the optical losses in the PI film while V 0 c and FF 
of flexible devices on PI are comparable to cells on glass 
substrates (see Table 1). 

Optical losses in the substrate material are not an 
issue in CdTe/CdS devices grown in the substrate con¬ 
figuration. This configuration was first described by 
Bonnet and Rabenhorst in 1972 [15]. They used 
a conductive molybdenum foil as substrate material 
covered with a thin layer of Cu. The CdTe layers were 
deposited by high-temperature vapor phase deposition 
(HTVPD) at 650° C followed by the CdS layer grown 
with vacuum evaporation and contacted with indium. 
With this configuration they reached device efficiencies 
between 5% and 6% (50 mW/cm 2 , xenon arc lamp 
using UV and IR filters). The role of the thin Cu 
layer between the Mo foil and the CdTe was for the 
formation of a heavily doped Cu x Te interlayer as 
tunneling contact. 

In Table 2, various CdTe/CdS devices in substrate 
configuration on rigid glass and flexible substrate are 
compared. The highest efficiency of 9.9% was reported 
by Romeo et al. already in 1992 [29]. They used a BSG 
substrate covered with a thin layer of Cu followed by 
a Mo layer as electrical back contact. They argued that 
the Cu diffuses through the Mo and forms a Cu x Te 
interlayer which acts as a tunneling contact between 
the Mo and the CdTe. The CdTe layer was deposited by 
HVE method followed by a sputtered CdS layer. The 
cell was finished with a 3 fim thick AZO layer. In 2010, 
Empa developed a completely Cu-free CdTe solar cell in 


CdTe Solar Cells. Table 2 PV performance of CdS/CdTe solar cells in substrate configuration deposited by various 
methods on rigid and flexible substrates. The given temperature is the maximum temperature of the complete process 


Method 

Temp (°C) 

Substrate 

Eff. (%) 

V oc (mV) 

J sc (mA/cm 2 ) 

FF (%) 

Area (cm 2 ) 

Lab 

Ref. 

HVE 

500 

BSG/Cu/Mo 

9.9 

690 

28 a 

51 

0.3 

Parma 

[29] 

HVE 

420 

BSG/Mo/Sb 2 Te 3 

8.6 

658 

21.4 

61 

0.15 

Empa 

[30] 

HTVPD 

650 

Mo foil/Cu 

5 



45 


Bl 

[15] 

CSS 

570 

Mo foil 

3.5 

500 

10.6 

40 

- 

UNAM 

[31] 

HVE 

500 

Mo foil 

5.3 

629 

22.0 

38 

- 

El Paso 

[32] 

HVE 

450 

Pl/Au/Cu 

7.3 

692 

21.6 

49 

- 

ETHZ 

[33] 

Sputter 

390 

Mo foil/ZnTe 

7.8 




0.05 

Toledo 

[34] 


a Spectrum and intensity of artificial illumination not reported 
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substrate configuration using a Mo/Sb 2 Te 3 back con¬ 
tact structure with efficiency of 8.6% measured under 
standard test conditions. In this device the CdTe layer 
was deposited by HVE followed by CBD-CdS and fin¬ 
ished with a sputtered i-ZnO/AZO bilayer electrical 
front contact [30]. 

With flexible CdTe/CdS devices in substrate config¬ 
uration highest efficiency of 7.8% was obtained with 
a nitrogen doped ZnTe tunneling contact between a Mo 
foil and a sputtered CdTe layer [34] . Without tunneling 
enhancing interlayer efficiency of 5.3% was reported by 
Singh et al. [32]. Romeo et al. achieved an efficiency of 
7.3% on polyimide in substrate configuration using 
a lift-off process [33]. The difficulty in formation of 
a low resistive and ohmic contact to the p-type CdTe - 
which does not contribute to diffusion of back contact 
elements across the pn-junction during cell processing 
and lifetime of the device - is the major limitation of 
the substrate configuration. The physical aspects of the 
ohmic contact formation are discussed in more detail 
in section Processing and Properties. 

Industrial Production 

For solar electricity generation CdTe solar cells are 
interconnected to large area solar modules in order to 
yield high voltage and power. For thin film solar mod¬ 
ules commonly monolithical interconnection is used 
where the layers are patterned during the growth of 
the multilayer stack for serial connection of adjacent 
cells. Figure 3 shows the principle of the monolithic 
interconnection. 

The industrial production of CdTe thin film solar 
modules started in the 1980s. At that time several 
companies located in USA, Japan, and Germany started 
large-scale production with different deposition 
technologies. Golden Solar Inc., USA, developed 


a production plant based on spray deposition tech¬ 
nique with annual capacity of 2 MWp [35]. Solar 
Cells Inc., USA (now First Solar Inc.), worked on the 
industrialization of the high rate vapor transport tech¬ 
nique [36]. BP Solar, USA, used electrodeposition for 
the growth of the CdTe layer and achieved module 
efficiency of 10.9% [37]. Matsushita Battery Company 
Ftd., Japan, developed screen printing as well as CSS 
technologies for large-scale CdTe deposition [38]. 
Both, BP Solar and Matsushita Battery built pilot- 
production plants for CdS/CdTe thin film solar mod¬ 
ules, however, stopped their activities in 2002. Antec 
Solar GmbH, Germany, worked on the industrializa¬ 
tion of a CSS technique based on the work of the solar 
pioneer Dieter Bonnet [39]. 

The key players in CdTe solar module commercial¬ 
ization are listed in Table 3. Today the company First 
Solar Inc. dominates not only the production of thin 
film solar modules but the whole PV market in terms of 
large production volume and lowest cost of produc¬ 
tion. With annual production capacity of 1.4 GWp in 
2010, First Solar Inc. has a total PV market share of 
more than 10%. Abound Solar, USA, is on the way of 
becoming the second largest producer of CdTe thin film 
modules with an annual production capacity target of 
200 MWp in 2010 [46]. The third company, which 
offers a CdTe solar module on the market is Calyxo 
GmbH, a subsidiary of the company Q-Cells SE in 
Germany. The annual production capacity target of 
Calyxo is 110 MWp by 2011 [42]. Arendi, a start-up 
company in Italy announced to set up a production 
plant for annual production capacity of 15 MWp. 
PrimeStar Solar, a company in USA with involvement 
of General Electric, announced an annual production 
capacity of tens of MWp. Roth8cRau AG together with 
CTF company in Germany is involved in developing 
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CdTe Solar Cells. Figure 3 

Principle of monolithic interconnection of single cells to solar modules 
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CdTe Solar Cells. Table 3 Summary of industrial activity on CdS/CdTe solar modules 


Company 

Eff. (%) 

> 

u 

o 

> 

Jsc (A) 

FF (%) 

Area (m 2 ) 

Method 

Ref. 

Abound Solar 

10.1 

47.8 

2.28 

66.8 

0.72 


[40] 

Antec Solar 3 

6.9 

85.0 

1.11 

52.8 

0.72 

CSS 

[41] 

Arendi 






css 


BP Solar 3 

10.9 

26.2 

3.24 

62.3 

0.49 

ED 

[37] 

Calyxo b 

9.0 

90.7 

1.16 

61.9 

0.72 

APVTD 

[42] 

First Solar Inc. 

11.1 

61.7 

1.76 

73.8 

0.72 

VTD 

[43] 

Golden Photon 3 

8.5 




0.33 

Spray 

[44] 

Matsushita 3 

9.3 




0.12 

Printing 

[38] 

Matsushita 3 

11.0 




0.54 

CSS 

[38] 

Roth&Rau c 

9.9 

121 

2.39 


1.92 


[45] 

Prime Star Solar 

- 





CSS 


Xunlight 26 Solar d 






Sputtering 


Solexant d 






Printing 


von Roll Solar d 






Printing 



Currently no activity 

Announced module efficiency of 10.8% in July 2010 
development of turnkey CdTe production plants 
development of flexible CdTe solar modules 


and offering turnkey CdTe production plants with an 
annual capacity of 80 MWp [47]. 

Most of the mentioned companies with currently 
running production plants of significant volume 
develop modules on rigid glass substrates. Substituting 
a flexible substrate for the rigid glass can further 
increase the low-cost potential of this technology. Flex¬ 
ible CdTe solar cells and modules offer several advan¬ 
tages in the production of devices as well as on the final 
product in terrestrial and space applications. A cost- 
effective and fast roll-to-roll deposition process is pos¬ 
sible with the use of a flexible substrate material. Also 
faster heating and cooling are applicable for thin flex¬ 
ible substrate material and there is no need for cost 
intensive robotics as needed for handling of fragile and 
heavy glass. This approach will not only lower the 
production cost but also the energy payback time 
significantly. 

Due to the light-weightiness and flexibility of the 
final product new markets become attractive. The 
building integrated architecture will enter new 


dimensions for energy production on roofs and facades 
not only for flat surfaces but also for complex curved 
shapes as well as for lightweight industrial and rural 
buildings. Other applications include the portable 
source of power for mobile and electronic applications. 
Flexible CdTe solar cells have been already developed in 
the superstrate and substrate configurations. Efficien¬ 
cies of 3.5-7.8% have been reported for the substrate 
configuration using molybdenum foil as substrate 
material [15, 31, 32, 34]. The highest reported efficien¬ 
cies of flexible CdTe solar cells in superstrate configu¬ 
ration on PI are 10.7% by sputtering [28] and 12.4% by 
high vacuum evaporation [27] . Using the later technol¬ 
ogy a flexible 31.9 cm 2 CdTe solar module of 8.0% total 
area efficiency was developed at Empa, Switzerland, 
with 11 cells monolithically interconnected by all 
laser-based scribing [48]. 

The company Xunlight 26 Solar, USA, is currently 
developing flexible CdS/CdTe solar modules using an 
all-sputtering approach based on the technology devel¬ 
oped by Compaan et al. at Toledo University [28] while 
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a few start-up companies have announced pilot plants 
based on proprietary printing technologies, e.g., 
Solaxant, USA, and von Roll Solar, Switzerland. 

Processing and Properties 

In this section processing and properties of the indi¬ 
vidual layers in the multilayered state-of-the-art CdTe 
thin film solar cells are described in detail. 

Window Layer 

The most commonly used CdTe thin film solar cells are 
pn-heterojunction devices. The junction is formed 
between the p-type CdTe compound semiconductor 
and a n + -type window layer stack. This is a one-sided 
abrupt junction resulting in an almost complete shift of 
the depletion region into the p-type absorber material. 
Photons with energy above the energy bandgap of the 
n-type semiconductor stack are absorbed in the win¬ 
dow material. Carriers generated within the diffusion 
length from the junction in the window material are 
collected and contribute to the photocurrent. However, 
because of the high recombination rate in the window 
layer and the asymmetric depletion region across the 
junction, most of the carriers generated in the window 
material are lost. Since photons absorbed in the win¬ 
dow layer do not contribute to the overall current it is 
essential to reduce the parasitic light absorption in the 
window material or layers to improve the performance 
of such devices. 

For optimum flow of minority charge carriers 
across the junction a continuous band alignment is 
desired. In heterojunction devices the pn-junction 
forming partners most often have different energy 
bandgaps resulting in energy band discontinuities. In 
such heteroj unction devices the band offset in the con¬ 
duction band between the p-type absorber and the 
n-type window material should be small in order to 
prevent a barrier formation for photogenerated elec¬ 
trons [49]. 

The requirements for the n-type window stack can 
be summarized as follows (cf. [49-51]): 

• Sufficiently large bandgap and negligible absorp¬ 
tion of light with energy below the energy bandgap 

of the window material in order to reduce parasitic 

absorption losses. 


• Conduction band offset A E c with p-type counter 
part of the heterojunction should be small; 
a negative A E c enhances interface recombination 
of electron hole pairs and a positive A E c forms 
a potential barrier for electrons. 

• Formation of no or only minimum number of 
interface traps (deep defects) at the junction. 
A large interface trap density increases the recom¬ 
bination velocity at the interface. 

• Fermi level close to the conduction band in order to 
reach a high built in potential and high recombina¬ 
tion barrier for holes at the interface. 

• High conductivity for efficient electron transport 
from the heteroj unction to an external circuit. 

In state-of-the-art CdTe-based solar cells a stack of 
three layers is used to meet the above-mentioned 
requirements of the window material. The stack is 
composed of a TCO layer for electronic transport of 
the generated charge carriers and a HRT layer in order 
to prevent shunting and current flow back into weak 
diodes. Both the TCO and the HRT layers are high 
bandgap materials with low parasitic absorption losses, 
however, with an unfavorable conduction band offset if 
directly applied onto the absorber layer which results in 
rather low efficiency devices [25]. In order to improve 
the band alignment between the n-type window mate¬ 
rial and the p-type absorber layer a thin CdS buffer 
layer is essential. An additional requirement of the CdS 
layer is the reduction of the interface trap (deep-defect) 
density. 

Transparent Conductive Oxide Layer The transpar¬ 
ent conductive oxide has to combine high transparency 
in the wavelength range between 300 and 850 nm with 
high electrical conductivity. As the presence of free 
carriers, which is needed for the current transport, 
leads to free carrier absorption there is always a trade 
of between conductivity and transparency. The most 
commonly used metal oxides for TCO materials are 
FTO, ITO, AZO, and CTO. Deposition techniques for 
thin film TCO include spray pyrolysis, chemical vapor 
deposition (CVD), rf- and pulsed dc-sputtering, and 
pulsed laser deposition. The method used for the depo¬ 
sition influences the layer properties as well as the 
production costs of the TCO layer. In superstrate device 
configuration it is an inevitable requirement that the 


c 




1984 


c 


CdTe Solar Cells 


TCO layer needs to withstand the subsequent layer 
depositions and treatments during CdTe solar cell pro¬ 
duction. Hence the suitability of the TCO material is 
predetermined by the selection of the CdTe solar cell 
production processes. For example, ZnO-based TCOs 
are incompatible with the ammonium hydroxide 
containing chemical bath deposition (CBD) solution 
often used for CdS deposition as well as with heat 
treatments above 300°C in the presence of oxygen due 
to increasing resistivity [52]. 

The optical and electrical performance of the TCO 
layer directly influences the J S c and FF of the device 
[53] . To quantify and compare the optical and electrical 
performance of different TCOs, Roy Gordon [54] pro¬ 
posed the expression er/a = R s * ln (T + R) — 1 as figure 
of merit, where a is the conductivity, a the absorption 
coefficient (average value in the visible range), R s is the 
sheet resistance in ohm square, and T and R are the 
total transmittance and reflectance (in the visible 
range). The highest figure of merit will be achieved by 
the TCO with the largest product of effective electron 
mass and mobility, while film thickness and doping 
concentration are less important parameters [55]. In 
Table 4, commonly used TCO materials are compared 
according to this figure of merit. It should be noted that 
the defined figure of merit does not take the specific 
plasma frequency and transparency cut of band-to- 
band transitions into account (usually in the UV 
part). For CdTe solar cells, where the response of the 


CdTe Solar Cells. Table 4 Figure of merit (As proposed by 
Gordon [55]) of some TCO materials used for CdTe solar 
cells. The higher the figure of merit value the higher will be 
the expected achievable solar cell efficiency 


Material 

Sheet 

resistance 

(fl/square) 

Visible 
abs. coeff. 
alpha 

Figure of 
merit 

(ft -1 ) 

Ref. 

CTO 

10 

0.01 

10 

[21] 

ZnO:F 

5 

0.03 

7 

[55] 

AZO (CVD) 

3.8 

0.05 

5 

[55] 

AZO 

(sputtered) 

4.5 

0.04 

5 

[30] 

ITO 

6 

0.04 

4 

[55] 

FTO 

8 

0.04 

3 

[55] 


device (300-850 nm) is close to the visible range of the 
solar spectrum (380-780 nm), this figure of merit is 
a meaningful value. Absorption in the glass which is 
mainly caused by iron impurities is also a source of 
optical loss in commercially available low-cost glasses. 

Other aspects that can play an important role for 
the choice of the TCO material are the minimum 
required deposition temperature, diffusion barrier 
properties, chemical durability, morphology and 
nucleation properties, and toxicity. A discussion of 
these and other aspects can be found in the work of 
Gordon [55]. 

For CdTe solar cell production FTO is one of the 
favored TCO materials as it is chemically and thermally 
stable and low-cost deposition processes, such as spray, 
are suitable for large-scale deposition. The highest 
figure of merit is achieved with CTO. With this layer 
processed at 660° C the highest reported CdTe solar cell 
efficiency of 16.5% has been achieved in conjunction 
with ZTO as highly resistive and transparent interlayer 
[21]. As shown in Table 4 also, AZO is an excellent 
material due to its figure of merit. Low cost of the raw 
material, low required deposition temperature, low 
toxicity, and low plasma frequency compared to com¬ 
petitive materials are other advantages of this material. 
The lower thermal and chemical stability compared to 
FTO and CTO are however problematic when conven¬ 
tional high-temperature production processes (like 
CSS) are used for CdTe solar cells. However, using 
low-temperature deposition techniques (below 
450°C) several research groups have succeeded in pro¬ 
ducing highly efficient CdTe solar cells on ZnO-based 
TCOs [25, 56]. 

Highly Resistive and Transparent Layer In high- 
efficiency CdTe devices a highly resistive and transpar¬ 
ent (HRT) layer is added between the TCO and the CdS 
in order to limit electrical losses at local shunts and 
grain boundaries which can occur especially in solar 
cells with reduced CdS layer thickness. Commonly used 
materials include Sn0 2 [57], ln 2 0 3 [58], Ga 2 0 3 [17], 
ZnO [25], and Zn 2 Sn0 4 (ZTO) [21]. Because of pos¬ 
sible nonconformal layer deposition of CdS on TCO 
and recrystallization during the CdCl 2 treatment of the 
CdS/CdTe stack, direct contacts between the TCO and 
the CdTe can locally occur due to island formation 
of the CdS and/or sulfur diffusion into the CdTe. 
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This means that locally the pn-junction can be formed 
between the TCO and the CdTe semiconductors [59]. 
Those TCO-CdTe diodes are connected in parallel with 
the CdS/CdTe diodes and adversely influence the 
device performance mainly reducing the V 0 c and FF 
and hence the efficiency. 

Rau et al. developed a model through which the 
beneficial effect of the i-ZnO layer in polycrystalline Cu 
(In,Ga)Se 2 (CIGS) -based solar cells can be explained by 
the conduction band offset at the HRT-buffer interface 
enabling self-limitation of electrical losses due to elec¬ 
trical potential differences between the n-type window 
material and the single grains in the CIGS absorber 
[60]. The offset barrier limits current flowing into the 
diodes with poor quality and thus improves the per¬ 
formance of parallel connected diodes. A similar role is 
envisaged for the HRT layer in CdTe-based devices. 
Additionally, the HRT layer can reduce diffusion from 
the TCO into the CdTe of metallic elements such as 
indium or aluminum which are potential compensat¬ 
ing donors in the p-type CdTe. 

CdS Buffer and Alternatives In state-of-the-art 
high-efficiency CdTe solar cells CdS is used as a buffer 
between the n-type transparent oxide bilayer and the 
p-type CdTe layer. The main role of the CdS layer is the 
proper alignment of the conduction and valence band 
in the heterojunction as well as reduction of interface 
traps. The CdS layer can be deposited with various 
techniques like CBD [21, 37, 61], sputter deposition 
[24], CSS [22], and HVE [62]. This variety of growth 
possibilities allows choosing a deposition technique 
which is compatible to the subsequent CdTe growth 
by either vacuum or non-vacuum-based processes. CdS 
grows intrinsically n-type and no relevant conduction 
band offset between CdS and CdTe is formed as calcu¬ 
lations [63] and measurements [64] revealed. If a thick 
CdS layer is used (above 200 nm), nearly all photons 
with wavelength below 517 nm are absorbed in the 
CdS. As the electron hole pairs in the CdS layer are 
barely collected this leads to losses in the J S c in the 
order of about 5 mA/cm 2 which translates into an 
efficiency loss of approximately 20% relative. The CdS 
absorption losses can be reduced by reducing the 
CdS layer thickness and simultaneously introducing 
the HRT layer to compensate for possible inhomoge¬ 
neous coverage (see Fig. 4). Table 5 illustrates the effects 



Wavelength (nm) 


CdTe Solar Cells. Figure 4 

External quantum efficiency (EQE) of CdS/CdTe devices 
with varied CdS layer thickness. The higher photo current 
of devices with thin buffer (below 100 nm) is compromised 
by decreasing V oc and FF. Optimum efficiency for lab cells 
is obtained with 50-80 nm thick CdS and HRT layers 

CdTe Solar Cells. Table 5 Impact of CdS window layer 
thickness and the use of HRT layers on solar cell perfor¬ 
mance [65]. The given CdS thickness is measured before 
the annealing treatment of the CdS/CdTe stack in CdCI 2 
ambient 


Window 

layer 

(thickness) 

(nm) 

HRT layer 

(thickness) 

(nm) 

Voc 

(mV) 

Jsc 

(mA/cm 2 ) 

FF (%) 

Eff (%) 

CdS (240) 

No 

803 

18.4 

73.8 

10.9 

CdS (120) 

No 

699 

19.9 

47.6 

6.6 

CdS (120) 

i-ZnO (200) 

844 

21.0 

72.3 

12.8 


of CdS thickness reduction and application of HRT 
layer on the PV properties of CdTe solar cells. 

In large area module production, however, a certain 
thickness of CdS is needed to maintain high produc¬ 
tion yield; hence a significant J S c loss due to parasitic 
light absorption is inevitable in industrially produced 
CdTe solar modules. Although the application of HRT 
layer helps in improving the reproducibility and 
processing robustness, there is still an interest to search 
for alternative materials to CdS, primarily to reduce 
optical losses. Promising candidates to replace CdS are 
wide bandgap oxides or sulfides. The requirements for 
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an alternative material are proper conduction band 
alignment to CdTe, sufficient temperature stability, 
and no out diffusion, especially of n-doping elements 
into CdTe (like Al, In, Ga), during subsequent CdTe 
deposition and annealing treatment. The ZnO/CdTe 
heterojunction has already been studied in 1988 [66] 
and is reported to yield solar cell efficiencies close to 
10% [25]. This heteroj unction system suffers from 
unfavorable band alignment and a large lattice 
mismatch [67, 68]. ZnS and Zn x Cd!_ x O grown by 
CBD have been tested as n-type partner to CdTe lead¬ 
ing to 3% efficient CdTe thin film devices [69]. ZnS 
deposited by spray pyrolysis and ZnO!_ x S x deposited by 
atomic layer deposition (ALD) were investigated lead¬ 
ing to nearly 10% efficient CdTe thin film devices [70]. 
The external quantum efficiency of such a ZnS/CdTe 
device is shown in comparison with a conventional 
CdS/CdTe device with 120 nm thick CdS layer in 
Fig. 5. Also shown are the transmittance (corrected 
for reflectance by T + R) of TCO and the HRT layers. 
The areas between the curves labeled with the numbers 
1, 2, and 3 indicate the dominant optical loss mecha¬ 
nisms due to parasitic light absorption in the CdS layer, 


reflection at the various interfaces, and the free carrier 
absorption in the TCO, respectively. The ZnO!_ x S x 
ternary compound is especially promising as it allows 
tuning the band alignment to CdTe via engineering the 
sulfur to oxygen ratio [71]. 

CdTe Absorber Layer 

The compound semiconductor CdTe has a bandgap of 
1.49 eV at 300 K and an absorption coefficient of 
>10 4 cm -1 . This makes CdTe an ideal absorber mate¬ 
rial in thin film solar cells. 

Another big advantage of CdTe is its simple phase 
diagram (Fig. 6) compared to other compound semi¬ 
conductors. CdTe evaporates congruently. The com¬ 
pound has a considerably higher fusion temperature 
and lower vapor pressure than the single elements. 
Excess tellurium and cadmium have a low solubility 
in CdTe. This facilitates stoichiometric growth and 
allows the utilization of a variety of different deposition 
techniques. Commonly used deposition techniques are 
described in the following subsections. A comprehen¬ 
sive review of CdTe deposition and properties can be 
found in [17]. 



CdTe Solar Cells. Figure 5 

Comparison of external quantum efficiency of CdS/CdTe and ZnS/CdTe solar cells. The other curves shown are 
transmittance plus reflectance (T + R) of the AZO TCO layer and the i-ZnO HRT layer. The areas between the curves labeled 
by the numbers 1,2, and 3 indicate the dominant loss mechanisms due to CdS absorption, reflection, and TCO free carrier 
absorption, respectively 
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CdTe Solar Cells. Figure 6 

(a) T-x Phase Diagram of CdTe at 1 atm. (b) Vapor pressures of different materials used for the production of CdTe solar 
cells. Due to the simple phase diagram of CdTe and the higher vapor pressures of the single elements compared to 
the compound, the growth of CdTe is relatively simple (Data from [1 7]) 


CdTe Deposition Techniques 

Close Space Sublimation (CSS) CdTe is evaporated 
out of the compound in an ambient of inert gas (Ar) 
with pressures in the order of 1 mbar [21, 39]. A small 
amount of oxygen is commonly added in order to 
reduce the formation of pinholes and to improve 
junction quality. A small source-substrate distance of 
a few millimeters is used to allow the sublimated 
material to reach the substrate at the rather high 
pressure. With this method growth rates on the order 
of pm/min are achieved. Substrate and source 
temperatures of approximately 600° C and 700° C are 
used, respectively. Grains of as-deposited layers exhibit 
nearly random orientation and are commonly 1-10 pm 
in size. A typical scanning electron microscopy (SEM) 
picture of the CdTe surface is shown in Fig. 7a. The 
current efficiency record for CdTe solar cells of 16.5% 
has been produced using this method [21]. 

Vapor Transport Deposition (VTD) A carrier gas 
(argon and oxygen) saturated with Cd and Te vapor is 
transferred to the heated substrate (500-600° C) where 
supersaturation and subsequent reaction to CdTe 
happens [72-74]. Grain sizes and orientation are 
comparable to CSS-deposited films. First Solar Inc., 
the largest producer of CdTe modules, is using this 
technique in its production plants [75]. 


High Vacuum Evaporation (HVE) CdTe is deposited 
either by co-evaporation from elemental sources or by 
evaporation from the compound source in high 
vacuum (10 -6 mbar) [20, 25, 76]. Figure 7b shows 
that as-deposited grains have sizes of approximately 
0.5-1 pm with a strong preferential orientation in 
(111) direction. In spite of using a growth temperature 
as low as 350°C this technique has proven to yield 
efficiencies up to 15.6% [25]. The low-temperature 
process also allows the growth of flexible and 
lightweight cells on polyimide film (section “State-of- 
the-Art of CdTe Solar Cells”). 

Sputtering A CdTe target is sputtered by Ar-ions at 
a pressure of about 10 mtorr [56, 77, 78] . Cd and Te are 
transferred to the sample where condensation and film 
growth takes place. CdTe films are grown at substrate 
temperatures of 200-400°C and as-sputtered CdTe 
with grain sizes of 100-200 nm are obtained [79]. 
With this method, 14% efficient cells on glass have 
been produced [56]. Sputtering has also been used for 
flexible cells on polymer films. 

Metal Organic Chemical Vapor Deposition 
(MOCVD) Cd and Te containing metal-organic 
precursors are transferred to the substrate by a carrier 
gas where the precursors decompose and reaction to 
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CdTe Solar Cells. Figure 7 

Scanning electron microscope (SEM) pictures of as-deposited CSS-CdTe and HVE-CdTe 


CdTe occurs [80]. An atmospheric pressure and a 
growth temperature of 300-400° C are used. With a 
MO CVD-grown CdTe/CdZnS solar cell, 13.3% 
efficiency has been achieved [80]. 

Electro deposition The cathodic electrodeposition of 
CdTe is carried out from acidic solutions containing 
Cd 2+ and HTe0 2 + [81]. The ions are reduced at 
the substrate and reaction to CdTe occurs. The 
temperature of the bath is approximately 90°C. Grain 
sizes of as-deposited CdTe films are of the order 
of 0.1-0.5 um. Subsequent annealing leads to 
recrystallization and conversion of the CdTe layer 
from n-type to p-type. 

Screen Printing A paste containing CdTe, CdCl 2 , and 
a binder is deposited onto the substrate through 
a mesh at atmospheric pressure [82, 83]. After heating 
to remove the binders, the layer is annealed at 
600-700° C to recrystallize the film and activate the 
junction. With this method, 12.8% solar cells have 
already been produced in 1984 [83]. 

Spray Deposition A solution containing precursors of 
Cd and Te and complexing agents is sprayed at 
atmospheric pressure onto a substrate [84-86]. 
Dissolved CdTe nanoparticles are also used for 
spraying CdTe films. After removal of the solvents by 
heating, the layer is annealed at temperatures up to 
600° C. 


Spin-Casting of Dissolved Nanocrystals A solar cell 
based on CdTe or CdSe nanoparticles has been 
introduced by the group of P. Alivisatos [87]. 
Dissolved CdTe or CdSe nanoparticles are spin-cast 
from solution on a TCO. The company Solexant is 
currently bringing this technology to market. 

CdCl 2 Annealing Treatment In order to obtain 
highly efficient CdTe solar cells, a heat treatment of 
the CdS/CdTe stack in a chlorine containing ambient 
is mandatory. This annealing treatment is commonly 
referred to as CdCl 2 -treatment or activation treatment. 
The treatment is crucial for highly efficient CdTe solar 
cells produced with any method. Figure 8 shows that 
the treatment improves all cell parameters. The details 
of the ambient, temperature, and duration may differ 
from group to group, depending on the process. Many 
investigations of the influence of the treatment on the 
structural and electronical properties of solar cells have 
been carried out. Comprehensive reviews can be found 
in [88, 89]. 

Annealing Treatment Methods The annealing 
treatment is performed in the presence of chlorine 
and oxygen at temperatures of 400-500°C [88, 89]. 
Chlorine can be introduced in various ways, but the 
most common way is by using CdCl 2 . CdTe films 
produced by screen printing of spray deposition, e.g., 
are treated with additives of CdCl 2 . In the case of 
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CdTe Solar Cells. Figure 8 

Photovoltaic performance of a HVE-grown CdTe solar cell 
heat treated at 420 °C with and without the presence of 
CdCI 2 

HVE- or CSS-CdTe, CdCl 2 is usually added after 
deposition. It can be done by dipping the cell into 
a solution of CdCl 2 in methanol, but because of 
problems for large-scale manufacturing, other 
methods have been developed like vacuum deposition 
of CdCl 2 and subsequent annealing in air or annealing 
the sample in a vapor containing CdCl 2 [90-92]. 
Alternatively to CdCl 2 one can use other Cl- 
containing chemicals to perform the treatment of 
CdTe. A gas from the Freon® family has been also 
used to fabricate CSS-CdTe solar cells with efficiency 
up to 15.8% [93,94] . Apart from replacing the CdCl 2 in 
the treatment, the Freon® treatment has the advantage 
of eliminating the need of an etching step, which is 
difficult to implement into in-line production [95]. 
However, some environmental concerns are raised 
about using this gas, but a close cycle loop can help to 
overcome this problem. Other possible chlorine 
sources include HC1 or Cl 2 [91, 96]. The different 
treatments showed similar effects on the structural 
and electronic properties. 

Structural Effects The CdCl 2 -treatment can have a 
strong influence on the structural properties of CdS 
and CdTe layers and interfaces depending on the 
growth conditions and the properties of the as- 
deposited layers. 

As-deposited HVE-CdTe grows heteroepitaxially 
on the CdS grains with a columnar structure with 


many stacking faults and a (111) preferred orientation 
especially when grown at low temperatures as it can be 
seen in Fig. 9 [62, 97]. The CdCl 2 treatment leads to 
recrystallization of the CdTe layer which starts from the 
surface of CdTe [97]. CdCl 2 acts as a sintering flux, 
enhancing grain growth and coalescence of grains. 
After the treatment HVE-CdTe grain sizes have grown 
to sizes of a few micrometers [62, 97]. The treatment 
reduces the preferential orientation in (111) direction, 
the orientation relationship between CdTe and CdS is 
lost, and the number of stacking faults is strongly 
reduced (Fig. 9). When the treatment in air is 
performed without the presence of chlorine, the 
increase in grain size is diminished and less reduction 
in stacking fault density is observed [97]. 

As-deposited CdTe layers grown with the high- 
temperature CSS-method exhibit large grain sizes, 
which increase with increasing distance from the CdS 
junction in addition to a few smaller CdTe grains close 
to the CdS junction. In this case no grain growth upon 
heat treatment is observed for large grains, while grain 
growth has been found for small grains close to the 
junction (Fig. 10) [97, 98]. 

As-deposited CdTe layers by electrodeposition 
method have rather small grains but after the annealing 
treatment significantly enlarged grain size is observed. 
In that case no extra CdCl 2 is added since the precur¬ 
sors contain the elements that are needed for the 
recrystallization. 

In conclusion, small grains become larger after 
recrystallization while no significant change is observed 
for initially large grains. CdTe grain sizes, which can 
differ for different deposition methods, have compara¬ 
ble sizes after the treatment. 

The lattice mismatch between CdTe and CdS is 
~ 10% [99] , which can lead to a large number of defects 
at the interface [97] . This lattice mismatch can partly be 
compensated by intermixing of CdS and CdTe in the 
interface region, forming CdS!_ x Te x . The bandgap of 
the compound changes from 1.49 eV for CdTe to 2.4 eV 
for CdS with a bowing which has its minimum at about 
1.4 eV for a S content of about 25%. The ternary 
compound has a miscibility gap, which was determined 
for temperatures above 650°C [100] and expanded to 
conditions relevant during activation treatment [101]. 
The solubility limit of S in CdTe and of Te in CdS at 
415°C was determined to be 5.8% and 3%, respectively. 
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CdTe Solar Cells. Figure 9 

Transmission electron microscopy (TEM) pictures of as-deposited and treated CdTe solar cells deposited by HVE [59] 


2.4- 
2 . 2 - 
2 . 0 - 
1 . 8 - 
1 . 6 - 

1.4- 
1 . 2 - 

1 . 0 - 

g 

'cu 0.8 • 


g 

'td 

cu 


o 


0 . 6 - 

0.4- 

0 . 2 - 

0 . 0 - 






S v 


o 


• o" 

v 

o*^> ° v 


• As-grown 
V 60 minutes 
■ 30 minutes 

O 20 minutes 
A As-grown (TEM-data) 
A Treated (TEM-data) 


— i— 
10 


11 


Distance from CdS/ITO interface (|im) 


CdTe Solar Cells. Figure 10 

As-deposited CSS-CdTe has large grains in the order of 1-3 pm with smaller grains close to the junction. CSS-CdTe does 
not show growth of large grains upon CdCI 2 treatment, but small grains close to the junction exhibit grain growth 
(Adapted from [98]) 


The highest S concentration observed in CdTe is close 
to the solubility limit [89, 97]. 

After the CdCl 2 treatment, intermixing of CdTe and 
CdS is observed as illustrated in Fig. 11. The extent of 
intermixing depends on the deposition methods used 
for CdS and CdTe layers. Intermixing has beneficial as 


well as detrimental effects. As already mentioned, it 
facilitates lattice relaxation between CdTe and CdS, 
which can reduce defects in the interface region and 
improve the junction formation. It leads to a partial 
consumption of the CdS layer; due to this consumption 
the spectral response in the short wavelength region is 
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CdTe Solar Cells. Figure 11 

Concentration profiles of sulfur across the CdS/CdTe interface for both HVE- and CSS-grown cells after deposition and 
after heat treatment with different amounts of CdCI 2 .1: As-deposited by HVE. 2,3,4:30 min treated at 430 °C with 400,150, 
and 70 nm CdCI 2 , respectively. 8,9: As-deposited by CSS and treated with 100 nm CdCI 2 for 15 min at 420 °C [97] 


enhanced (section “CdS Buffer and Alternatives”). 
However, if all CdS is consumed, the junction will be 
built between CdTe!_ x S x and the TCO, which has infe¬ 
rior junction properties compared to CdTe/CdS and 
yields lower V 0 c and fill factor. A complete CdS con¬ 
sumption can also occur locally, if the CdS layer thick¬ 
ness is too small due to nonuniformity or because the 
CdS layer recrystallizes nonuniformly. This results in 
many CdTe/CdS and CdTe/TCO junctions connected 
in parallel, also decreasing the V 0 c and FF of the cell. In 
order to minimize the effect of a reduced voltage with 
decreasing CdS thickness, a HRT layer is introduced 
between CdS and the TCO, as it builds a more effec¬ 
tive junction to CdTe than the TCO [25] (section 
“CdS Buffer and Alternatives”). A negative effect of 
CdS/CdTe intermixing is the decrease of the window 
layer bandgap by diffusion of Te into CdS. It shifts the 
absorption of the window layer to longer wavelengths. 
On the other hand, S diffusion into CdTe reduces the 
bandgap of CdTe, shifting the cutoff of the spectral 
response to a larger wavelength which is shown in 
EQE plot in Fig. 12. This yields a slightly higher quan¬ 
tum efficiency in the wavelength region close to the 
CdTe bandgap. 



CdTe Solar Cells. Figure 12 

Effect of variation of treatment temperature on the 
quantum efficiency. More CdS is consumed at higher 
treatment temperatures. The cutoff at the absorber band 
edge is slightly shifted to larger wavelengths for higher 
temperatures 

Electronic Effects The treatment has pronounced 
effect on the electrical properties of the CdTe layer 
causing conductivity type conversion from n- to 
p-type and further enhancing the p-type conductivity 
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of the CdTe layer. This is not obvious at first glance, as 
Cl Te is an n-dopant in single crystalline CdTe, but the 
Vcd-Clxe complex, the so-called A-center is a shallow 
acceptor [102, 103]. The p-type doping with Cl cannot 
be achieved in the presence of Cd overpressure, but 
only with excess Te [104]. This is achieved by the 
presence of oxygen during annealing. The presence of 
oxygen further increases the effective acceptor 
concentration in the annealed CdTe layer and the V 0 c 
of the finished cells [105]. These electronic changes 
might be explained by a surface-type conversion 
through the formation of a Cd-deficient surface layer 
by the preferential oxidation of Cd. Oxidation thus 
leads to deviation from stoichiometry and Cd- 
vacancies are generated below the surface, enhancing 
p-type conductivity [104, 106]. Additionally, 

isoelectronic oxygen can act as a shallow acceptor in 
CdTe [107]. The annealing treatment also leads to an 
efficient passivation of the grain boundaries, which 
reduces recombination at trapping states associated 
with the grain boundaries. However, several studies 
also found an increased number of deep defects 
after treatment, which might limit the performance 
[108-111]. 

Influence of Polycrystallinity The polycrystalline 
nature of the absorber influences the performance of 
a CdTe solar cell. Grain boundaries can induce a large 
number of trapping states [112], which can serve as 
recombination centers for the photogenerated charges, 
decreasing cell performance. Grain boundaries can also 
lead to local band bending, which builds a potential 
barrier and therefore hinders charge transport across 
grain boundaries [113]. However, polycrystalline CdTe 
solar cells exhibit a very good performance and even 
outperform its single crystalline counterpart [114]. The 
high performance of polycrystalline devices is possible 
because of an efficient passivation of the grain bound¬ 
aries after the absorber treatment. According to 
a model by Visoly-Fisher et al. [115] the better perfor¬ 
mance of polycrystalline devices can be explained 
by hole depletion along grain boundaries which 
occurs after the annealing treatment of the absorber. 
The hole depletion results in local downward band 
bending close to the grain boundaries, which assists 
in the separation of photogenerated charge carriers 
(Fig. 13). Holes are transported through the grain 



CdTe Solar Cells. Figure 13 

According to a model by Visoly-Fisher, grain boundaries 
assist in separation of generated electron hole pairs. Hole 
depletion along grain boundaries leads to band bending. 
Holes are drawn toward the grain boundaries and are 
transported along grain boundaries, while electrons are 
transported in the grain bulk (Redrawn from [114]) 

bulk to the back contact, while electrons are 
transported along the depleted grain boundaries to 
the junction. Due to this depletion the recombination 
of minority carriers is reduced. On the one hand, 
depleted grain boundaries enhance charge separation 
and transport but on the other hand the cell voltage is 
reduced as depleted grain boundaries ending at the 
junction are connected in parallel with the main junc¬ 
tion of the CdTe/CdS cell. 

CdTe Thickness Reduction Due to the high absorp¬ 
tion coefficient only 1 pm of CdTe is needed to convert 
92% of the sunlight with energies larger than the 
bandgap into electricity. Nevertheless, thicker CdTe 
(2-10 pm) is commonly employed to avoid pinholes 
and grain boundary recombination pathways originat¬ 
ing from weak control on film morphology, micro- 
structure, and device structure. Currently R&D efforts 
are being made to decrease the CdTe thickness without 
negative effect on the cell performance, as less material 
consumption would further decrease the cost of CdTe 
solar modules. Especially a decreased amount of Te due 
to thinner CdTe could enable an increase of the market 
share of CdTe modules [116]. By re-optimizing the 
post-deposition treatment and the back contact 
annealing and using a mild etchant for CdTe, it is 
possible to achieve comparable efficiencies with 
absorbers of about 1 pm thickness [106, 117]. 
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Doping of CdTe The short circuit current density in 
highly efficient CdTe solar cells is quite close to its 
theoretical limit (26 mA/cm 2 vs. 30 mA/cm 2 ), while 
the open circuit voltage is well below its theoretical 
limit. In spite of a comparable bandgap of CdTe and 
GaAs, the V 0 c of CdTe is about 200 mV lower than the 
V 0 c of single crystalline GaAs devices and also the FF is 
slightly lower. The low voltage of CdTe solar cells is 
caused by a rather low carrier lifetime of about 1 ns and 
low hole density of about 10 14 cm -3 [118]. A way to 
increase the open circuit voltage would be by increasing 
the shallow acceptor density. For this purpose doping 
of p-CdTe is essential. However, substitutional dop¬ 
ing of p-type II-VI semiconductors is generally 
challenging because of self-compensation [119, 120]. 
Self-compensation denotes the spontaneous formation 
of defects, which partly or completely compensate the 
doping of the intentionally incorporated dopants. 
Especially in large bandgap semiconductors the spon¬ 
taneous formation of compensating defects can be 
energetically favorable. These compensating defects 
can be spontaneously formed through native defects 
like vacancies, interstitials, antisites, or dopant-vacancy 
pairs. Also the added impurity can compensate itself if 
it has amphoteric behavior, i.e., its electrical activity 
depends on the position in the lattice. To overcome 
these problems, nonequilibrium techniques like low- 
temperature growth, ion-implantation, or quenching 
from high temperature can be used. However for solar 
cell manufacturing these methods are not attractive. 

Most II-VI semiconductors can be doped efficiently 
to either n- or p-type, depending on the covalent radius 
ratio of II- to Vl-atoms [119]. CdTe is an exception, as 
single crystals of CdTe can be doped to both p- and 
n-type although with limited maximum carrier con¬ 
centration. Moderate acceptor densities of about 
10 17 cm -3 have been achieved in single crystal CdTe 
by doping with group I impurities substituting Cd or 
group V impurities substituting Te. Also controlled 
deviation from stoichiometry to Te-rich CdTe 
can lead to p-conductivity by native defects like V C a 
[121-123]. In polycrystalline CdTe solar cells p-doping 
with hole densities of about 10 14 cm -3 is achieved by 
the absorber treatment in the presence of Cl and O 
(section “CdCl2 Annealing Treatment”). Also Cu, 
which is used in the back contact, can partly diffuse 
into CdTe and contribute to p-type doping [124, 125]. 


But Cu has a relatively large activation energy (Table 6) 
and is often associated with stability problems of CdTe 
solar cells (section “Electrical Back Contact”). Attempts 
to additionally dope the absorber have been less suc¬ 
cessful [135-137]. A reason might be a more pro¬ 
nounced self-compensation in polycrystalline material 
because of an increased number of defects due to grain 
boundaries. Not only are deliberately added impurities 
present in CdTe solar cells, but also impurities that 
originate from the source materials or from interdiffu- 
sion during processing steps from other layers or the 
substrate, especially in the case of high-temperature 
deposition methods. Many of these impurities are 
potentially active and can affect the photovoltaic prop¬ 
erties [138]. 

Electrical Back Contact 

One of the main challenges in CdTe solar cell technol¬ 
ogy is related to the difficulty of formation of an ohmic 
and stable electrical back contact to the CdTe layer. 
Because of a large electron affinity of 4.3 eV and a rela¬ 
tively large bandgap of 1.49 eV of CdTe, a metal with 
a work function of >5.8 eV would theoretically be 
needed to form an ohmic contact to p-CdTe (Fig. 14); 
however, such materials are rare. Additionally, surface 
states of CdTe cause Fermi level pinning and an abrupt 
metal contact will result in barrier heights above 0.8 eV, 
independent of the work function of the metal [50]. 
A barrier at the back contact can significantly affect the 
I-V curve by blocking the hole transport from CdTe to 
the metal. It results in a blocking behavior of the cur¬ 
rent at large positive bias, an effect commonly referred 
to as rollover. An I-V curve with rollover can be 
modeled by a series connection of two diodes with 
opposite polarity - the CdTe/CdS junction and the 
CdTe-metal back contact Schottky diode [139]. The 
rollover can cause a decrease of fill factor, which is 
significant for barrier heights above 0.5 eV [140]. 
Therefore alternative concepts to the abrupt CdTe- 
metal contact are being developed to form quasi- 
ohmic tunneling contacts. A common way is to 
implement a highly doped buffer layer between p- 
CdTe and the secondary metallic contact for reducing 
the width of the depletion layer. This leads to a very 
thin barrier of a few nanometers, allowing holes to 
tunnel through the barrier as shown in Fig. 14. 
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CdTe Solar Cells. Table 6 Donor and acceptor levels of different impurities, native defects, and impurity-native defect 
complexes in CdTe 



Acceptors 



Donors 



Element 

Lattice position 

Ea 

Reference 

Lattice position 

Ed 

Reference 

Li 

Lied 

57.8 

[126] 

Lij 

13.9 

[127] 

Na 

Na C d 

58.8 

[126] 




Cu 

Cu C d 

146 

[128] 




Ag 

Ag C d 

107.5 

[128] 




Au 

Au cd 

263 

[129] 




Al 




Alcd 

14.05 

[130] 

Ga 

Ga C d — V cd 

131 

[131] 

Ga cd 

13.88 

[130] 

In 

l n Cd — V cd 

142 

[131] 

ln C d 

14.15 

[130] 

N 

N Te 

56 

[132] 




P 

^Te 

68.2 

[132] 




As 

ASje 

92 

[132] 




Sb 

Sb Te 

280 

[133] 




Bi 

Bhe 

300 

[134] 




0 

0 Te 

46 

[107] 




Cl 

Che — Vcd 

125 

[131] 

CI Te 

14.48 

[130] 


V cd 

130 

[134] 





Tej 

570 

[134] 






CdTe Solar Cells. Figure 14 

Band diagram of the p-CdTe-back contact. Due to the high 
electron affinity of CdTe it forms a Schottky junction with 
metals. By increasing the CdTe acceptor density close to 
the metal or by introducing a thin buffer layer with high 
acceptor density, a thin quasi-ohmic tunneling contact can 
be formed 


A conventionally used buffer layer is Cu x Te since it 
has p-type conductivity with a high acceptor density. It 
can be produced by selectively etching the surface of 
CdTe, depositing a few nanometers of Cu and subse¬ 
quent annealing. The etching produces a Te-rich CdTe 
surface and can be obtained by using a diluted bromine 
methanol (BM) solution [141] or a mixture of nitric 
and phosphoric acid (NP) [142]. The BM etch is 
a milder etchant and it affects the morphology of 
CdTe less than the NP etch. Figure 15 shows that the 
NP etching smoothens the surface and widens grain 
boundaries (GBs). Preferential etching along GBs can 
produce shunting pathways as depicted in Fig. 16. 
Figure 17 shows the evolution of a Te layer with increas¬ 
ing etching time with a NP etch as determined by X-ray 
diffraction (XRD). Cu can be added by evaporation or 
by applying a Cu doped paste, which itself serves as the 
secondary contact. Alternatively, Au can be used as the 
secondary contact but it can diffuse with time and 
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CdTe Solar Cells. Figure 15 

SEM images of the CdTe morphology after etching. The BM etch {left) affects the morphology less than the NP etch {right) 
which smoothens the surface and widens GBs [143] 



CdTe Solar Cells. Figure 16 

Schematics of a CdTe solar cell with shunting paths along 
grain boundaries. Overetching can lead to strong etching 
along grain boundaries, leaving conducting Te-rich CdTe 
all along from the CdTe top surface to the CdS interface 



XRD angle 20 [°] 


CdTe Solar Cells. Figure 17 

XRD patterns of NP-etched CdTe with different etching 
times. Etching produces a Te-rich layer [144] 


cause efficiency degradation. Therefore other metals, 
e.g., Mo, are applied. Cu x Te-based contacts form low 
resistive quasi-ohmic contacts, but diffusion of Cu 
preferably along the grain boundaries in CdTe can 
lead to a fast degradation of the cells. This makes the 
contact suitable for research purposes but not for the 
use in commercial products. However, efforts have 
been made do develop stable contacts by carefully opti¬ 
mizing the thickness and composition of Cu x Te for 
long-term stable performance. Promising alternative 
buffer layers to Cu x Te have been developed. These 
buffer layers include Sb 2 Te 3 [145], Sb [144], ZnTe 
[146], As 2 Te 3 [94], HgTe [147], and Ni:P [148]. 

Sputtered Sb 2 Te 3 /Mo have been applied to CdTe 
solar cells and efficiencies up to 15.8% [149] have 
been achieved. The use of electrodeposited Sb 2 Te 3 as 
back contact buffer layer has also yielded reasonable 
efficiencies above 11% [150]. In spite of these convinc¬ 
ing efficiencies, the band alignment is claimed to be 
rather unfavorable [151]. Figure 18 illustrates that 
CdTe solar cells with Sb- or Sb 2 Te 3 -buffer and Mo 
metal layer have proven to show stable performance 
for a long period of time [143, 152]. 

A negligible valence band offset was measured 
between CdTe and ZnTe [153, 154], which results in 
a good contact between these two semiconductors. 
A highly p-type doped ZnTe forms a tunneling contact 
to metals like Ti. A high doping density of more than 
10 20 cm -3 can be achieved by doping with Cu [155]. 
With ZnTe.Cu based back contact, solar cells with effi¬ 
ciency of 12.9% have been produced [146, 156]. 

ITO is also well suited for the use as contact to 
etched p-CdTe [157]. Due to its transparency it can 
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CdTe Solar Cells. Figure 18 

Stability of CdTe solar cells with different back contacts under light soaking at elevated temperatures. Sb 2 Te 3 /Mo and 
Sb/Mo back contacts show stable performance over a very long period of time [143] 


be applied for use in bifacial CdTe solar cells (see also 
Alternative Device Structures). 

The performance stability problem of CdTe solar 
cells is generally connected to the back contact and 
especially to the diffusion of elements that form the 
back contact including the buffer layer. A comprehen¬ 
sive review on the stability of CdTe cells can be found in 
[158]. On the one hand, Cu helps creating an ohmic 
back contact and can increase the acceptor density in 
CdTe, but on the other hand it is a mobile element in 
CdTe. Excess Cu can easily diffuse along grain bound¬ 
aries and accumulate at the junction and in the CdS, as 
found by SIMS measurements [143]. This results in 
deterioration of the solar cell performance. Degrada¬ 
tion is commonly observed by the development of 
a rollover of the IV-curve at high forward bias and 
a decrease of V 0 c and FF. The short circuit current 
density usually stays unaffected [158]. Both the CdCl 2 
treatment [159] as well as the etch process [144] have 
influences on the stability of the cells in addition to the 
choice of the metal and buffer materials. 

Alternative Device Structures 

The conventionally used cell design has shown good 
performance, but also performance limitations as the 
highest efficiency of 16.5% is still below the theoretical 
limit of 30%. Therefore alternative device configura¬ 
tions that can possibly enhance the efficiency of CdTe 
solar cells should be considered. 


Although highest efficiency CdTe solar cells are 
grown in superstrate configuration, cells in substrate 
configuration are attractive for reducing the losses in 
the window layer. Such configuration is currently 
reconsidered because of its potential use on flexible 
substrates. It would enable the use of flexible metal 
foils and solve the problem of parasitic light absorption 
of superstrate CdTe solar cells on polyimide foils. In 
order to achieve highly efficient CdTe solar cells in 
substrate configuration, the problems of forming an 
efficient ohmic contact to CdTe which withstands the 
subsequent high-processing temperatures remains a 
challenging issue. 

a-Si solar cells are commonly grown in a p-i-n struc¬ 
ture in order to reduce recombination. A similar concept 
might also be applied to CdTe solar cells, where CdS is 
used as n-layer, CdTe as intrinsic layer, and a highly 
doped p-type semiconductor-like ZnTe as p-layer and 
electron reflector. This would lead to higher open circuit 
voltage without the need of an increased doping accep¬ 
tor concentration in CdTe [118, 160]. At the same time 
this would enable the use of thinner absorber layers. 

Bifacial CdTe solar cells with a transparent back 
contact have been developed by using ITO as 
a contact to CdTe [157]. This structure enables illumi¬ 
nation from both sides and can be used for the appli¬ 
cation in tandem solar cells. A CdTe solar cell with 
transparent back contact and 2.5 pm absorber thick¬ 
ness yielded a stable efficiency of 10.3% for front-side 
illumination and 2.1% in back-side illumination [161]. 
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By reducing the CdTe thickness to 1 pm and adapting 
the CdCl 2 treatment, the performance of back¬ 
side illumination could be increased to 3.2%, still 
maintaining a relatively high efficiency of 8% for 
front-side illumination (Fig. 19). 

Environmental and Material Resource Issues 

Environmental, Health, and Safety Issues 

The CdTe thin film technology has demonstrated excel¬ 
lent potential for cost-effective production of solar 
electricity. The energy payback time of CdTe modules 
compared to other PV technologies is significantly 
lower [162-164]. The environmental, health hazard, 
and safety (EH&S) issues of CdTe solar modules have 
been extensively investigated by several independent 
agencies in the USA and Europe (e.g., Brookhaven 
National Laboratory, USA, Fraunhofer Institut fuer 
Festkoerpertechnolgie, Germany, etc.). The studies 
prove that the Cd emission during the total life cycle 
of a CdTe module is lower compared to other PV and 
conventional energy production technologies [165]. 
Despite this scientific fact and the technological proofs, 
the CdTe PV technology has suffered from the percep¬ 
tion of toxicity of elemental Cd, even though it is used 
in the form of a stable compound in thin film modules. 


Referring to the perception and concerns on Cd 
issues, the studies of V. Fthenakis (Sr. Chemical Engi¬ 
neer, Brookhaven National Laboratory) and K. Zweibel 
(Former Manager, Thin Film Partnership Program, 
NREL, USA) confirm that CdTe modules would be 
almost benign with zero emission, without associated 
health hazards and they gave a clean chit to CdTe 
technology. 

A peer review, organized by the European Commis- 
sion’s Joint Research Center and moderated by the 
German Federal Ministry of Environment (Berlin 
2005), has studied the major published data on the 
environmental profile of CdTe PV provided by 
the leading institutes and companies. A summary of 
the outcome is given below: 

• Cd is produced as a by-product of Zn and can either 
be put to beneficial uses or discharged to the envi¬ 
ronment posing another risk. 

• CdTe used in PV is a chemically stable compound 
that does not leak into the environment during 
normal use or foreseeable accidents. 

• Air emissions of Cd from the whole life cycle of 
CdTe PV (mining to end of life) are 100-360 times 
lower than Cd emitted into air from coal and oil 
power plants that PV displaces. 


c 



CdTe Solar Cells. Figure 19 

Performance of a CdTe solar cell with transparent back contact and a CdTe thickness of 1 pm upon front- and back-side 
illumination 
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• Absolutely no pollutant emission during normal 
operation. “Every energy source or product may 
present some direct or indirect environmental 
health and safety hazards but the CdTe industry is 
proactive in controlling them, and these issues 
should by no means be considered a problem for 
CdTe PV modules .” 

• CdTe modules have shorter energy payback time 
and lower life-cycle emission of C0 2 , S0 2 , and 
NO x than all other PV technologies. 

• A low production cost technology like CdTe will 
accelerate the growth of the share of all PV technol¬ 
ogies in the energy market. 

The conclusion of the peer review was that the 
environmental risks of CdTe PV are minimal if mate¬ 
rials are recycled and/or end-of-life systems and poli¬ 
cies are in place. 

Reduction of CdTe layer thickness and increase of 
efficiency with EH&S conform processes are expected 
to have a major impact on the life cycle of CdTe PV, 
although the significant cost saving may not be felt at 
production capacities below 50 MWp. In state-of-the- 
art CdTe modules (average CdTe layer thickness of 
4 pm) the amount of Cd per unit area is approximately 
12 g/m 2 . Reducing the CdTe layer thickness toward 
1 pm will reduce the Cd amount per unit area to 
3 g/m 2 . Assuming a module efficiency of 10%, an 
average annual energy yield of 900 kWh/kWp, and 
a lifetime of 25 years, the energy produced per unit 
weight of Cd will increase with reduced CdTe layer 
thickness from about 190 to 770 kWh/g. It has to be 
emphasized that there is neither consumption nor 
emission of Cd or Te during production and operation 
of the solar modules. At the end of life the material is 
still inside the module and can be recycled, which closes 
the chain of the raw material. 

Resources of Raw Materials 

Among the required elements for the production of 
CdTe thin solar modules tellurium is the least abun¬ 
dant. Tellurium is a scarce material that is expected to 
have a share of 0.01 ppm in the earth’s crust. The 
majority (above 90%) of its current production is 
stemming from anode slimes collected from electro¬ 
lytic copper refining [166]. From this source about 
430-500 metric tons of tellurium are available per 


year [167, 168]. The refined tellurium is consumed as 
a metallurgical alloying element, as catalyst in chemis¬ 
try, in blasting caps, and as pigment for blue and brown 
colors in ceramics. Purified tellurium is used in thermal 
imaging, thermoelectric devices, phase change memo¬ 
ries, and CdTe thin film solar cells [169]. The demand 
of tellurium in PV applications has increased signifi¬ 
cantly in the past decade. Ken Zweibel estimated the 
consumption of tellurium for 1 GWp of state-of-the- 
art CdTe module technology being 91 metric tons 
[116]. This means that First Solar Inc. would consume 
about 25-30% of the annual tellurium production per 
year (assuming that the factories are running at the 
announced production capacity). With increasing con¬ 
cern in tellurium supply on the one hand other sources 
such as gold tellurium, lead-zinc ores with higher con¬ 
centrations of tellurium, or bismuth telluride ores are 
investigated and an increased annual production is 
expected [116, 166]. On the other hand intensive 
R8cD effort is expected to reduce the tellurium demand 
per output power (Wp) in CdTe solar modules by 
increased module efficiency and reduced absorber 
layer thickness. Such a scenario with increased module 
efficiency to 15% and reduced absorber thickness to 
0.67 pm was presented by Ken Zweibel leading to 
a reduction of tellurium consumption to 15 metric 
tons per GWp [116]. Zweibel further calculated that 
with the mentioned technological improvement and 
today’s annual tellurium production about 6-7% of 
the global electricity demand can be covered by CdTe 
solar modules in 2030. This is a significant contribution 
and justifies further financial and R&D investments in 
this technology. 

Further Directions 

Even though the CdTe solar technology has already 
been successfully transferred from the research labora¬ 
tory to the industrial production, there are still chal¬ 
lenges and major problems to be solved for further 
efficiency improvement, cost reduction, as well as 
novel applications. An important point of consider¬ 
ation is the large difference between the theoretically 
achievable efficiency of close to 30% and the practically 
achieved 16.5% up to now. Among the open problems 
are (1) increasing the transmittance of the substrate- 
TCO stack while maintaining high conductivity using 



simple processing techniques; (2) reducing the CdS 
thickness well below 100 nm or using alternative mate¬ 
rial of higher bandgap and/or lower absorption for 
minimization of absorption loss, without adversely 
affecting the fill factor and cell voltage; (3) reducing 
the self-compensation effect in the polycrystalline 
CdTe material to increase the net acceptor density; 
(4) increasing the raw material utilization by reducing 
the CdTe layer thickness and applying optical confine¬ 
ment effects as well as by alloying CdTe with Se in order 
to reduce the Te consumption; (5) forming a low 
resistive ohmic and stable back contact onto p-type 
CdTe by an all-vacuum deposition process; (6) devel¬ 
oping highly efficient CdTe solar cells in substrate or 
p-i-n configurations; (7) developing low-temperature 
processes; (8) developing low-cost, large area scalable 
processes of high material utilization; and (9) adapting 
roll-to-roll processing. Also important for technologi¬ 
cal and production cost reduction point of view is to 
further enhance the solar module efficiency thereby 
closing the gap between highest cell and module effi¬ 
ciencies. R&D to overcome these challenges will help to 
further increase the conversion efficiency and to reduce 
the production costs. 


Abbreviations 

ALD Atomic layer deposition 

APVTD Atmospheric pressure vapor transport 
deposition 

AR Antireflection 

ASG Aluminosilicate glass 

AZO Aluminum doped zinc oxide 

BI Battelle-Institute e.V 

BM Bromine methanol 

BSG Borosilicate glass 

CBD Chemical bath deposition 

CIGS Copper indium gallium diselenide 

CNRS French National Center for Scientific 

Research 

CSS Close space sublimation 

CTO Cadmium stannate 

CVD Chemical vapor deposition 

ED Electrodeposition 

EH&S Environmental, health hazard, and safety 
issues 

EQE External quantum efficiency 
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ETHZ 

Swiss Federal Institute of Technology Zurich 

FF 

Fill factor 

FTO 

Fluorine doped tin oxide 

GBs 

Grain boundaries 

HRT 

Highly resistive and transparent oxide 

HTVPD 

High-temperature vapor phase deposition 

HVE 

High vacuum deposition 

IEC 

Institute for Energy Conversion, University 
of Delaware 

IQE 

Internal quantum efficiency 

ITO 

Tin doped indium oxide 

Jsc 

Short circuit current density 

NP 

Nitric-phosphoric acid 

NREL 

National renewable energy laboratory 

PI 

Polyimide 

PV 

Photovoltaic 

R 

Reflectance 

R&D 

Research and Development 

SEM 

Scanning electron microscopy 

SLG 

Soda-lime glass 

STC 

Standard test condition 

T 

Transmittance 

TCO 

Transparent conductive oxide 

UNAM 

Centro de Investigacin en Energa, Solar 
Materials Department, Temixco, Morelos 62 

Voc 

Open circuit voltage 

VTD 

Vapor transport deposition 

XRD 

X-ray diffraction 

ZTO 

Zinc stannate 
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Glossary 

One of the most useful ways to begin a study of the 
Comprehensive Environmental Response, Com¬ 
pensation, and Liability Act (CERCLA) is to engage 
the intriguing lexicon the act provides. As is dem¬ 
onstrated by the definitions below, terms are often 
defined under CERCLA in ways that are remarkably 
broad, granting a great deal of elbow room and 
flexibility for the US Environmental Protection 
Agency (EPA) to go about the work of protecting 
the public from hazardous materials, no matter 
when or how they have come to present a threat. 
Environment Under CERCLA, the term “environ¬ 
ment” is defined broadly in a manner that is 
distinctly multimedia. An environment is anywhere 
or anything that can become contaminated with 
a hazardous substance, including all surface and 
ground waters, land surfaces or subsurface strata, 
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or the ambient atmosphere. Any release of a haz¬ 
ardous material into any sort of environment is 
potentially covered under CERCLA. 

Facility The term “facility” usually brings to mind an 
image of a building or series of buildings, but 
CERCLA defines the term more broadly. A “facility” 
under CERLCA includes any site or area where 
a hazardous material has been deposited, placed, 
stored, disposed of, or has somehow come to be 
located. It is just as likely to refer to a faulty storage 
tank or a leaking pipe as to a building or similar 
structure. 

Hazardous substance In defining hazardous sub¬ 
stances, CERCLA makes use of all other definitions 
and lists of such substances in previous US envi¬ 
ronmental legislation. However, there is one impor¬ 
tant exception: petroleum and petroleum products, 
natural gas and synthetic gas used for fuel. This is 
the so-called Petroleum Exception. Interestingly, 
the definition of a hazardous substance does not 
refer to any particular or minimum amount of 
a material that must be present for it to become 
hazardous. In theory, there is no lower limit of 
materials that must be present or must have been 
released before a remedial action is authorized, or 
liability is imposed. 

Contaminant or pollutant CERCLA also authorizes 
cleanups of categories of substances that are 
broader than materials judged to be hazardous 
according to the standards of earlier federal regula¬ 
tions. Contaminants or pollutants, under CERCLA, 
are materials that may cause adverse health effects. 
While CERCLA allows the EPA to respond to sites 
polluted by these lesser threats, it authorizes recov¬ 
ery of cleanup costs from responsible parties only in 
cases of contamination by a documented hazardous 
substance. 

Release Again, a release of a hazardous substance is 
broadly defined. CERCLA authorizes response to 
any sort of a release, including all manner of spills, 
leaks, and discharges into the air, as well as leaching, 
dumping, injecting, emitting, or disposing. Certain 
exceptions are made, generally for releases that are 
covered under other federal laws. As an example, 
motor vehicle emissions are excluded from 
CERCLA but are covered by federal emission stan¬ 
dards and other regulations. 


Removal Removal is also broadly defined, creating 
more potential opportunities to assess liability. 
Removal may refer to certain actions directly 
involved in cleanup of a site, such as removing the 
hazardous material from the area, or transporting 
waste to an authorized disposal facility. However, it 
may also refer to actions such as monitoring, 
assessing, and evaluating releases or potential 
releases. This allows the EPA to take preventive 
action to prevent a potential release. 

Remedy or remedial action As in the case of removal, 
the construction of the terms “remedy” or “reme¬ 
dial action” in CERCLA allows for response to both 
actual or potential or threatened releases. The terms 
are also broadly construed to allow for a wide range 
of response actions, including public health or envi¬ 
ronmental monitoring, transportation of hazard¬ 
ous materials, implementation of an alternative 
water supply, and even removal or relocation of 
communities to safe locations. 

Potentially responsible party or PRP Remedial 
actions that occur through CERCLA are often 
conducted not by the EPA directly, but by 
parties that are judged to be in some way accounle 
for the release. These parties, known as Potentially 
Responsible Parties, or PRPs, fall into four catego¬ 
ries: owners, operators, arrangers or generators, 
and transporters. For more information on these 
categories and their responsibilities and liabil¬ 
ities under CERCLA, see section “Liability Scheme” 
below. 

CERCLA Is also a distinctly creative act, which has also 
introduced new entities and terms into the knowl¬ 
edge base of environmentally conscious people, 
those affected by CERCLA actions and even 
CERCLA’s critics. 

The Superfund The Superfund has become a common 
name for all of CERCLA, but it also refers to the 
Hazardous Substance Superfund, established to 
provide financial resources to clean up hazardous 
sites. The Superfund was established by taxes 
imposed upon the chemical and petroleum indus¬ 
tries and an environmental tax on corporations. It 
has since been supported by allocations from the 
federal general fund and through additional legis¬ 
lation, such as the American Recovery and Rein¬ 
vestment Act. 
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The National Priority List or NPL The NPL was 

added to an existing resource, the National Contin¬ 
gency Plan (NCP), which was brought into being 
under the Clean Water Act to address the contain¬ 
ment and removal of spilled oil and other hazard¬ 
ous substances. Under CERCLA, the NCP was 
revised to include direct provisions for responding 
to releases of hazardous substances, pollutants, and 
contaminants, including those released on land or 
in the air. CERCLA also revised the NCP to include 
a list of sites that were considered to be top prior¬ 
ities for response. This is the National Priority 
List, or NPL. Criteria for placement on the NPL 
are discussed in section “NCP, NPL, and Site 
Procedures” below. 

Finally, CERLA brought into being a new federal 
agency The Agency for Toxic Substances and 
Disease Registry or ATSDR. ATSDR is charged 
under CERCLA with assessing the presence and 
nature of health hazards at specific Superfund 
sites, as well as working to prevent or reduce further 
exposure and the illnesses that result from expo¬ 
sures. It compiles and maintains a prioritized list 
of hazardous materials at Superfund sites, the 
CERCLA Priority List of Hazardous Substances. 
ATSDR also has a range of other duties, including 
health surveillance and the development and distri¬ 
bution of information on toxic substances. For 
more information, see section “ATSDR and the 
CERCLA List” below. 

Definition of the Subject 

The Comprehensive Environmental Responses, Com¬ 
pensation, and Liability Act of 1980, or CERCLA, 
also known as the Superfund Act, is a landmark 
piece of environmental legislation [1]. Its notably 
bold design, discussed in detail below, fits the daunting 
work it was established to perform, the remediation of 
the worst and most dangerous releases of hazardous 
materials in the history of the USA. CERCLA also plays 
a role in the prevention of threatened releases. While 
it is subject to criticism and may not function per¬ 
fectly, or even quite as intended, CERCLA remains an 
essential safeguard of environmental and public health 
in the USA. 


Introduction to CERCLA 

Sustainability may be defined as the capacity to endure. 
The achievement of sustainability suggests a relationship 
to time, in which the past is never so bad as to doom the 
present or erase the future. CERCLA is unique among the 
central pieces of environmental legislation in the USA in 
that it is largely oriented to the past, and ensuring that the 
legacy of environmental destruction that accompanied 
the growth of industrialism in the USA does not continue 
to jeopardize the future of the nation, its people, or the 
environment. As a practical measure, CERCLA casts envi¬ 
ronmental disasters and hazardous releases as problems 
that can be solved. These problems, however, are often of 
a scope and gravity that defy inexpensive or easily realized 
solutions. This is why CERCLA operates not only through 
the use of a public “Superfund” but also through partner¬ 
ships, sometimes coerced, with other parties, who are 
held to be responsible for hazardous releases, even those 
that occurred many years ago. 

Early in 2010, approximately 1,280 sites were listed 
on the National Priority List, either in the midst of 
a CERCLA remediation or awaiting one. An additional 
340 sites had been delisted in the history of CERCLA 
since 1980, with more than 60 sites under consideration 
to be added to the list [2]. However, it was only 
a small group of well-publicized releases of hazardous 
substances that originally inspired CERCLA. 

CERLCA was enacted by Congress in response to 
environmental and public health disasters, including, 
inter alia: (a) the environmental contamination at a site 
in Kentucky that came to be known as “The Valley of 
the Drums”; a site of dioxin contamination in Times 
Beach, Missouri, that eventually caused the abandon¬ 
ment of the town; and, most notably, (b) the Love 
Canal contamination near Niagara Falls, New York. 

The Love Canal incident is one of the more famous 
environmental disasters to occur in the USA. In the 
summer of 1977, 200 families were evacuated from the 
area of Love Canal when waste chemicals began to seep 
into the basements of homes in the community. In 
1980, the US government filed suit, under the Resource 
Conservation and Recovery Act and the Safe Drinking 
Water Act, against the company that had dumped tens 
of thousands of tons of hazardous chemicals at the Love 
Canal site and then abandoned it, years before. 
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CERCLA was designed to address cases such as Love 
Canal, in which the past intervenes into the present, 
clouding the future of an area and its people. As noted 
above, CERCLA is, in large measure, retroactive in 
nature, allowing the government to go back in time to 
find responsible parties for current environmental 
contaminations. In 1980, during the closing days of 
the 96th Congress, the original CERCLA legislation 
passed. It was signed by President Jimmy Carter as 
one of his last official acts in office. 

This entry will review CERCLA, including its initial 
provisions and later revisions. It will focus on issues 
that include liability under CERCLA, defenses against 
liability, site selection including technical review of 
sites, and the CERCLA Priority List of Hazardous 
Materials under ATSDR. 

Understanding CERCLA 

Original CERCLA and Basic Elements 

The original CERCLA legislation included four basic 
elements: 

1. A system for gathering and analyzing information 
concerning potential or actual releases of hazardous 
materials, pollutants, or contaminants 

2. Lederal authority to respond to control and clean 
up releases of hazardous materials, pollutants, or 
contaminants 

3. The Hazardous Substance Response Trust Lund, or 
Superfund, to underwrite response and cleanup 
activities 

4. A strict liability scheme for those judged to be 
responsible for releases of hazardous substances 

In the matter of notification of a release, CERCLA 
functions in concert with other regulations, including 
the Emergency Planning and Community Right to 
Know Act, in requiring any person in charge of a facility 
(as uniquely defined under CERCLA as noted above) 
who has knowledge of a release of a hazardous sub¬ 
stance to notify the National Response Center, a federal 
clearinghouse for hazard and public and environmen¬ 
tal health information, as well as other appropriate 
state, federal, or local agencies. 

CERCLA permits the EPA to request from any 
individual or party information deemed relevant to 


a release of hazardous materials. As with many federal 
authorities under CERCLA, this power is broadly 
construed. As an example, the EPA can obtain financial 
information to discover a party’s ability to pay for the 
costs of a cleanup, as well as information that more 
directly relates to the handling of hazardous materials 
or the cause or scope of the release [3]. 

The second basic authority under CERCLA is the 
authority to clean up a site at which a release has 
occurred or is occurring. CERCLA authorizes two dif¬ 
ferent types of response actions to releases or threat¬ 
ened releases, removal actions, and remedial actions: 

• Removal actions are short-term response actions 
for cases in which releases or threatened releases 
require prompt response. These actions may be 
emergency actions, time-critical actions, or non¬ 
time-critical actions. Removal responses are gener¬ 
ally used to address localized risks; for instance, the 
removal of drums containing hazardous materials 
that have been found on a site may be necessary to 
reduce or eliminate a contamination and mitigate 
risks posed to public health. Removal may also 
prevent contamination from compromising other 
elements of the environment; for example, topsoil 
removal may prevent further contamination of the 
water supply. 

• Remedial actions are generally longer-term actions, 
designed to permanently reduce the risks associated 
with a release or threatened release. While these 
actions may have some urgency, they may constitute 
a long-range project rather than a time-intensive 
emergency response. For instance, a remedial action 
may involve taking actions to contain a hazardous 
substance or neutralize a toxin. Only sites listed on 
the National Priority List (NPL) are eligible for 
remedial action under CERCLA. 

On some occasions, the type of response selected 
under CERCLA is subject to controversy: As one exam¬ 
ple, see the case of the Agriculture Street Landfill in 
New Orleans as discussed below, where a partial 
removal action may not have been sufficient to reme¬ 
diate a complex site. 

CERCLA also created the Superfund, and is some¬ 
times known as “The Superfund Act.” This fund sup¬ 
ports response, removal, and remediation activities. 
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However, it is not the only source of funds for these 
activities. Despite its nickname, the Superfund lacks 
sufficient funds to clean up all the sites on the National 
Priority List, or even those that are being cleaned up at 
any particular time. Instead, the preferred federal strat¬ 
egy is to conserve Superfund resources and compel or 
persuade other parties to clean up a site themselves. 
These additional parties, known as “Potentially Respon¬ 
sible Parties” or PRPs, are held responsible under the 
CERCLA liability scheme. These parties may also sue or 
reach agreements with other PRPs concerning which 
parties are responsible for the cleanup and in what 
proportion. See section “Liability Scheme” below. 

SARA Revisions of CERCLA 

In 1986, CERCLA was revised by the passage of the 
Superfund Amendments and Reauthorization Act, or 
SARA [4]. SARA added standards for an acceptable 
cleanup, provisions concerning the legal settlement 
process, and rights for state and citizens including 
some of those listed under Citizen Roles above. 

Before SARA, CERCLA did not include any partic¬ 
ular standards regarding cleanup operations. SARA 
provides guidelines on matters such as the type of 
cleanup action, compliance with the National Contin¬ 
gency Plan, and cost-effectiveness. SARA also provides 
criteria for assessing proposed remedies. These are 
discussed in section “NCP, NPL, and Site Procedures” 
below. 

SARA also engages the issue of how to promote 
voluntary settlements EPA claims against PRPs. SARA 
provides substantial guidance on issues such as contri¬ 
butions among PRPs, how to address shares of 
unknown PRPs, and what to do about PRPs who are 
responsible only for very small quantities of hazardous 
material. SARA also added the “Innocent Landowner” 
defense for potential PRPs, which is discussed under 
Liability Scheme below. 

CERCLA was also revised by the Small Business 
Liability Relief and Brownfields Revitalization Act of 
2002 [5]. Some of the changes made by this act are 
noted throughout the entry below. 

Citizen Roles 

SARA also established a series of important roles for 
citizens to play in legal actions under CERCLA. First, 


citizens or citizens groups affected by a Superfund site 
may apply for a Technical Assistance Grant. These 
grants are used by citizens to hire their own indepen¬ 
dent technical consultants to advise them about health 
and environmental issues and monitor the work of the 
EPA and others regarding the site. 

Citizens may sue any party, except the EPA, that is 
alleged not to be in compliance with any order, stan¬ 
dard, condition, or requirement that is in force under 
CERCLA. The EPA may choose to intervene in the suit 
and prosecute the alleged offender. In such a case, the 
citizen suit is barred. 

Citizens may also petition for a preliminary assess¬ 
ment at a site if that site is not scheduled for response 
actions. This method maybe useful in making a certain 
site a higher priority for a remediation [3]. Another 
central right reserved for citizens is the right to com¬ 
ment. For instance, citizens can comment on any set¬ 
tlement agreements or consent decrees prior to judicial 
approval. 

In addition, citizens have the right to know about 
the chemical and substance hazards they may face. 
Specifically, this requirement includes emergency noti¬ 
fication of chemical releases; notification of chemical 
use, storage, and production activities in a community; 
emission reporting requirements; state and local emer¬ 
gency planning, and the development of emergency 
response plans. 

NCP, NPL, and Site Procedures 

As mentioned in the Glossary above, CERCLA substan¬ 
tially revised the National Oil and Hazardous Sub¬ 
stances Pollution Contingency Plan (NCP). The NCP 
provides guidelines and procedures governing the 
response to releases or threatened releases of hazard¬ 
ous substances, pollutants, or contaminants. Under 
CERCLA, NCP criteria for the remediation of a 
hazardous release include a preference for permanent 
solutions, as well as cost-effectiveness and a practical 
ability to implement the solution. 

Under CERCLA, the NCP was also revised to 
include a National Priority List (NPL), which serves 
as an information and management tool to help the 
EPA prioritize sites for cleanup. Inclusion of a site on 
the NPL does not in itself assign liability or require 
potentially liable parties to initiate action to clean up 
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the site. The NPL serves primarily informational pur¬ 
poses, identifying for the states and the public those 
sites, facilities, or releases that appear to warrant reme¬ 
dial actions. 

Upon notification of a potentially hazardous 
release, or when a potentially hazardous site or release 
is otherwise discovered, the site is listed in the Com¬ 
prehensive Environmental Response and Liability 
Information System (CERCLIS) for subsequent evalu¬ 
ation. EPA then assembles information on the site and 
conducts a Preliminary Assessment/Site Inspection 
(PA/SI) to determine the scope of any contamination 
or potential contamination. This process involves 
a review of records regarding the site and any hazard¬ 
ous substances associated with it, interviews of poten¬ 
tially knowledgeable parties, visual inspections by 
qualified personnel, and limited field sampling. The 
PA recommends further investigation, if warranted. 

If the site is thought to present a threat to public or 
environmental health or safety, additional investiga¬ 
tions are performed concerning the hazardous mate¬ 
rials at the site, their potential for contacting humans, 
and possible migration pathways for spreading 
contamination. Information gathered in the investiga¬ 
tive review process is then subject to a hazard analysis, 
using criteria that include the toxicity of hazardous 
substances, the location of possible receptors, and 
threats to the human food chain or the watershed 
or atmosphere. This analysis culminates in the devel¬ 
opment of a score on the Hazard Ranking System 
(HRS). A HRS score above 28.5 results in placement 
of the site on the NPL. 

Sites that score at or above the threshold are then 
subject to a Remedial Investigation (RI) and Feasibility 
Study (FS). The RI includes an extensive sampling 
program and risk assessment in order to define the 
extent and degree of the contamination and the asso¬ 
ciated risks. It is designed to be a thorough analysis but 
can take years to complete. Typically, reports are com¬ 
piled that describe the site’s geology and hydrogeology, 
the sources of contamination, the type and degree of 
mobility of the contaminants, and all aspects of the 
threat to the public health or the environment. Any 
part of these data is potentially useful in developing and 
studying possible methods of remediation. 

The next step is the FS, which is used to develop 
and evaluate various remediation alternatives. The FS 


details several remedial methods and analyzes each in 
light of the data from the RI. The work is often broken 
down into phased components, called Operable Units, 
to separate the remediation process, and often the 
physical or geographic area of the contamination, 
into more manageable units [6]. 

The preferred alternative identified by the FS is 
presented in a proposed plan for public review and 
comment. If the EPA stays with the proposed solution 
after this comment period, the selected alternative is 
approved in a Record of Decision (ROD). The site then 
enters into a Remedial Design phase and then the 
Remedial Action phase. Many sites require Long- 
Term Monitoring and 5-year reviews once the Reme¬ 
dial Action has been completed. If a site is not consid¬ 
ered sufficiently hazardous for inclusion on the NPL, 
the EPA may still order a removal action to address an 
imminent threat. 

EPA’s nationwide strategic plan for addressing 
hazardous sites is found in another document, the 
Superfund Comprehensive Accomplishment Plan 
(SCAP). This document provides a list of those activ¬ 
ities that are expected to occur at each site during 
a fiscal quarter. Information often appears in the 
SCAP before it is announced to the public [6]. 

Remediation Priorities and ARARs 

As the EPA works toward a feasible solution for a site, 
various factors may become important. One is the level 
or degree of cleanup necessary. The central issue here is 
often what level of risk or remaining contamination is 
acceptable. As the EPA generally works with PRPs with 
a financial stake in the cleanup process, levels of reme¬ 
diation often become controversial. See the section on 
the “Agriculture Street Landfill site” for an example of 
a case where no real agreement was reached between 
affected citizens, the EPA and the PRPs, resulting in 
a lengthy court battle. 

Section 121 of CERCLA is especially significant 
with respect to decisions about methods of remedia¬ 
tion as it sets forth requirements for cleanup standards 
[7]. Remedial actions under CERCLA are consistent 
with the precepts of the NCP when possible. This 
section states that actions should be cost-effective. 
The guidelines also state a preference for permanent 
solutions that reduce the volume, toxicity, or mobility 
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of the contamination. Less permanent solutions, such 
as the construction of barriers, are not favored. 

For hazardous materials left on site, remedies 
under CERCLA are required to achieve all Applicable 
or Relevant and Appropriate Recommendations 
(ARARs). The level or degree of the hazard must 
remain less than any standard, requirement, criterion, 
or limit under any federal environmental law or any 
state or facility siting law that is more stringent than the 
federal guidelines. This means that the site must 
achieve the cleanup level (or safety level) set by the 
most stringent laws that are applicable. The NCP 
includes criteria for determining whether a law or stan¬ 
dard applies. Laws or standards may not apply, for 
instance, if they target a certain medium, such as 
water, and the hazardous material at the site is found 
in a different medium, such as soil. Other differences 
include (a) the activities regulated by the regulation 
versus those undertaken at the site and (b) the type of 
facility considered in the regulation versus that at 
which the proposed remedial action is to take place. 

Once determined to apply, an ARAR generally must 
be met unless it is waived. CERCLA does provide a few 
circumstances in which a remedy may be selected that 
does not meet an ARAR, for instance, if the remedy is 
only part of a larger solution that will meet the ARAR 
or if meeting the ARAR would lead to greater risks to 
human health or the environment than would alterna¬ 
tive actions [6]. 

ATSDR and the CERCLA List 

Another important aspect of site selection and devel¬ 
opment of remedial responses is the CERCLA Priority 
List of Hazardous Substances maintained by the 
Agency for Toxic Substances (ATSDR). This list 
established, in order of priority, the substances most 
commonly found at facilities on the NPL and which are 
determined to pose the most significant potential 
threat to human health due to their known or 
suspected toxicity and potential for human exposure. 
The list is revised and published every 2 years, with 
a yearly review and revision to reflect additional infor¬ 
mation on hazardous substances. 

This priority list is not a list of “most toxic” sub¬ 
stances, but rather a prioritization of substances based 
on their frequency, toxicity, and potential for human 


exposure at NPL sites. This priority list and the order of 
the materials on it are based on an algorithm that relies 
on three components: frequency of occurrence at NPL 
sites, toxicity, and potential for human exposure to 
the substances found at NPL sites. Some substances 
included on the list are low in toxicity but often appear 
at NPL sites. The items high on the list are subjects of 
toxicological profiles prepared by ATSDR [8]. 

Liability Scheme 

The liability scheme in force under CERCLA is novel 
and contradicts even some conventional canons of 
common law. For instance, a party cannot avoid liabil¬ 
ity under CERCLA simply by entering into a contract 
with another party, such as the new owner of a site, that 
transfers liability or indemnity. CERCLA supersedes 
such agreements. 

CERCLA delineates four classes of “Potentially 
Responsible Parties,” or PRPs, any of which may be 
liable for the cleanup of a Superfund site. These four 
classes are: 

• The current owners or operators of the facility 

• Owner or operators of the facility at the time of 
disposal of any hazardous substance 

• Any person who arranges for the disposal, treat¬ 
ment, or transport of a hazardous substance at the 
facility 

• Any person who transported a hazardous sub¬ 
stance, pollutant, or contaminant to a facility, or 
accepted the material for transport, if that person 
selected the facility 

Within these categories, the liability is broad. As an 
example, both the current owner and the owner at the 
time of contamination may be held liable. Does this 
mean that an owner who knows nothing about the 
dumping of materials years before on a property 
the new owner purchased just last year may be held 
liable for the cleanup of the material? In some cases, the 
answer is “yes,” although such an owner is not entirely 
without defenses. Owners have a better opportunity 
to avoid liability if the release did not occur during 
the time they owned the property, and if they 
performed due diligence in purchasing the property, 
handled any hazardous materials discovered with care, 
and cooperated fully with EPA inquiries. See section 
“Defenses” below. 
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The operator is generally defined as the party who 
controls the operations at a site or facility. Parties that 
hold a security stake in a facility or site but are not 
technically its owners may be considered operators if 
they make any control decisions about a business or 
property. 

Transportation companies that move materials 
from place to place may or may not be held liable 
under CERCLA. Courts have ruled that transporters 
who select the site where the materials are brought 
(generally the Superfund site) are liable, while those 
simply moving materials to a site picked by another 
party, such as the generator, are not liable. However, 
in actual practice, generators have often hired trans¬ 
porters simply to remove the materials from the site, 
without knowing where the materials would go. In 
these cases, the transporter is liable for selecting the 
disposal site, and the generator is also liable under the 
category of “Arranger.” 

Arrangers, often referred to as “generators,” are the 
largest category of parties held liable under CERCLA. 
Arrangers contract for the removal, disposal, or treat¬ 
ment of hazardous materials, which may or may not be 
owned or produced by the arranger. This category has 
somewhat fuzzy boundaries - for instance, what if the 
arranger sells the material to another party for 
a legitimate business reason, such as for use in an 
industrial process? Is this “arranger” liable when the 
material later is dumped at an unmarked site? 
However, what if this “sale” was simply a ruse to 
cover for an illegal dumping operation? Such questions 
are subject to careful investigation and often litigation. 

Defenses 

Defenses against liability under CERCLA are limited. 
CERCLA is considered a “strict liability” law, meaning 
that mitigating factors such as negligence or an absence 
of negligence are not always considered in reaching 
determinations of liability. Even the defenses provided 
for potential PRPs under the law may not be viable in 
actuality. As one example, the law makes a provision 
for an Act of God defense; in actual practice, however, 
this remedy has not been upheld. Courts have ruled 
that all the usual “acts of God,” hurricanes, tornadoes, 
earthquakes, etc., are foreseeable, and the resulting 
release of hazardous materials is avoidable with the 


proper precautions. Similarly, an act of war provides 
a defense only when the act is demonstrated to be the 
sole cause of the release [9]. 

A more viable defense that is frequently raised is the 
“Omission of a Third Party” defense whereby a defen¬ 
dant attempts to escape liability by claiming that the 
release was solely the act of a third party, for which the 
PRP is not responsible. However, for this defense to be 
applicable, the PRP may not have an articulated rela¬ 
tionship with the third party; for instance, the third 
party may not be a rogue employee of the PRP company, 
or a party working under a contract with the defendant. 

Another related defense is referred to as the “Held 
Only for a Security Interest” defense, or the “Secured 
Creditor Exception,” which may protect parties that 
hold a stake in a property only for a security interest, 
without participating in the management of the prop¬ 
erty. Lenders are among those who may be eligible for 
this defense. 

The revisions of CERCLA under SARA codified an 
emerging defense related to the Omission of a Third 
Party defense, the “Innocent Landowner” defense. To 
qualify for this defense in most cases, the landowner 
must establish a justifiable ignorance of the contami¬ 
nation. For instance, it is not enough simply to have 
purchased the property and then discovered only after¬ 
ward that the site was contaminated. Instead, owners 
are required to show that they did due diligence, 
including undertaking all appropriate inquiry into the 
conditions of the property before purchase. Land- 
owners may also rely on an “innocent landowner” 
defense in cases in which a property is acquired 
through inheritance or bequest. 

Under the Small Business Liability Relief and 
Brownfields Revitalization Act, another defense for 
another class of “innocent landowners” is designated. 
To qualify, a number of provisions have to be demon¬ 
strated, including: 

• All disposal or release of hazardous materials 
occurred before the landowner purchased the 
property 

• All appropriate inquires into the previous owner¬ 
ship and use of the facility were made 

• Appropriate care was taken regarding the hazardous 
materials, including appropriate action to stop the 
release 
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The landowner is also required to cooperate fully 
with the EPA’s investigation into the property, the 
release of hazardous materials, and any response 
actions. 

The Small Business Liability Relief and Brownfields 
Revitalization Act also provided another category of 
defense for PRPs, called the “De Micromis Party Excep¬ 
tion.” This defense is for parties who have generated, 
transported, or arranged for transport of only small 
amounts of hazardous material. The exemption applies 
when amounts are below 110 gal of liquid materials or 
200 lb of solid materials. However, the exemption only 
applies when these materials do not figure heavily in the 
cost of remediation or restoration of the site. This 
exemption also protects a “de micromis” party from 
other PRPs seeking contributions toward the expense of 
remedies: A PRP suing another potential PRP must 
show that the “de micromis” defense does not apply [3] . 

Enforcement Tools 

The EPA has several powerful enforcement tools to 
direct toward PRPs. Two of these, civil judicial injunc¬ 
tive actions and unilateral administrative orders, fall 
under the “imminent hazard” section of CERCLA, 
which authorizes the EPA to act if it determines 
that a release or threatened release poses an “imminent 
hazard” to public health [10]. An administrative order, 
in particular, can be compelling, as courts have held 
that PRPs have no right to a hearing or review before 
the order is issued. Willful violation of a unilateral 
EPA order makes the PRP subject to a fine of up to 
$25,000/day [3]. 

Litigation is another potential remedy. Under 
CERCLA, monies that can be recovered through litiga¬ 
tion include costs incurred by state or federal govern¬ 
ments, costs incurred by another party, and damages to 
natural resources including the expense of damage 
assessment. Under these broad categories, the EPA has 
recovered expenses for investigation, monitoring, and 
testing of a site; planning a response; staff costs; attor¬ 
ney fees; as well as expenses related more directly to 
remedial actions at the site [3]. 

The EPA prefers not to take remedial action on 
its own and then attempt to recoup the costs of its 
actions from PRPs. The preferred response is for 
PRPs themselves to conduct the remediation, under 


the monitoring and supervision of the EPA or relevant 
state agencies. Further, the EPA prefers that these 
actions be taken “voluntarily,” which generally means 
under threat of a potential administrative order or 
another action. The EPA has several advantages that 
make its consultations with PRPs likely to be convinc¬ 
ing. If the PRP complies with the EPA, it may escape the 
fines imposed under administrative orders, the dam¬ 
ages granted by courts, and the legal fees entailed in 
court battles. It may also reduce any negative publicity 
or quell the complaints of angry citizens. Rather than 
battle the EPA, a route that many PRPs find more 
fruitful is to engage in discussions with, or legal actions 
against, other PRPs involved with the same release or 
facility to attempt to spread fiscal responsibility for 
the cleanup. 

States and Local Governments Under CERCLA 

States are responsible for the future management of all 
removal and remedial actions, once those actions are 
judged to be complete. States must also provide at least 
one federally licensed hazardous waste disposal facility 
to receive wastes removed from the Superfund sites 
within the state. The facilities may also be privately 
owned or operated and contracted to the state. 

Under SARA, states were given additional roles in 
federal cleanup actions. States have the opportunity to 
review and comment on many federal actions involved 
in a remediation, including the Remedial Investigation 
and Feasibility Study, the proposed remedial action, the 
engineering design of the action, technical data and 
reports related to the remedy, and any decisions to 
waive standards. 

States and local governments may also be held liable 
under CERCLA if they have acted in the roles of the 
PRP categories. However, governmental agencies act¬ 
ing purely in a regulatory manner are generally exempt 
from liability. 

Case Study: Agriculture Street Landfill Project 

The Agriculture Street Landfill in the Ninth Ward of 
New Orleans was operated as a landfill for approxi¬ 
mately 50 years, ending in 1958. In 1965, it was briefly 
reopened to receive a massive amount of debris from 
Hurricane Betsy. The landfill was locally referred to as 
Dante’s Inferno because its contents often caught fire, 
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and the area was frequently smoldering. It was a toxic 
environment to be sure. 

In 1967, the city of New Orleans and the Housing 
Authority of New Orleans decided to develop the area 
as a community for low-income residents. Between 
1969 and 1971, the Press Park townhomes were devel¬ 
oped. Later, Gordon Plaza, a development of single¬ 
family homes was constructed. Several playgrounds 
were also built in the area. A tract of land was set 
aside for an elementary school, and, in 1986, Mouton 
Elementary School opened for its first group of stu¬ 
dents. Residents and employees working in the area 
often complained of unusual smells and illnesses. 

The record of environmental testing at the site has 
been contested. According to the area citizens’ group 
and the law firm that eventually filed a class action suit 
on behalf of area citizens, the results have often been 
kept from residents. The city of New Orleans tested the 
site in the late 1970s in anticipation of the development 
of Gordon Plaza. After the testing, the city required the 
builders to add topsoil to the building site, but the 
results of the tests themselves have never been made 
public. Before the construction of Mouton School, 
another round of testing revealed the presence of lead, 
mercury, and arsenic. The Orleans Parish School Board 
received a recommendation that topsoil be brought in 
and a clay barrier used to contain the contamination. 
However, no barrier was established when the school 
was built. Instead, during construction area residents 
were subjected to dusts that may have included hazard¬ 
ous materials [11]. 

In 1986, the EPA tested the area, but area residents 
said that they were never shown the results of the test. 
In 1993, the EPA tested again . This time, they reported 
to area residents that the former Agriculture Street 
Landfill was contaminated with approximately 150 dif¬ 
ferent hazardous substances, including almost 50 that 
were known to cause cancer. In 1994, the EPA declared 
the area a potential Superfund site and placed it on the 
National Priority List. 

In 1994, area residents formed Concerned Citizens 
of Agriculture Street Landfill. Utilizing an important 
provision of CERCLA, the group requested a technical 
assistant grant from the EPA, to fund an independent 
environmental consultant. The group lobbied for 
the closing of the Mouton school, and the school was 
soon closed. 


In the mid-1990s, the city government broke ranks 
with the federal authorities; led by Mayor Marc Morial, 
the city joined residents in arguing for federal support 
for relocating area residents. The EPA believed that the 
site could be remediated and from 1998 to 2001 took 
action, including replacing topsoil from open areas, but 
without excavating structures or streets. Residents were 
given certificates stating that their properties had been 
partially remediated, and were also given written restric¬ 
tions on what they could do with their properties [11]. 

A lawsuit filed on behalf of the residents was 
granted class action status in 1999. The case went to 
trail in 2004, with the residents prevailing. The landfill 
area was ruled to be unreasonably dangerous under 
state law. The residents were awarded damages and 
market value of their properties. Employees and stu¬ 
dents also received awards. The judge in the case used 
discretionary powers to increase the monies received by 
100%. Upon appeal, judgments for emotional distress 
were halved, but the rest of the decision stood. As of 
2010, residents were applying to receive their claims, 
some 40 years after the regrettable residential develop¬ 
ment of the former landfill began [11]. 

As for the landfill site, it is believed that Hurricane 
Katrina removed much of the new topsoil deposited in 
remedial actions and may have dispersed contaminants 
throughout the area. It may be that the EPA’s efforts in 
this area, which many believe were not of sufficient 
boldness to respond effectively to the severity of the 
contamination, have come largely to naught [12]. 

Criticisms of CERCLA 

CERCLA cuts a board swath through American society. 
It is an ambitious federal regulation that was designed 
to engage the enormous and complex problem of 
releases of hazardous materials. It also imposes new 
and in some ways novel legal requirements on those 
who may be deemed in any manner responsible for 
a release or threatened release, even in cases in which 
they would not be held responsible under conventional 
canons of common law. It is little wonder that CERLCA 
is controversial and comes under criticism from a range 
of viewpoints and for myriad reasons. 

Business owners, industry and corporate interests 
have complained that the strict liabilities imposed by 
CERLCA inhibit economic growth, not only by tying 
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up money in expensive remediations, but also by cre¬ 
ating a chilling effect that prevents the purchase for 
redevelopment of any sites that could prove to be 
contaminated. One of the main charges laid against 
CERCLA from this perspective is that it simply is not 
“fair” It is unfair, for instance, for owners or operators 
who exercised all due caution in handling a material 
to be held liable for a release that occurred due to 
the unsafe actions of another party. It is also unfair 
to expect today’s generation of business owners or 
shareholders to remediate a site that was contaminated 
years ago. 

A particular sticking point has been the extension of 
liability to parties that may have sent only a small 
quantity of materials, comprising a hazard that is lim¬ 
ited in scope and degree, to a disposal facility where 
a hazardous release eventually occurred. For instance, 
cases that involved pizzerias, donut shops, fitness facil¬ 
ities, and a local Elks Club have been publicized [13]. 
The revisions of defense provisions of CERCLA under 
the Small Business Liability Relief and Brownfields 
Revitalization Act were directed at ensuring that small 
business and other parties who are not large owners, 
operators, generators, handlers, or transporters of haz¬ 
ardous materials may escape liability. 

Simultaneously, environmentalists and citizens’ 
groups have criticized CERCLA on other grounds. 
These parties have argued that CERCLA remains tied 
to corporate interests, that the EPA is hesitant or unwill¬ 
ing to take on controversial projects, and that its pre¬ 
ferred remedial actions often do not go far enough to 
solve the problem. The slow progress of cleanup projects 
is another common complaint [6]. 

As a large public project that uses taxpayer money, 
CERCLA has also drawn criticism from taxpayers and 
watchdog groups that have raised objections to what is 
perceived as inefficient use of public funds. However, 
other critics and public officials argue that the real 
problem is that there is simply not enough money in 
the system to perform the ambitious work CERCLA 
calls for, especially since the industry tax that originally 
funded the Superfund expired in 1995 and has not been 
renewed. 

In the absence of public funding, private funding 
for remedies under CERCLA is all the more essential. 
However, businesses have also found a way to circum¬ 
vent the law: declaring bankruptcy to avoid legal or 


financial responsibility [ 14] . The use of the legal system 
to settle questions of CERCLA liability, whether 
between PRPs and the EPA or among different PRPs, 
has also been criticized as a drain on resources [15]. 

Critics from a variety of perspectives have also argued 
over issues involving the intersection of science and 
politics, including methods for assessing and rating 
sites and placing certain projects of the National Priority 
List. Some have claimed that the risks of some sites are 
greatly exaggerated by the EPA, while the risks posed by 
other sites maybe overlooked. Questioning the objective, 
scientific basis for placement on the NPL, some critics 
have said that there is no evidence that those projects 
included on the NPL are more hazardous than those that 
are not. Similarly, some critics have claimed that due to 
improper risk assessments, the EPA has performed 
unnecessary actions on sites, beyond those needed to 
control the hazards [15]. In the matter of the Agricul¬ 
ture Street Landfill, however, the opposite claim borne 
out by the courts is seen: that the EPA’s actions did not 
go nearly far enough to ensure public health. 

Perhaps one of the most resonant criticisms of 
CERCLA, offered over the years by people from differ¬ 
ent political persuasions and economic or social inter¬ 
ests, is that its provisions make the revitalization or 
rehabilitation of older, often abandoned, industrial 
sites much less attractive. Not only have potential 
developers or owners been reluctant to take control of 
such properties for fear of liability, lenders are also 
reluctant to make loans for the purchase or redevelop¬ 
ment of the property. Provisions of the Small Business 
Liability Relief and Brownfields Revitalization Act 
address this issue. 

Future Directions 

While criticisms of CERCLA are many, they have not 
yet reached a pitch that would put the future of 
CERCLA into radical question. Americans are too 
concerned with issues of sustainability, public health, 
and the condition of the environment to allow 
hazardous sites to go without remediation. Alternative 
schemes have been proposed - for instance, market- 
based systems that would allow for the remediation and 
redevelopment of sites by private parties under market 
conditions [ 15] . However, such alternatives have not so 
far been seriously engaged in the political system, 
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suggesting that at this point, Americans want govern¬ 
mental agencies to have a strong hand in remediation 
efforts, as well as the prevention of threatened hazard¬ 
ous releases. 

The most pressing issues as CERCLA heads into the 
future appear to be funding cleanups and maximizing 
the effectiveness of funds spent. Historically, approxi¬ 
mately 70% of cleanup activities under CERCLA have 
been paid for by PRPs. However, as parties find avoid¬ 
ance methods, whether through legal exceptions or by 
declaring bankruptcy, the federal share may increase. 
Originally, federal monies were provided through 
taxes on industry, but these have since expired. The 
Superfund is now increasingly dependent on general 
fund appropriations. The Superfund did receive a 
boost in funding under the American Recovery and 
Reinvestment Act of 2009, the federal stimulus meant 
to alleviate a recession. This act provided $600 million 
that was directed toward 50 Superfund sites, approxi¬ 
mately doubling the federal investment in these cleanup 
operations. However, it is not clear how or if this level of 
funding can be sustained when the Recovery Act expires 
[16]. One hopeful sign, however, may be a decrease 
in litigation and resultant expenses [17]. President 
Obama has also proposed reinstating the corporate 
taxes that originally funded the Superfund [18]. 

Even as commitment to CERCLA wavers, at least 
some degree in the USA, the act is seen worldwide, in 
some instances, as an intriguing method for funding 
other environmental and sustainability projects. As one 
example, the proposed International Climate Change 
Fund, an alternative to the climate agreement set by the 
Kyoto Protocol, would use a trust fund similar to the 
Superfund to pay for the development and purchase of 
“clean” or low-carbon technologies for developing 
countries. All countries would be required to pay 
into the fund, with the payment based on their own 
carbon emissions, including greenhouse gases and 
“flow” emissions [19]. 

CERCLA is still a relatively young law, especially 
given the scale of the work it engages in the world. It is 
also novel, in purpose, orientation, and form. Even 
after 30 years, one may still be in the mode of adjusting 
to it and striving to understand if it is the right means 
for the job it set out to do. Thus, revisions of CERCLA 
to date, such as SARA and the Small Business Lia¬ 
bility Relief and Brownsfield Revitalization Act, have 


performed formative functions, reshaping the law to be 
more effective and fair, rather than supplanting 
CERCLA altogether. CERCLA was boldly constructed 
for an enormous undertaking. Whether it can complete 
this work is unknown and under dispute. However, it 
seems fair to say that lesser or more conventional 
legislation would be even less likely to succeed. 
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Glossary 

Convection Mass transfer effected by flow due to 
applied forces like pressure (forced convection) or 
to density differences (free convection). 

Diffusion Mixing caused by molecular motion. 
Diffusion coefficient The negative of the flux per con¬ 
centration gradient. 

Diffusivity Another name for the diffusion 
coefficient. 

Dispersion Mixing caused by diffusion and simulta¬ 
neous flow. 

Flux Mass or moles transferred per area per time. 
Mass transfer Diffusion and dispersion, especially 
across interfaces. 

Mass transfer coefficient The flux per concentration 
difference, especially near an interface. 

Overall mass transfer coefficient The flux per virtual 
concentration difference from one phase across an 
interface into a second phase. 
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Diffusion is mixing without stirring. It is mixing caused 
by Brownian motion, that is, by thermally induced 
random motion of molecules or small particles. 
Because diffusion is often slow, it frequently limits the 
overall rate of the process. Diffusion has the reputation 
of being a difficult subject, which it can be; however, the 
difficulty most often comes from complicated units, 
from interfaces, or from the combination of diffusion 
and convection. By itself, diffusion is not hard. It is 
easier than viscous flow and much easier than ideas like 
entropy or chemical potential. 

Two other phenomena, closely related to diffusion, 
are also reviewed in this entry. Dispersion is mixing 
caused by the interaction of flow and diffusion. Often, 
it is described using mathematics similar to those 
which describe diffusion. In environmental problems, 
these two phenomena are sometimes treated without 
distinction and without penalty. Mass transfer, an alter¬ 
native description of diffusion, assumes that all con¬ 
centration changes occur near interfaces. While it is 
used largely to describe chemical processing, it has 
considerable value in environmental problems. 

Introduction 

This entry is organized as four sections. The first section 
gives the mathematical description of diffusion itself. 
The second reviews dispersion, a different phenomenon 
that is mathematically similar to diffusion but which is 
caused by different physical effects. The third section, 
“Diffusion Coefficients,” reviews values of the diffusion 
coefficients themselves; and the fourth section “Diffu¬ 
sion Across Interfaces,” explains mass transfer across 
interfaces, especially air-water interfaces. Finally, 
the section “Important Special Cases” reports some 
common situations which have important features. 

Basic Diffusion 

The basic mathematical description of diffusion is 
Fick’s Law, suggested by Adolph Fick (1829-1910) 
when he was just 26 years old. For a dilute solution, 
Fick’s Law is: 


where j { is the flux of a solute “1,” and dci/dz is the 
concentration gradient, that is, the change of the solute 
concentration with position. The diffusion coefficient 
D is a proportionality constant which is nearly constant 
in almost all situations. The flux is the amount of solute 
moving per cross-sectional area per time and so has the 
dimensions of mass (M) per area (length squared L 2 ) 
per time (t). The concentration gradient has dimen¬ 
sions of concentration {MIL 3 ) per distance (L). Thus 
the diffusion coefficient has dimensions of (if It ). 

The form of Fick’s Law in Eq. 1 has some hidden 
implications. First, it is a one-dimensional equation of 
what is actually a more general vector relation. Because at 
least four out of five diffusion problems are one¬ 
dimensional, this is often not a major issue. Second, the 
concentration can be expressed in different units. If it 
were expressed in moles per volume, then the flux 
would be in moles per area per time. If it were expressed 
as a mole fraction or a mass fraction or a partial pressure, 
then unit conversions would be necessary and annoying, 
but not difficult. Third, the minus sign in Eq. 1 is arbi¬ 
trary, stuck in to make the diffusion coefficient positive. 
The only difficult implication of Eq. 1 is the restriction 
to dilute solutions, explored in more detail at the end of 
this section. The restriction is rarely important because 
solutions in the environment are so often dilute. For 
example, liquid water contains 55 mol/1, so almost 
every aqueous solution is dilute. 

The most important case of Fick’s law is diffusion 
across a thin film, described next. Other important 
cases and concentrated solutions are reviewed later. 

Diffusion Across a Thin Film 

The simplest case, steady diffusion across a film, is also 
the most important. Imagine a thin film separating two 
well-stirred solutions, as shown in Fig. 1. On the left, 
the solution has a concentration c 10 ; on the right, the 
concentration is c u . The key parts of this case are the 
variation of concentration across the film q(z) and 
the flux ji. Finding these requires a mass balance on 
a differential volume Az thick and located at an arbi¬ 
trary position z within the film: 

{mass accumulation) = {mass diffusing in — out) 


c 


h = 


—D — 
dz 


(1) f {ciAAz) = 0'i A) z - (jiA) z+Az 


( 2 ) 
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Figure 1 

Concentration profile across a thin film 


where A is the constant cross-sectional area of the film. 
Because diffusion is steady, the concentration does not 
change with time, the left-hand side of Eq. 2 is zero, and 


C\ — Cio _ 2 
Cll — ClO £ 


(9) 


The flux can now be found by combining this result 
with Fick’s Law: 


dci 

]i = 

D dz ( 10 ) 

= C1 *) 

If the concentration difference across the film is dou¬ 
bled, the flux doubles. If the diffusion coefficient is 
twice as big, the flux will be twice as big, too. If the 
film thickness increases two times, the flux will be cut 
in half. 

This important example is so simple mathemati¬ 
cally that many novices tend to skip over it. This is 
a mistake. Its nuances are explored by the following 
questions: 

1. How does the flux change if the film is chemically 
different than the adjacent solutions? 


•h \z illz+Az 

(z + Az) — z 


0 dz 

(3) 

This restates the assumption of steady-state diffusion, 
independent of time. Combining this relation with 
Fick’s Law (Eq. 1) yields: 

d 2 C\ 

° = D lz 2 

(4) 

This is subject to two boundary conditions: 


N 

II 

O 

II 

o 

(5) 

II 

II 

n 

(6) 


This is enough to solve this important problem. 
Equation 4 may be integrated once to find: 


where A is an integration constant. Integrating a second 
time gives: 

C\ — Az T B (8) 


where B is a second integration constant. Evaluating 
A and B from Eqs. 5 and 6 gives: 


In this case, the boundary conditions in Eqs. 5 and 6 
change to: 

z = 0 C\ — Hcio (11) 

z = £ ci = Hcu (12) 

where H is a partition coefficient, the ratio at equilib¬ 
rium of the concentration inside the film to that in the 
adjacent solution. Paralleling the arguments above, 

h = ^ ^ ^ ( c io - c u ) (13) 

The diffusion coefficient D in Eq. 10 is replaced with 
the product ( DH ), which is called the permeability. 
(The term ( DH/l ) is called the permeance.) As will be 
shown later, diffusion coefficients in gases and liquids 
do not vary much, but partition coefficients vary a lot. 
Thus partition is often the key to permeability. 

2. How is the flux changed by a fast reversible reaction 
giving an immobile product? 

This case occurs surprisingly frequently in, for 
example, adsorption in soil or dyeing of wool. The 
answer is that at steady state, the flux does not change. 
Every point crossing the film has a different concentra¬ 
tion which is in equilibrium with a different absorbed 
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amount. Still, at steady state, the reaction is at local 
equilibrium and does not affect the flux. This is not the 
case for unsteady state, for irreversible reactions, or for 
mobile reaction products. 


Other Important Cases 


Many diffusion problems are not thin films. Surpris¬ 
ingly, many do behave as if they were thin films. For 
example, for a sphere of radius R slowly dissolving in 
a stagnant fluid with a concentration of c 1oo , the flux is: 


h 


D 

R 


(Clo Cloq) 


(14) 


where c 10 is the concentration in solution at the surface of 
the sphere. For diffusion from a solution of c 10 through 
a very thin, impermeable film with a cylindrical orifice 
of radius R , and into a solution at c loo , the flux is: 



(C\0 ~ Cioo) 


(15) 
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Concentration profile into a semi-infinite slab 
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The fluxes in these cases are strong mathematical 
parallels to that in Eq. 10. The mathematics in these 
cases is different, but the final result is remarkably 
similar. The thin film limit is a good guide about 80% 
of the time. 

For unsteady state diffusion, this is not true. Fluxes 
and concentration profiles for a wide variety of unsteady 
cases have been calculated and are tabulated in a few 
clear texts. One of these cases, useful in perhaps 10% 
of all cases, is unsteady diffusion into a semi-infinite 
slab, shown in Fig. 2. In this case, a mass balance gives: 


dfi _ <9 2 <4 

dt dz 2 


(16) 


where t is the time. For this semi-infinite slab, the initial 
and boundary conditions are: 


t = 0 

allz 

C\ 

— Cioo 

(17) 

t = 0 

z = 0 

C\ 

= ClO 

(18) 

t = 0 

Z = (X) 

Cl 

= Cioo 

(19) 


The concentration profile in this case is: 


Cl Cioo 

ClO — Cioo 


1 — erf 


z 

\f\Dt 


( 20 ) 


where erf is the error function. The flux at the edge of 
the slab, that is, at z = 0, is: 


h = 



(Cl Cioo) 


( 21 ) 


This is the key result for this case. 

The cases of a thin film and a semi-infinite slab are 
especially important because they bracket observed 
behavior. More specifically, the result in Eq. 21 for 
a semi-infinite slab is compared with the flux across 
a thin film in Table 1 . In both cases, the flux will double 
if the concentration difference doubles. If the diffusion 
coefficient doubles, the flux across a thin film doubles, 
but the flux into the semi-infinite slab increases \fl 
times. If the film’s thickness doubles, the flux drops two 
times for the film, but is unchanged for the slab. If 
diffusion occurs for twice as long, the flux for the thin 
film keeps its steady value, but that for the slab drops by 
a factor of 1 /a/ 2. These limits will usually bracket all 
diffusion behavior because all shapes will be between 
the film and the slab. These two cases are key to under¬ 
standing the mathematics of diffusion. 

A third special case is especially important for envi¬ 
ronmental engineering. This is the decay of a pulse. In 
this case, a large amount of solute is released at 
a particular plane at z = 0. Solute diffuses away from 
this position in only one dimension. The solute 
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Table 1 Flux across a film or into a slab. These two cases 
are important because they bracket almost all diffusion 
problems. In this table, K is the equilibrium constant of the 
rapid chemical reaction 



Thin film 

Semi-infinite 

slab 

Concentration difference 

Act 

Act 

Diffusion coefficient 

D 

Vd 

Thickness 

£ _1 

~ 

Time 

- 

r 1 

Flux without reaction 

ii = j Aci 

II 

> 

n 

Flux with fast reversible 
reaction 

h = ?Ac, 

h - Ac 


concentration as a function of position z and time t 
gives the details of any environmental impact. 

The mathematics follows the same route as the slab: 
a mass balance is subject to initial and boundary con¬ 
ditions, which are combined with Fields Law and solved 
to give the concentration profile. The mass balance is: 


dci _ <9 2 ci 

dt dz 2 


( 22 ) 


This mass balance, identical with the mass balance for 
the slab (Eq. 16), occurs so frequently that some 
just call it “the diffusion equation.” The initial and 
boundary conditions are different from Eqs. 17-19 
for the slab: 


t = 0 all z 

Cl 

ii 

(23) 

*■+ 

II 

o 

N 

II 

O 

dc\ 

dz 

= 0 

(24) 

Z = 00 

Cl 

= 0 

(25) 


In these conditions, M is the total solute injected, A is 
the cross-sectional area, and <5(z) is the Dirac function, 
equal to zero everywhere except at z = 0, where it is 
infinity. While the mathematical solution of Eqs. 22-25 
is tricky, the answer is simple: 


' M/A ' 
yj AnDt_ 


e 4Df 


In this Gaussian concentration profile, the quantity in 
square brackets is the maximum concentration, which 
drops as time grows. The mathematical form of this 
important result is also observed for other environ¬ 
mental problems which do not depend only on diffu¬ 
sion, as described in the section “Dispersion.” 


Concentrated Diffusion 


One reason that diffusion has the reputation of being 
difficult comes from the major complexities of concen¬ 
trated solutions. These complexities are rarely impor¬ 
tant in environmental engineering and so should be 
ignored unless there are good experimental reasons not 
to do so. These complexities are mentioned here only 
to illustrate when they are important. 

To explore this, imagine putting a pot of room 
temperature water of 25° C on a stove that is turned 
off. The flux of any evaporating water will be given by 
Eq. 10. If the air above the stove were dry, c u would be 
zero. Because the water at 25°C has a vapor pressure of 
about 25 mmHg, c 10 is about: 


Go 


25 mm Hg ] 1 mol 

750 mm HgJ 22.4 x 10 -3 m 3 

mol 
1.5—— 


(27) 


If the liquid water in the pot is 0.1 m below the rim 
and the diffusion coefficient of water vapor in air is 
about 2.8 x 10 -5 m 2 /s, the flux is: 


h — ~J ( c 10 - c ll) 

2.8 x 10 -5 m 2 /s 


4 x 10 


0.1m 
mol 

m 2 s 


mol 
1.5—-0 


(28) 


-4 J 


Now imagine heating the liquid water in the pot to 
boiling. If the heat flux q is 20 kj/m 2 s, then the molar 
flux caused by boiling is: 


tii 


q 

AH va p 

20kJ/m 2 s 
48kJ / mol 
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( 29 ) 
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The boiling flux n x is 1,000 times greater than the 
diffusion flux at room temperature, but it is not 
a function of D. Thus slow dilute evaporation is a 
function of the diffusion coefficient; but boiling 
depends not on diffusion but on heating rate. 

But what about intermediate cases? For example, 
how fast will evaporation take place at 50°C? 

Answering this question requires a more complete 
form of Fick’s Law. Unfortunately, there is no single 
way to do this. One choice is the following: 

[Total flux] = [Diffusion flux] + [Convective flux] 


«i = ci(vi - v) + c x v = ; r + civ 


= - D d 4 i 

dz 


(30) 


Cl V 


where v is most often a volume average velocity. The 
most common alternative form of Fick’s law, strongly 
advocated by a few zealots, may be approximated as: 

V d=^(v2-vi) (31) 

where c 2 and v 2 are the concentration and velocity of the 
solvent. This form avoids choosing a convective velocity, 
but it can cloud the physical significance of the problem. 

Fortunately, the result for a problem like the water 
evaporation given above is the same for both forms of 
Fick’s Law given in Eqs. 30 and 31. If water at 50°C is 
evaporating from the pot above into dry stagnant air, 
the total flux is: 


Dc \ Cl o\ 

-2.8 x lO-’mVskjSfc) / 

0.1m V 

= 1.62 x 10 _3 mol/m 2 s 

This is about 7% greater than the result would be if 
calculated from Eq. 10 for a dilute solution at 50°C. 
The moral is clear: the effects of concentrated diffusion 
will only rarely be important in the atmospheric and 
aquatic environments. 

This completes our basic description of the three 
cases key to understanding diffusion. These are the thin 
film (80% of the cases), the semi-infinite slab (10% of 
the cases), and the decay of a pulse (5% of the cases). 
Other cases with different boundary conditions do 


92.5\ 

760/ 

(32) 


occur, and solutions for these are tabulated in the 
literature. However, these other cases are not as com¬ 
mon in practice. Similarly, diffusion in concentrated 
solutions is complicated but infrequently important. 
The simple form of Fick’s Law in these three cases is the 
best way to get started. 


Dispersion 

We now turn to an environmentally important prob¬ 
lem mathematically similar to diffusion but with a 
different physical origin. To make this problem specific, 
imagine dealing with the spill of a single toxin on the 
ground. Imagine the concentration of the toxin spreads 
in one dimension with time and groundwater flow, 
producing a roughly Gaussian concentration profile. 
We want to know how the spread of this toxin varies 
with the diffusion coefficient of the toxin. 

The answer is surprising: if the diffusion coefficient 
increases, the spread of the toxin maybe bigger, smaller, 
or unchanged. The toxin’s concentration profile is: 

M/A (z-vt) 2 

Ci = —== e 4Et 
y/AnEt 

where M is the total amount of toxin, A is the cross- 
sectional area across which the dispersion occurs, and v 
is the velocity of any flow through the soil. The disper¬ 
sion coefficient E has the same units as the diffusion 
coefficient but will often be much larger. This result, 
a complete analogue to Eq. 26, can be derived from 
parallels to Eqs. 22-25 by replacing the diffusion coef¬ 
ficient D with the dispersion coefficient E. However, 
while this mathematical parallel is complete, it does not 
explain the physics responsible for dispersion. 

To explore the physics involved, imagine the toxin is 
injected as a pulse into a small tube of diameter d. The 
toxin’s dispersion will be a strong function of how much 
flow is in the tube. If there is absolutely no flow, then the 
dispersion coefficient equals the diffusion coefficient: 

E = D (34) 

Increasing diffusion increases dispersion. If there is 
a small, laminar flow of velocity v, then: 


192 D 


( 35 ) 
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Increasing diffusion decreases dispersion. The velocity 
where Eq. 35 becomes dominant is when: 


v 2 d 2 
192 D 2 


» 1 


(36) 


For non-absorbing soil with the equivalent of 500 pm 
particles and diffusion in water liquid, D is about 
10 -9 m 2 /s, so v must be much greater than 30 pm/s 
or 2 m/day for Eq. 35 to swamp Eq. 34. If there is a 
large, turbulent flow (dv/v > 2,000), then: 


E = 


dv 

T 


(37) 


Eq. 37, comes from Taylor dispersion and is often the 
most important. The third term is new, the result of the 
rate of absorption. 

This overview of dispersion is intended as a caution 
and a starting point. The caution is that many Gaussian 
concentration profiles are due to diffusion coupled 
with other phenomena. The starting point in under¬ 
standing these profiles is recognizing that their spread 
can depend inversely on diffusion. In other words, slow 
diffusion may result in wide dispersion. 

Diffusion Coefficients 


Dispersion, now independent of diffusion, is due to the 
coupled turbulent fluctuations of concentration and 
velocity. The physical basis of Eqs. 33-37 is associated 
with G.I. Taylor. 

Those with a more practical bent may correctly be 
skeptical of modeling flow through a soil as occurring 
in a straight tube. Others sharing this skepticism have 
extended this analysis to flow in packed beds. The key 
results involve two new quantities: 

t = t( 1 T /c r ) (3^) 

1 —s 
8 



k' = 


Soil concentration 


Solution concentration 


where s is the void fraction available for flow. In phys¬ 
ical terms, t is the time corrected for any absorption by 
the soil, including material that diffuses into the soil’s 
pores. The quantity k! is a type of equilibrium constant 
between the soil and the solution, lumping together 
adsorption and absorption. In this case, a pulse of toxin 
may still be dispersed to give the Gaussian concentra¬ 
tion profile in Eq. 24, but with time t replaced by t. The 
dispersion coefficient E is now given by: 


E = D(1 + k') 


3 2 - 2 


d 2 v 2 

192 D 


.'\2 N 


1 + 6k 1 + ll(fc') 
1 + k! 


V 

3D 7 


1 + /c' 


(40) 


where 3 is an equivalent thickness of an absorbent and 
D! is the diffusion coefficient in the absorbent, not in 
the solution. The first term on the right-hand side of 
Eq. 40, the parallel of Eq. 34, is due to diffusion in the 
direction of flow. The second term, the analogue of 


Diffusion is an important process because it is slow, 
and diffusion coefficients thus often control the overall 
rate of processes involving diffusion, flow, and chemi¬ 
cal reaction. Typical values of diffusion coefficients, 
shown in Table 2, are chosen from the wide number 
of references in the literature but corrected to 
a temperature of 25°C. This wide literature is much 
less extensive than studies of other physical properties 
like viscosity or Young’s modulus, because diffusion 
coefficients are relatively difficult to measure. 

The values in Table 2 show diffusion in gases is 
about 10,000 times faster than diffusion in liquids, 
which is in turn over a billion times faster than dif¬ 
fusion in solids. Diffusion coefficients in gases fall 
around 10 -5 m 2 /s. Diffusion coefficients in liquids 
fall around 10 -9 m 2 /s. Diffusion coefficients in solids 
are much more variable, but are so slow that most of 
the mass transport occurs in fluid-filled gaps and pores 
within the solid. For example, in a bed of sand, most 
transport occurs in the spaces between sand grains 
and relatively little within the bulk of the grains them¬ 
selves. In environmental problems, diffusion in gases 
and liquids is more important. 

The diffusion coefficients, given in Table 2 for 1 atm 
and 25°C, do change with process variables, as outlined 
in Table 3. The variation with temperature in gases and 
liquids is small. For example, the temperature must be 
increased from 25°C (= 298° K) to 200°C (= 473° K) to 
double the diffusion coefficient in gases. Because the 
viscosity of a liquid drops as the temperature rises, the 
diffusion coefficient in a liquid changes faster with 
temperature, but the change is still modest. In contrast, 
diffusion coefficients in solids usually change more 
rapidly with temperature, doubling every 10°C or so. 











Chemicals in the Environment, Diffusive Transport 



Chemicals in the Environment, Diffusive Transport. 
Table 2 Diffusion coefficients. Values given are in m 2 /s and 
at 298 K and 1 atm 


Gases 

Gas pair 

Diffusion coefficient 

Air-H 2 0 

2.6 x 1 ( T 5 

co r o 2 

1.6 x 10~ 5 

H 2 -N 2 

7.8 x 1 ( T 5 

h 2 -0 2 

8.9 x 10“ 5 

n 2 -0 2 

2.2 x 1 ( T 5 

n 2 -h 2 o 

2.9 x 10“ 5 

o 2 -h 2 o 

2.8 x 1 ( T 5 

0 2 -octane 

0.7 x 10^ 5 

Solids 

Diffusion coefficient 

C in Fe (BCC) 

6 x 10' 25 

Fe in Fe (BCC) 

3 x 10' 52 

B in Si 

7 x 10“ 33 

He in Si0 2 

4 x 10' 14 

Na + in NaCI 

1 x 10“ 36 

Ag + in AgCI 

1 x 10' 19 

Liquids 

Solute-solvent 

Diffusion coefficient 

o 2 -h 2 o 

2.10 x 10~ 9 

co 2 -h 2 o 

1.92 x 10' 9 

h 2 s-h 2 o 

1.41 x 10~ 9 

hci-h 2 o 

3.33 x 10' 9 

NaCI-H 2 0 

1.61 x 10“ 9 

CaCI-H 2 0 

1.33 x 10~ 9 

nh 3 -h 2 o 

1.64 x 10“ 9 

Urea-H 2 0 

1.38 x 10~ 9 

Sucrose-H 2 0 

0.52 x 10“ 9 

Albumin-H 2 0 

0.08 x 10~ 9 

h 2 o-c 2 h 5 oh 

1.24 x 10~ 9 

Benzene-butanol 

0.99 x 10“ 9 

Hexane-heptane 

4.21 x 10~ 9 


Chemicals in the Environment, Diffusive Transport. 
Table 3 Variations of diffusion coefficients 



Gases 

Liquids 

Solids 

Typical value, m 2 /s 

10~ 5 

10~ 9 

Much smaller 

vs. T 

r 3/2 

T 

Large 

vs. p 

P -1 

- 

- 

vs. solute diameter 

size -2 

size -1 

size 2 

vs. viscosity p 

A 1 

A -1 

- 


Other process variables also have relatively small 
effects. The diffusion coefficient in gases does vary 
inversely with pressure; but the gas concentration varies 
directly with pressure; so the flux, related to the diffu¬ 
sion coefficient times the concentration, may remain 
more constant. The diffusion coefficient does vary 
inversely with the size of the diffusing species. While 
these effects are usually modest for gases and liquids, 
they can be much larger for solids. These generaliza¬ 
tions are justified by the approximate physical argu¬ 
ments given next. 

Gases. The diffusion coefficients in gases can be 
predicted with reasonable accuracy from kinetic the¬ 
ory. This theory assumes that a gas contains individual 
molecules moving with thermal motion and colliding 
with each other only as pairs. Under these cases, the 
diffusion coefficient is given by: 

D = Gv (41) 

where X is the distance between collisions and v is the 
molecular velocity. For a monatomic gas, this velocity 
is kinetic, related to the thermal energy: 

1 

- mv 1 = /c B T (42) 

where m is the molecular mass and /c B is Boltzmanns 
constant. Keep in mind that here v is a molecular 
velocity. It is the sonic velocity; it is much greater 
than the average velocity v used in Eqs. 30 and 31. 

We must now estimate the distance between colli¬ 
sions X. There are two cases. First, for the bulk gas, X is 
found from the volume occupied by one molecule: 

[Volume of one molecule] = 

[Distance between collisions] x 

[Area swept out between collisions] 
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(43) 


negative of the gradient of the chemical potential 
lii. Thus: 


where a is the molecular diameter. Equations 41-43 
can be combined to find: 


/4V2\ (hij 
\ 3n J po 2 \fm 


(44) 


This approximate relation is close to that found from 
more complex theories: D does vary inversely with p, 
er 2 , and ^/m; it does vary with T to a power greater than 
one and less than two. 

The second important limit of Eq. 41 occurs only 
for a gas diffusing in small pores of diameter d. In this 
case, the diffusing species is much more likely to collide 
with the pore walls than with other molecules. Thus the 
combination of Eqs. 41 and 34 becomes: 




(45) 


This case, called Knudsen diffusion, has a diffusion 
coefficient which depends on the pore diameter d, but 
not on the molecular diameter a. Now, the diffusion 
coefficient is independent of pressure, though it does 
vary inversely with the square root of solute mass. 
Under ambient temperature and pressure, Knudsen 
diffusion is important when the pores are much less 
than 0.1 pm. 

Liquids. Diffusion in liquids is normally not 
described by a kinetic theory but as the motion of 
a rigid, spherical solute diffusing in a continuum of 
solvent. Despite the major approximations obviously 
made by this simple model, it gives remarkably good 
results. It is the standard against which new predictions 
are always judged. 

The model begins by describing the friction on 
a solute sphere: 


Force = [Coefficient of friction /] x Velocity v\ 

(46) 


The velocity Vj now is the average and not the sonic 
value v used for gases. The coefficient of friction / is 
given by Stokes Law: 

/ = 6niuR (47) 

where p is the solvent velocity and R is the solute 
radius. The force was suggested by Einstein to be the 


~^ = [6tvK]vi 


=-v(fc B nn Cl ) = 


T dc\ 
Ci dz 


(48) 


Rearranging: 

_ ( k B T \ dci 
Cl Vl \6nfiR) dz 


(49) 


But (ciVi) is the total flux n 1? equal in dilute solution to 
the diffusion flux;!. Comparing this with Eq. 33 gives: 


D = 


hr 

6n/iR 


(50) 


The diffusion coefficient in liquids varies inversely with 
solute size and with solvent velocity. This simple rela¬ 
tion is called the Stokes-Einstein equation. 

Equation 50 often gives good estimates of diffusion 
in liquids. Its simplicity is an invitation to attempt 
improvements. These include assuming the sphere is 
not solid but gas, replacing the sphere with an ellipsoid, 
allowing the sphere to spin, and putting it in a 
small pore. Other attempts at improvement allow for 
nonideal solutions, assigning different friction coeffi¬ 
cients to solute and solvent, and considering changes 
close to the spinoidal. While none of these efforts is 
definitive, each can clarify the perspective of a particu¬ 
lar chemical system. Still, the simple Stokes-Einstein 
equation is the best place to start for understanding 
diffusion in liquids. 

Solids. As explained above, the diffusion in solids is 
so slow that most transport usually occurs in any 
fluid-filled flows and voids within the solid. Some 
solid processes are certainly dramatically affected by 
diffusion - metallic welds and doped semiconductors 
are two good examples - but the diffusion of chemicals 
in the environment is usually through fluids. 

The relative unimportance of diffusion in solids is 
fortunate, because diffusion coefficients in solids scat¬ 
ter. These coefficients do not cluster around a single 
value, and they depend strongly on crystal structures. 
For example, the diffusion of carbon in body-centered 
cubic iron is 10 10 times faster than the diffusion of 
carbon in face-centered cubic iron. Sometimes, an 
anomalously high coefficient reflects different types of 
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vacancies in the solid crystals. For example, silver ion 
diffuses 10 17 times faster in AgCl than sodium ion 
diffuses in NaCl. 

Estimates of diffusion in solids, which normally 
begin with a face-centered cubic lattice, assume a coef¬ 
ficient given by: 

D = R 2 Nco (51) 

where R is now the distance between atoms or ions in 
the crystal; N is the dimensionless fraction of vacant 
sites; and co is the jump frequency, the number of 
atomic or ionic movements per time. The size of R is 
estimated from crystal structure, and the fraction N 
from the free energy of mixing. The jump frequency 
co is often felt to have an Arrhenius temperature depen¬ 
dence. Arguments like this are not predictions but are 
rationales to organize data. 

Diffusion Across Interfaces 

Diffusion from one phase to another is an important 
and complex limit, a source of confusion for many. In 
this case, there are two limits that are close parallels to 
the cases of a thin film and a semi-infinite slab 
discussed above in the sections “Diffusion Across 
a Thin Film” and “Other Important Cases.” One case 
is exemplified by the so-called infinite couple, when 
two alloy bars of different but homogeneous com¬ 
position are closely joined together. In this case, 
each atomic species can diffuse between the two 
bars, giving concentration profiles that are known. 
This limit is rarely important in environmental science 
and engineering. 

The second, much more important limit occurs 
when solutes diffuse from one relatively well-mixed 
phase across a phase boundary to a second relatively 
well-mixed phase. This limit approximates what hap¬ 
pens when sulfur dioxide in the air diffuses into a lake. 
In this case, bulk air is often well-mixed, and the bulk 
water in the lake is, too. However, this good mixing 
does not extend all the way to the air-water interface. 
About the last one millimeter of air and about the last 
one-tenth millimeter of the water are not well-mixed. 
Diffusion across these two films, one in air and 
the other in the water, is what governs the rate of sulfur 
dioxide dissolution in the lake. 

We develop these ideas below. “The Mathematics of 
Mass Transfer” derives the mathematical framework. 


“Concentration Units” details transport across inter¬ 
faces. “The Meaning of q*” uses this framework to 
calculate the mass transfer in several environmentally 
relevant situations. 

The Mathematics of Mass Transfer 

To begin our study of mass transfer, imagine a small 
volume of air containing hydrogen sulfide at concen¬ 
tration q 0 that is suddenly contacted with a large 
volume of water. The sulfide dissolves in the water so 
that its concentration c 1 drops with time. Predicting 
this concentration change with the diffusion equations 
given above is possible, but difficult. Often, an easier 
prediction is to use an alternative tool, a mass 
transfer analysis, which is more suitable for engineering 
applications. 

This mass transfer analysis begins by writing a mass 
balance on the H 2 S in the air: 

[Accumulation in the air] = 

[Amount dissolved in water] 

V^=-AK(c l -c 1 *) (52) 

where V is the air volume, A is the interfacial area 
between air and water, and c 1 * is proportional to the 
concentration of the H 2 S in the water. When there is 
a lot of pure, well-mixed water present, this concentra¬ 
tion is zero. The rate constant K in Eq. 52 is an overall 
mass transfer coefficient, a function of H 2 S diffusion in 
both the water and the air. It has the units of velocity, 
that is, of length L per time t. This mass balance is 
subject to the initial condition: 

t = o Cl S= Cio (53) 

Integrating, Eq. 52 becomes: 

fL =e ~ K (4)t (54) 

Uo 

The H 2 S concentration in the air decays exponentially 
with time, as if it were undergoing a first-order chem¬ 
ical reaction. The rate constant of this reaction ( KA/V) 
has units of reciprocal time. However, the concentra¬ 
tion in air is not dropping because of a chemical reac¬ 
tion but because of diffusion of H 2 S from the air into 
the water. 




2026 


c 


Chemicals in the Environment, Diffusive Transport 


As a second example, imagine absorbing carbon 
dioxide from flue gas. The flue gas is steadily flowing 
upward in a small absorption tower. Excess strong base 
is steadily flowing downward through the tower. 
A mass balance on the carbon dioxide in a small dif¬ 
ferential volume dV in the tower results in: 

[Accumulation in dV\ = [C0 2 Flow in — out] 

+ [C0 2 Absorbed by base] 

0 = Q^-Ka( Cl -cn (55) 

where Q is the volumetric flow rate of flue gas, a is the 
interfacial area per volume in the tower, and q* is 
about zero because the base is strong and there is a lot 
of it. As before, K is an overall mass transfer coefficient 
describing the rate of reaction. This mass balance is 
subject to a boundary condition: 


II 

o 

II 

o 

(56) 

Integration gives: 


Q 

i 

II 

(57) 


In gases, the units are sometimes partial pressures; in 
liquids, the units are often mole fractions. 

Expressing concentrations as partial pressures or 
mole fractions leads to different definitions of mass 
transfer coefficients. In particular, the total flux across 
the interface Ni from one dilute gaseous solution into 
another dilute liquid solution may be defined as: 


n l I interface A I interface 

= K{c i - Cl *) 


(58) 


where q is the concentration of species “1” in the gas. 
The restriction to dilute solution is not a major 
constraint. Alternatively, the interfacial flux can be 
defined as: 


Ni = K p {p i - Pl *) (59) 

where Pl is the partial pressure of solute “1” in the gas, 
and K p is a new, different overall mass transfer coeffi¬ 
cient. But from the ideal gas law: 


Pi = 


n x RT 

V 


q RT 


(60) 


Comparing the two equations shows: 


The C0 2 concentration exiting the absorption column 
decreases exponentially as the column volume V is 
increased or as the column flow Q is decreased. Note 
that Eqs. 54 and 57 are complete mathematical paral¬ 
lels, even though the former describes unsteady disso¬ 
lution without flow, and the latter describes steady 
absorption with flow. 

Interfacial mass transfer is not hard. It is just an 
alternative description of diffusion which comple¬ 
ments that given by Fick’s Law. The three features do 
make interfacial mass transfer complicated. These three 
complications are the units of concentration, the 
detailed meaning of q*, and the values of the mass 
transfer coefficient K. Details of these features follow. 

Concentration Units 

The first issue, concentration units, results because the 
units used for clearly explaining the ideas are not always 
those easiest to use in practice. The concentration units 
implied in this essay are of the amount per volume, for 
example, moles per liter or grams per cubic meter. 
The concentration units used in practice are different. 


If K has units of meters per second, then K p may have 
units of moles per square meter per second per pascal. 

Similarly, for mass transfer from a liquid into a gas, 
an alternative definition is: 

Ni = K(* - q*) (62) 

where q is now the concentration of species “l”in the 
liquid, and K is an overall mass transfer coefficient 
different from that in Eq. 58. Alternatively, 

Ni =K x (x x -X!*) (63) 

where x x is the mole fraction of species “1” in the liquid, 
and K x is still another overall mass transfer coefficient. 
Because 

q = cx i (64) 

where c is the total concentration in the liquid, the two 
overall coefficients are related: 

K x = cK 


(65) 
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For example, if K is in meters per second, and c is in 
moles per meter cubed, then K x will have units of moles 
per square meters per second. Other definitions of 
coefficients are also possible, but are no harder. 

The Meaning of q* 

The meaning of the concentration q* appearing in 
Eqs. 52, 55, 58, 59, and 62 is the hardest step in this 
description. These flux equations all assert that the flux 
is proportional to a concentration difference. The flux 
will be zero when the concentration is zero. Thus, q * 
must be the hypothetical gaseous concentration of spe¬ 
cies “1” that is in equilibrium with species “1” dissolved 
in the liquid. This is harder than interfacial heat trans¬ 
fer: there, the heat flux is proportional to the temper¬ 
ature on one side of the interface minus that on the 
other side. Here, the mass flux is proportional to 
a concentration difference which equals to one real 
concentration that does exist minus a second one 
which is hypothetical. 

To be more specific, imagine the case in Eq. 58, 
where q is the actual concentration of species “1” in 
the well-mixed, bulk gas on one side of the interface. 
The concentration q * is equal to the concentration in 
the well-mixed, bulk liquid times some type of Henry’s 
Law constant, which describes equilibrium between gas 
and liquid. Sometimes, those studying this point for 
the first time can be helped by silently chanting 

► q* is the concentration that would be in the gas if it 

were in equilibrium with the liquid (which it isn't). 

Remembering this chant may help mastering this dif¬ 
ficult point. 

To try to make this point clearer, imagine in calcu¬ 
lating the flux of oxygen in air into wastewater with 
a concentration of 1 x 10 -4 mol/f. At equilibrium, 

q (gas) = 30q (liquid) (66) 

Thus, 

q*(gas) = 30 x 10 -4 mol/£ (67) 

As a result, 


Understanding problems like these is often helped by 
always checking what happens when the system is at 
equilibrium. 

Values of Mass Transfer Coefficients 

We now turn to the variations of the overall mass 
transfer coefficient with quantities like the diffusion 
coefficient in the adjacent phases. The most common 
case is that of transfer from a gas into a liquid. The 
concentration in the gas is expressed as a partial pres¬ 
sure, and the concentration in the liquid is expressed as 
a mole fraction. The flux Ni across the interface is then: 


Ni = K p ( Pl - Pl *) 

= k p ( P i - pu) (69) 

= k x (xu - x x ) 

where pi and are the average concentrations in the 
gas and liquid, respectively; and pu and x xi are the 
corresponding but unknown gas and liquid concentra¬ 
tions at the interface. The mass transfer coefficients k p 
and k x describe transport in the gas and in the liquid. 
Sensibly, the individual mass transfer coefficient k p is 
a function of diffusion in the gas, but not of diffusion in 
the liquid; and the individual mass transfer coefficient 
k x is the reverse. 

The concentrations across the interface will nor¬ 
mally be in equilibrium, so that: 

pu = Hx u (70) 

where H is a Henry’s Law constant. Combining this 
constraint with Eq. 69 gives: 


Ni = 


2 _ + K 

kp ^ A 


(pi - Hx i 


(71) 


By comparing this with the overall mass transfer coef¬ 
ficient K p yields: 

Kp = TT h ( 72 ) 

k p k x 

Pi* = Hxi (73) 


c 


Cl - Cl = 


- 30 x 10“ 


0.21 mol 

22Ai 
64 x 10 -4 mol j £ 


. mol 


( 68 ) 


These are the results sought. Obviously, similar equa¬ 
tions are possible for other concentrations and other 
equilibria analogous to Henry’s Law. 
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The way in which k p and k x vary with the diffusion 
coefficients can be estimated either from experiments 
or from theories. The experiments are summarized as 
correlations, most often in terms of dimensionless 
numbers. For example, for mass transfer into a liquid 
flowing through a packed tower with packing of size d, 
the most widely accepted correlation is: 

k x = /c(liquid)c(liquid) (74) 

* (Uquid) (4) 3 = °- 0051 (t;) 0 ' 67 (v) {ad)0A 

(75) 

where v is the kinematic viscosity of the liquid, g is the 
acceleration due to gravity, v is the superficial liquid 
velocity, a is the surface area per volume of the packing, 
and D is the diffusion coefficient in the liquid. 
The quantity (v/av) is one form of the dimensionless 
Reynolds number; the quotient (v/D) is the dimension¬ 
less Schmidt number. Correlations like this, which are 
based on extensive experiments, should be used for 
estimates whenever possible. 

In many cases, however, appropriate correlations 
may not be reliable, or may not be available at all. 
In these cases, estimates for liquids can be made by 
assuming that: 


values. They have discussed mass transfer coefficients 
as an alternative description of interfacial diffusion 
frequently valuable in environmental engineering. 
None of the ideas presented are especially difficult to 
understand. 

However, actually putting these ideas into practice 
can be complicated, largely because of difficult units 
and subtle definitions. This final section considers 
specific chemical examples that illustrate the ideas 
involved. These examples are approximate but can 
serve as a warning of where trouble can occur. 


Overall Mass Transfer Coefficient of Oxygen 
(Case #1) 


Imagine wanting to estimate the mass transfer coeffi¬ 
cient K p of oxygen from air into water. From Table 2 
and Eq. 77, 


fc(gas) = 


D 

1 


2 x 10“ 4 m 2 /s 
10 -3 m 


= 0.02 m/s 


(78) 


The coefficient k p is found from this by a unit 
conversion: 


_ k( gas) 
p RT 


0.02 m/s mol 

_ i __ g_ 

(8.2 x 10“ 6 ^f)298°K m 2 satm 

(79) 


t(liquid)=5 = _g_ (76) 

where D is the diffusion coefficient in the liquid and i is 
often called the film thickness or the boundary layer. 
This casual description can be confusing, because these 
terms are more specifically defined in theories of mass 
transfer. A corresponding estimate for gases is: 

•fr-)-!- iip= (77) 

where D is now the diffusion coefficient in gases, typ¬ 
ically 10 4 times larger than that in liquids. Equations 76 
and 77 are major approximations to be used only in 
desperation. 

Important Special Cases 

The sections above describe the mathematics of diffu¬ 
sion and dispersion. They have summarized character¬ 
istics of diffusion coefficients and listed some typical 


Similarly, from Table 3 and Eq. 76, 

7/1 . . 1N 2xl0 -9 m 2 /s _s / / \ 

fc(hquid) =--= 2x10 5 m/s (80) 

The coefficient k x has a different conversion: 


kx = fc(liquid)c = (2x10 5 
mol 

= !•! y 

ims 



1 mol 
18 x 10“ 6 




(81) 


Henry’s Law for this system is (cf. Eq. 70): 

pi = (4.3 x 10 4 atm)xi (82) 

Thus from Eq. 72, 


K p = 


1 

m 2 s atm _i_ 4.3 x 10 4 atm m 2 s 
8 mol ' 1.1 mol 


= 2.6 x 10 -5 


mol 

m 2 s atm 


( 83 ) 
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This coefficient describes oxygen transport between air 
and water when the concentration difference is 
expressed as partial pressures of oxygen. 


Overall Mass Transfer Coefficient of Oxygen 
(Case #2) 


The example above is straightforward because it 
matches the detailed equations given earlier. The 
same problem can be solved in different units. Thus, 
the flux Ah is: 


M = K c ( Cl * - Cl ) 


(84) 


where Ci is the actual oxygen concentration in water, 
Cj* is the oxygen concentration in water that is in 
equilibrium with air, K c is a different overall mass 
transfer coefficient given by: 

K c= m r~ I (85) 

k( gas) ' k (liquid) 

and m is a different form of Henry’s Law constant, 
defined by the equilibrium: 

Ci (liquid) = mci(gas) (86) 

Comparing Eqs. 81 and 82 gives: 


c(liquid)PT 
H 
mol 

18 x 10 -6 nT 
298°K 

4.3 x 10 _4 atm 
0.03 


8.2 x 10“ 


j nr atm 
mol°K 


(87) 


The numerical value of Henry’s Law constant is 
completely different. The combination of Eqs. 78, 80, 
85, and 87 gives: 


^ _ -1 

— 0.03 | 3 

0.02 m/s ' 2x10 5 m/s 

= 2 x 10 -5 m/s 


( 88 ) 


In both this formulation and that in Eq. 83, diffusion 
in the liquid dominates the mass transfer. This is 
often taken as a consequence of the slower diffusion 
in the liquid. This is not completely true, as the next 
example shows. 


Overall Mass Transfer Coefficient of Ammonia from 
Air into Water 


This example illustrates how the rate at which ammo¬ 
nia is dissolved in water can be estimated. The indi¬ 
vidual mass transfer coefficients of ammonia are 
easily found: 


fc(gas) 


£>(gas) 

t 


2.3 x 10 5 m 2 /s 
10 _3 m 


0.023 m/s 
(89) 


and 


. . . . .D(liquid) 1.6x10 9 m 2 /s 

= —— = , 0 -, m 

= 1.6 x 10 -4 m/s 


(90) 


One Henry’s Law constant for dilute acid is given in the 
literature as: 


pi (atm) = ( 7000 


. atmf? 
mol 


Ci (liquid, molar) (91) 


Converting the pressure into a molar concentration: 


Ci (gas, molar) = ( 7000 


atm £ 


mol°K 


mol J V 0.082 atm 


Ci (liquid, molar) 


298°K 
= 290ci (liquid, molar) 

Combining these results with Eq. 85 gives: 
1 

K c = 


290 


: + 


0.023 m/s 1 1.6x 10 -5 m/s 

= 1.3 x 10 -4 m/s 


(92) 


(93) 


The much higher solubility of ammonia in dilute acid 
means the mass transfer is now more affected by diffu¬ 
sion in air. 


Toxin Diffusion in a Biofilm 

The final example imagines a dilute toxin dissolved in 
water and metabolized irreversibly by microorganisms 
immobilized in a biofilm. This example, which is not as 
chemically specific as the first three, also assumes that 
the concentration of dissolved oxygen is much greater 
than the concentration of the toxin. Thus, the rate per 


c 
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biofilm area Ni is given by the overall rate of diffusion 
of the toxin to the biofilm, followed by the diffusion 
and reaction of the toxin within the biofilm. 

This overall rate is mathematically equivalent to 
mass transfer across an interface, where the solute dif¬ 
fused through the gas to reach the interface, quickly 
crossed the interface, and then diffused into the liquid. 
In fact, the biofilm case is often easier because most 
biofilms are largely water and hence their partition 
coefficient m is one. Thus, 

Ni = ! ? ! (94) 

/c(liquid) ' /c(biofilm) 

The coefficient k (liquid) can often be found from mass 
transfer correlations; Equation 76 provides a first guess. 
The value in the biofilm depends on the details of the 
reaction. However, delightfully, most theories give the 
same result: 


k( biofilm) = 


D (biofilm) 2 

T 


(95) 


where t is the half-life of the reaction. While beyond the 
scope of this entry, this result is carefully derived in 
most books on diffusion and reaction. This result 
underscores the value of the simple ideas of diffusion 
and reaction presented here. 


Future Directions 

After 150 years of concentrated effort, diffusion is 
an established subject. Active research does continue 
on, for example, semiconductors and polymer mem¬ 
branes, but this does not have major environmental 
application. 

Diffusion is an important tool for describing envi¬ 
ronmentally significant mass transfer. In many cases, 
this transfer can be described in terms of diffusion 
coefficients. In many air pollution problems, mass 
transfer can be described in terms of dispersion, 
which is mathematically similar to diffusion but due 
to coupled diffusion and flow. 

The underused description of mass transfer, espe¬ 
cially across interfaces, is in terms of mass transfer 
coefficients. These are functions of diffusion coeffi¬ 
cients and of other parameters, like velocity and vis¬ 
cosity. Exploiting this topic offers potential gain for 
environmental engineering. 
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Glossary 

Convection The movement of a constituent with 
movement of the fluid. 

Diffusion The spreading of fluid constituents through 
the motion inherent to atoms and molecules. 

Diffusion coefficient A coefficient that describes the 
tendency of molecules to spread a constituent mass. 

Dirac delta An impulse of a given quantity (mass) that 
occurs over an infinitely short time or space. 

Dispersion The process of mixing caused by a varia¬ 
tion in velocity and transverse diffusion or turbu¬ 
lent diffusion. 

Dispersion coefficient A coefficient that can describe 
the mixing caused by a transverse velocity profile 
and transverse diffusion or turbulent diffusion. 
A dispersion coefficient means that some sort of 
spatial mean velocity is being used to describe the 
flow. Then, the mixing lateral or longitudinal to 
the spatial mean velocity due to a combination of 
a velocity profile and diffusion or turbulent diffu¬ 
sion is described by the dispersion coefficient. The 
coefficient’s location in the mass transport equation 
is similar to diffusion coefficients, and the units are 
similar. 

Laminar flow Flow that has no turbulent eddies, 
where the fluid flows in laminas and diffusion cre¬ 
ates the mixing of the fluid. 

Turbulent diffusion The mixing of fluids through 
turbulent eddies created by convection. 

Turbulent diffusion coefficient A coefficient that 
comes from the multiplication of two turbulent 
velocities of the flow, divided by density of the 
fluid. The coefficient’s location in the mass trans¬ 
port equation is similar to diffusion coefficients, 
and the units are similar, so it is called a “turbulent 
diffusion coefficient.” 

Definition of Dispersive Transport in the 

Environment 

Dispersion is the enhanced mixing of material through 

spatial variations in velocity. When it is of interest 


(when we are not keeping track of the three- 
dimensional mixing), dispersion is typically one or 
two orders of magnitude greater than turbulent diffu¬ 
sion. The process of dispersion is associated with 
a spatial mean velocity, the assumption of plug flow, 
and a velocity profile. The means used in association 
with diffusion, turbulent diffusion, and dispersion are 
identified in Table 1. 

The means by which diffusion and possibly turbu¬ 
lent diffusion are combined with a spatial mean veloc¬ 
ity to result in dispersion is illustrated in Fig. 1. 
A velocity profile over space with mixing due to diffu¬ 
sion (and possibly turbulent diffusion) is combined 
into a cross-sectional mean velocity and a dispersion 
coefficient. Without the cross-sectional mean velocity, 
there is no dispersion coefficient. 

Introduction 

Dispersion was first developed as a means of dealing 
with reactors, where there was little interest in the 
processes creating mixing inside of the reactor, but 
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Table 1 Temporal or spatial means and scales used in 
association with various mixing processes 


Process 

Variable 

representing 

process 

Mean 

Scale of 

mean 

Diffusion 

Diffusion 

coefficient 

Temporal 

Molecular 

Turbulent 

diffusion 

Turbulent diffusion 
coefficient 

Temporal 

Minutes 

Dispersion 

Dispersion 

coefficient 

Spatial 

Scale of 
flow 


Diffusion or Turbulent Dispersion 

Diffusion 

=> 

+ /p + 

Velocity Profile Uniform Velocity Cross-Sectional 

y Assumption Mean Velocity 

Chemicals in the Environment, Dispersive Transport. 
Figure 1 

Representation of the process by which diffusion or 
turbulent diffusion is related to dispersion 
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great need to appropriately describe the output from 
the reactor. The physics of the mixing process is lost 
in the conversion to a spatial mean velocity profile, but 
the end result can still be modeled by dispersion. 

A similar spatial mean velocity (bulk mean velocity) 
is used for the plug flow reactor model. Thus, plug flow 
with dispersion is a natural match, where the mixing 
that truly occurs in any reactor or environmental flow 
is modeled as dispersion. This is the model that will be 
applied to utilize dispersion as a mixing model. 


Dispersion in Laminar Flow 


Any flow with a nonuniform velocity profile will, when 
spatial mean velocity and concentration are taken, 
result in dispersion of the chemical. For laminar flow, 
the well-described velocity profile means that we can 
describe dispersion analytically for some flows. Begin¬ 
ning with the diffusion equation in cylindrical coordi¬ 
nates (laminar flow typically occurs in small tubes): 


dC dC _ \d 2 C 1 dC d 2 C 
dt + U Q x dr 2 r dr dx 2 


( 1 ) 


where the x-coordinate is aligned with the flow velocity, 
and v = w = 0. We will outline the development of 
a dispersion coefficient for a fully developed laminar 
pipe flow. This means that we are far enough from the 
entrance that the velocity profile is essentially in equi¬ 
librium with the loss of pressure along the pipe. This 
flow has a velocity profile (developed in most fluid 
mechanics texts): 


u = IWU - r 2 /i? 2 ) 


( 2 ) 


as illustrated in Fig. 2. 

To convert Eq. 1 to cross-sectional mean values, we 
will assign: 


C = 


7 zR 2 


C2nr dr 


U = 


7 zR 2 


U 2nr dr = 


U m 


Then, in Eq. 1 : 


(3) 

(4) 


C= C+C'(r) 

and 

u=U + u’{r ) 

Now, if we equate Eq. 2 and 6, we get 


(5) 

( 6 ) 

(7) 


Substituting Eqs. 5, 6, and 7 into Eq. 1: 




= D 


<9 2 (C+C') 1 <9(C+C') <9 2 (C+C') 

dr 2 + r dr + dx 2 


+ S 
(8) 
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The velocity profile in a fully developed tubular flow is a paraboloid 
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An order of magnitude analysis will tell us that 
d 2 /dr 2 > > 3V3* 2 . In addition, dC/dr = 0, by defini¬ 
tion. Now, if we put everything that is known on the 
left-hand side of Eq. 8, the result will be: 


mean values. Thus, if D L is involved, we are discussing 
cross-sectional mean concentrations. 

Dispersion in Turbulent Flow 


dC dC 
dt + dx 


^ + u d -^ + u^ + u' 

dt dx dx 


dC 

dx 


d 2 C' DdC' d 2 C 

+£* 02^- f) - $ ~ + S 

or 1 r or ox 1 


(9) 


The second equality in Eq. 9 is a definition of 
longitudinal dispersion coefficient, D L . G. I. Taylor 
[1] assumed that some of the terms in Eq. 9 would 
cancel and that longitudinal convective transport 
would achieve a balance with transverse diffusive trans¬ 
port. He then solved the second equality in Eq. 9 for 
a fully developed tubular flow, resulting in the relation, 


The dispersion that occurs in turbulent flow can also be 
calculated, as long as the velocity profile is given. This 
was done by Taylor [2] for a tubular flow and by Elder 
[3] for a two-dimensional, open-channel flow. Both 
investigators assumed that a logarithmic velocity pro¬ 
file would apply in the entire flow field. The logarith¬ 
mic velocity profile is a fairly good description where 
shear stress can be assumed constant. It is a good 
assumption for a fully developed turbulent flow field, 
because the locations where the logarithmic profile 
applies are those with the greatest change in velocity. 

For a fully developed tubular flow, assuming a log¬ 
arithmic velocity profile, Taylor derived the equation, 


c 


R 2 U 2 _d 2 U 2 
48D “ 192D 


( 10 ) 


A similar relation can be developed for laminar flow 
down an inclined plate: 


32 h 2 U 2 
945 D 


(ii) 


The longitudinal dispersion coefficient is propor¬ 
tional to the square of the flow scale (d or h), propor¬ 
tional to the square of the velocity scale ( U ), and 
inversely proportional to the diffusion coefficient. The 
greater the diffusion, the less severe the spread of the 
chemical by the velocity profile because of local mixing, 
and the smaller the longitudinal dispersion coefficient. 
This result may seem illogical, but can be explained 
by the following: Longitudinal dispersion describes 
mixing only in terms of the cross-sectional mean con¬ 
centration, and transverse mixing actually slows down 
longitudinal dispersion. The governing Eq. 9 does not 
concern itself with local mixing issues. 

Knowing the relations given in Eqs. 10 and 11, we 
no longer need the more cumbersome middle portion 
of Eq. 8, and we can work to solve the equation, 


dC dC d 2 C 
~di +U ^c~ Dh ^ + S 


( 12 ) 


In expressing the equations for longitudinal disper¬ 
sion, we will drop the “hat” above the cross-sectional 


Dl = 5.05 d u * 


(13) 


where u* is the shear velocity at the wall. Elder [3] 
derived the following equation for a two-dimensional, 
open-channel flow: 

D l = 5.93 h u * (14) 


It is interesting to compare Eqs. 13 and 14 with 
those for a fully developed laminar flow, Eqs. 10 and 
11. In the chapter on Turbulent Transport, eddy diffu¬ 
sion coefficient in a turbulent boundary layer was 
found to be linearly dependent upon distance from 
the wall and on the wall shear velocity. If we replace 
the diffusion coefficient in Eq. 11 with an eddy diffu- 
sivity that is proportional to h u*, we get, 


Dl 


h 2 U 2 
hu* 


(15) 


Noting that for a given boundary roughness we can 
generally say that U ~ w*, Eq. 15 becomes, 

D l ~ hu* (16) 


which is what we have in Eq. 14. 

The relations developed for longitudinal dispersion 
coefficient are given in Table 2. The experimental 
results in rivers tend to have a large range because 
of the variety of lateral velocity profiles that exist in 
natural rivers and streams. 
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Table 2 Relationships for longitudinal dispersion 
coefficient in pipes and channels developed from theory 
and experiments 


Flow conditions 

D L 

Notation 

Laminar flow in a pipe [1] 

R 2 U 2 

48 D 

R = radius of 
tube 

Laminar flow down an 

32 h 2 U 2 

U = cross- 

sectional mean 
velocity 

D = diffusivity 
h = depth 

11/p = velocity of 

inclined plate 

945 D 

Laminar flow-linear 
velocity profile (Couette 
Flow) 

V 2 A 2 

p z 

120 D 

Turbulent flow in a pipe, 
assuming logarithmic 
velocity profile [2] 

10.1 Ru* 

upper plate 

A z = spacing of 
plates 
u * =shear 
velocity 

turbulent flow down an 
inclined plate, assuming 
logarithmic velocity 
profile [3] 

5.93 /iu* 

Open flume 
(experimental) 

8-400 hi/* 


Canals (experimental) 

8-20 hu * 


Rivers (experimental) 

8-7,500 hu* 



Solutions to Transport with Convection 


Diffusive transport with convection occurring simulta¬ 
neously can be solved more easily if we orient our 
coordinate system properly. First, we must orient one 
axis in the direction of the flow. In this case, we will 
choose the x-coordinate so that u is nonzero and v and 
w are zero. Second, we must assume a uniform velocity 
profile, u-U - constant with y and z. Then Eq. 30 of 
the Transport in the Environment chapter becomes 


dC UdC 
d t + R dx 


d r d 2 C 

R dx 2 


d 2 C d 2 Cl S 
d y 2 + dz 2 _ + R 

(17) 


where S is a source or sink term other than adsorption 
or desorption. In Eq. 17, we are assuming that chemical 
reaction does not take place on the surface of a solid, 
i.e., while the chemical is sorbed to solids. Now, we will 
convert our Eulerian (fixed) coordinate system to one 
that moves (Lagrangian) with velocity U/R and assign 
an independent variable 
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Figure 3 

Pulse response in fixed and moving coordinate systems 


x*=x-Ut/R (18) 

such that x* = 0 at x = U t/R. The response of the 
system to a conservative pulse, given in Fig. 3, indicates 
that in the Lagrangian coordinate system specified, 
there is no convection term, only diffusion. Then 
Eq. 17 becomes 

dC _D\d 2 C d 2 C d 2 C 
dt R dx* 2 + dy 2 + dz 2 

Determination of Dispersion Coefficient from 
Tracer Clouds 


+ f (19) 


The one-dimensional mass transport equation for 
plug flow with dispersion, and a retardation coefficient 
of 1, is 


dC dC d 2 C 
~N +U d^- Dl d^ + S 


( 20 ) 


We will convert our fixed coordinate system to 
a coordinate system moving at velocity U through the 
change of variables, x* = x — Ut. Then Eq. 20 is given as 


dC 

dt 


Dl 


d 2 C 

dx* 2 


( 21 ) 


Now, if we are determining the dispersion coeffi¬ 
cient through the use of a pulse tracer cloud, the 
boundary conditions are those of a Dirac delta: 


1. At t = 0, a pulse of mass M is released at x* = 0. 

2. As t => oo, C => 0. 


The solution to Eq. 21 with these boundary condi¬ 
tions [4] is: 


m/a -fr-^) 2 

C — _ - _ p 

y/iWWt 


( 22 ) 
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or, in dimensionless variables, 


C 


C 


C 0 Mj (AL) ^/4nCou/Pi 


:exp 


(1 — Cou ) 2 


4 Cou/Pe 


(23) 

Where Cou = U t/L = t/t r is a Courant number and 
Pe-U L/D l is a Peclet number. Comparing Eq. 23 to 
a Gaussian probability distribution, 

1 r .il' 

2(7 2 




■a/27T 


exp 


we can see that the plug flow with dispersion, when 
Pe> 10, can be fit to a Gaussian distribution in terms of 
C/C 0 and (1—9), with the relationships, 

^2 


a 2 — —~ = 2 Cou/Pe 


t 2 


where 


(24) 


J t 2 Cj Co dt 
o 

oo 

J C/Codt 


(25) 


and 


t r = - 


J t 2 C/ C 0 dt 

3 

oo 

J C/C 0 dt 


(26) 


In addition, if we could measure the tracer cloud 
over distance at one time, we would use the relation, 


a 


U 2 t r 


where 


(27) 


OO oo 

§x 2 C/C 0 dx ]xC/C 0 dx 

= ^ -* 2 and ^ = ^- 

J C/C 0 dx J C/C 0 dx 

o o 

(28) 

The response of a plug flow with dispersion model 
to a pulse input, Eq. 23, is given in Fig. 4 for various 
values of the Peclet number, Pe = U L/D l . 

Dispersion in Groundwater Flow 

Dispersion in a flow through porous media occurs due 
to heterogeneity in the media, i.e., the conductivity of 
the soil varies with space. This is shown on three levels 
in Fig. 5. On the particle scale, a thread of tracer will be 
split a number of times as it moves through the media. 
Each split of the tracer thread will move through the 
media at a speed corresponding to the resistance that it 
encounters. If you take a number of tracer threads 
coming out of the media at different times, and col¬ 
lected them in an outlet pipe, what you would see at the 


c 


o 

o 



t/t r 
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Response of the plug flow with dispersion model to a pulse input 
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Two flow paths on the particle scale move through the media at different rates. 


On a larger level, fingering is caused by layered beds with a low k 
(conductivity) and lenses with a high k 



Lenses cause 
both 

longitudinal 
and lateral 
dispersion 


On a still-larger spatial and temporal scale, a “dispersion” cloud 


New 
Brighten 
city limits 



Town scale 
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Illustration of dispersion in groundwater flow at various scales (From [5]) 


end of the pipe would be a dispersed pulse. This dis¬ 
persion would be much greater than the diffusion that 
would occur. A lateral dispersion would also occur 
because the media would move some of the tracer 
threads laterally. Thus, in groundwater flow, there gen¬ 
erally is longitudinal and lateral dispersion, created by 
the character of the media. 

On a larger scale, a similar process can occur. 
Fingering of the tracer is created by layered beds with 


a low conductivity and lenses with a high conductivity. 
The tracer that ends up in a lens travels at a relatively 
high speed. Those tracer molecules will reach the mea¬ 
suring point sooner than the tracer molecules stuck in 
the low conductivity beds, creating longitudinal disper¬ 
sion. As the lenses are not all parallel to each other, they 
will also create a lateral dispersion of the tracer. 

Finally, on a still larger scale, about the scale of 
a small town, there will be isopleths (lines of constant 
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concentration) of our tracer that look something like 
those given in the last illustration of Fig. 4. If enough 
particle and lens effects have occurred with an apparent 
randomness to our tracer cloud, then the cloud dis¬ 
perses in a manner similar to that illustrated. 

Incorporating the retardation coefficient for the 
chemical, discussed in the chapter on Transport in the 
Environment, the mass transport equation is then 
written as: 


dC dC (D + DJ &C (P + Dj ffC 

dt dx R dx 2 R dy 2 

(£> + £> z )d 2 C 
R dz 2 

(29) 


where R is a retardation coefficient, and D x , Dy, and D z 
are the dispersion coefficients in the x-, y-, and 
z-directions, respectively. 

The dispersion coefficients are dependent upon the 
character of the media and the flow velocity. It is 
difficult to predict these coefficients within an order 
of magnitude, so they are normally measured or fit to 
measured data in the field. For example, if a concen¬ 
tration variance in the x, y, and z directions can be 
measured in response to a pulse release, then an 
approximate solution would be: 


c _ MR 15 r -Rx* 2 -Ry 2 -Rz 2 ' 

- 8(t z 3 t 3 D x Dy D z ) 0 ' 5 exp 4 D x t 4 D y t 4 D z t 

(30) 

where x* is x — Ut/R. Comparing Eq. 30 to that for 
a three variable Gaussian distribution, we can see that: 


D x = 9 Ra x 2 / (2 1) 

(31a) 

D y = 9 R Gy 2 /(It) 

(31b) 

D z = 9 Ro 2 /{It) 

(31c) 


where a 2 , g 2 and o 2 z are the variance of concentration 
in the x, y, and z directions, respectively. They are given 
by the equations, 


J x 2 C dx f J x C dx\ 2 

J C dx \ J C dx J 

(32a) 

\ y 1 C dx 

J C dx 

(32b) 


9 f z 2 C dx 
x = J C dx 


(32c) 


The last term in Eq. 32a is the distance to the center 
of mass. In the diffusion equation, it is equal to Ut/R. 

Dispersion coefficients in groundwater flow. In a 
uniform media of particles, the longitudinal dispersion 
coefficient, D L , and the transverse dispersion coeffi¬ 
cient, Dp are both functions of the grain diameter and 
velocity. (In our previous example, D L was D x , and D t 
would indicate D y and D z ). The relevance of longitudi¬ 
nal and transverse dispersion relative to diffusion may 
therefore be very roughly characterized by a Peclet 
number, Pe: 


Pe = Ud/D (33) 

where d is the grain diameter and U is a bulk velocity, 
Q/A, where the cross section includes porous media. If 
the consideration is not of uniform media but is for 
a heterogeneous region of high and low groundwater 
flow permeability, then the appropriate length scale 
would be the size of these permeability regions, normal 
to the flow. The characterization is [6]; 

D l /D ~ Pe (34) 

Koch and Brady [7] have characterized transverse 
dispersion coefficient as a fraction of longitudinal dis¬ 
persion coefficient: 

D t /D L ~ 0.1 (35) 

Thus, longitudinal dispersion coefficient is roughly 
10 times the value of transverse dispersion coefficient 
in a uniform media. 

In the field, however, all media are heterogeneous, 
resulting in far greater dispersion than in uniform 
porous media. Because of the heterogeneities, the 
velocity profile can be highly variable over long dis¬ 
tances, creating a much greater dispersion. Gelhar et al. 
[8] provided a plot of field data that can be manipu¬ 
lated to result in an equation that applies between 
a scale of 1 and 100,000 m: 

Pe DL = l ^- = 2.5X10 1±L4 (36) 

where L is the horizontal scale of the measurement. The 
field data are given in Fig. 6, with the curves of Eq. 36. 
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Chemicals in the Environment, Dispersive Transport. Figure 6 

Field data on dispersion coefficients, taken from Gelhar et al. [8] with Eq. 36 added. Longitudinal dispersivity is D L /U 


Dispersion in Rivers 

Rivers are an excellent environmental flow for describ¬ 
ing the flow as a mean velocity with dispersion. The 
flow is confined in the transverse and vertical direc¬ 
tions, such that a cross-sectional mean velocity and 
concentration can be easily defined. In addition, there 
is less variation in rivers than there is, for example, in 
estuaries or reactors, both of which are also described 
by the plug flow with dispersion model. For that rea¬ 
son, the numerous tracer tests that have been made in 
rivers are useful to characterize longitudinal dispersion 
coefficient for use in untested river reaches. A sampling 
of the dispersion coefficients at various river reaches 
which were determined from tracer tests is given in 
Table 2. Also given are the relevant mean parameters 
for each reach. 

The question that we need to ask ourselves is 
whether the longitudinal dispersion can be predicted 
accurately for these rivers. Equation 16, which predicts 
that D L /(u*h) = constant, is shown in Table 2 to 


have a large range of constants, probably because of 
the variations in cross section and morphology seen in 
natural streams. Fischer [9] observed that this constant 
seemed to depend upon mean surface width, W, and 
substituted W for h in the numerator of Eq. 16 to 
develop the following empirical equation to character¬ 
ize longitudinal dispersion coefficient in rivers: 

U 2 W 2 

D L p = 0.011 -— 37 

u*h 

where D Lp is the predicted value of D L , h the mean 
depth, U the cross-sectional mean velocity, and u * the 
mean shear velocity of the river reach. When compared 
to the data given in Table 3 the root mean square error 
relative to the measurement, given by the equation, 

Relative = ^ /- (1- —(38) 

rms Errors W H ^ \ D L / 

where n, the number of measurements, is 1.71. This 
relative rms error means that roughly 67% of the 










Chemicals in the Environment, Dispersive Transport. Table 3 Measurements of longitudinal dispersion coefficient in laboratory flumes and rivers (after Fisher 
etal, 1979) 
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predictions are within a factor of 1.71 (within 71%) of 
the observed. 


Tracer Determination of Longitudinal Dispersion 
Coefficient in Rivers Tracers are generally used to 
determine longitudinal dispersion coefficient in rivers. 
Some distance is required, however, before the lateral 
turbulent diffusion is balanced by longitudinal convec¬ 
tion, similar to Taylor’s [1] analysis of dispersion in 
a laminar flow. This transport balancing distance, X* is 
given by the equation: 


X* 


0.2 UW 2 


(39) 


The region x < x* can be visualized as a mixing 
region, which can skew the results of a tracer study. 
Downstream of this region, where turbulent diffusion 
is balanced by longitudinal convection, the variance of 
a tracer pulse grows linearly with distance. It is best to 
begin the measurements a distance X* below the tracer 
release. Once dispersion coefficient at one or more river 
discharges have been measured, Eq. 37 can be used to 
adjust the dispersion coefficient to all discharges: 


P LQl = f U 2 W 2 \ ( u*h \ 

Duh V "*h ) Qi Wwi ) Q2 


(40) 


The technique used to perform the analysis of tracer 
studies will be the subject of Example 4. 


Example Applications 

Example 1: Determination of retardation coefficient . 

As part of a forensic investigation of a continuous 
Malathion spill, you need to determine the retardation 
coefficient of the soil at the site for Malathion. You have 
decided to do so in a column experiment with the soil, 
illustrated in Fig. 7. Also given in the figure are the 
results of a pulse test with the non-sorptive tracer, 
chloride, and the results of a pulse test with Malathion. 
What is the retardation coefficient, R? 

Chloride: 

t r = t r i = 20 min. 

o tx - 3 min. 

Malathion: 

t r = t r2 = 400 min. 

o t2 = 66 min. 



Chemicals in the Environment, Dispersive Transport. 
Figure 7 

Illustration of the column test for retardation coefficient 
and results of the tracer tests 


This test can result in a retardation coefficient from 
a comparison of both residence times and the variance. 

Residence times: The residence time of the chloride 
can be given as t r \ - L/U , and the residence time of the 
Malathion can be given as t r2 = R L/U. Thus, 

tr 2 400 min 

R = — =-= 20 

tr\ 20 mm 

Variance of tracer curves : Eq. 27 provides the 
relationships: 

a 2 = a 2 /t 2 =2D L Cou/(UL). 

Therefore, assuming that Cou = 1, 


o — 


2 DlR 
LR U" 


tr; = 


2 Dl 
UL 


trt — o; 


tri 


= R (T i 


or, 

R = o> 2 /cr fl = 66 min/3 min = 22 

Our two means of determining the retardation 
coefficient in the column gave R = 20 and R = 22. We 
can also check whether the organic carbon content of 
the soil fits what is generally known from the literature. 
First, there is the relation for R: 

R=l + f^K d 
8 


C 


( 41 ) 
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Second, we have the equation from Karikhoff 
et al. [10]: 


< cm 

k-d = 0.411-I fK m 


(42) 


Combining Eqs. 29 and 30 results in an equation for 
the organic fraction: 


/ = 


(R-l)s 


0.41 p B (g/cm 3 ) K 0] 


= 0.030 - 0.033 


(43) 


For Malathion, K ow = 230. From soil tests, 8 = 0.3, 
and pp = 2.0 g/cm 3 . Then Eq. 43 gives/= 0.030 — 0.033. 
This number is about right for the organic-rich soil 
that was tested. Thus, we know that our column tests 
are of the right order. 


Example 2: Atrazine spill into an irrigation well 
(three-dimensional dispersion with convection). 

Three kilograms of 1,000 g/m 3 Atrazine pesticide is 
accidentally dumped down an old farm irrigation 
well, placed to pump water out of porous sandstone, 
as illustrated in Fig. 8. Estimate the movement of the 
Atrazine plume over time and the concentrations in 
the plume. 

Given: 

U = 10-4 m/s d (sandstone) = 1 mm 

/= 10-4 (not soil) D = 10-1° m 2 /s 
8 = 0.3 p b /8 = 6 g/cm 3 

Our solution is [4]: 


C = 


M 

8e (ft f/-R) 3/2 y D* D y D z 

1 4 D x t 4D y t 


Rz 2 

AD y t 


(44) 
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Figure 8 

Illustration of well and Atrazine spill (From [5]) 


Now, estimate the maximum concentration loca¬ 
tion of this maximum and spread of the Atrazine cloud 
as a function of time. 

Dispersion coefficients: As a first guess, let us use 
the empirical relations provided in Eqs. 34 and 35. We 
will assume no heterogeneity in rock porosity and no 
lenses. 


L) x _ Dj 
~D~ ~D 



or 


D x = (10 -4 m/s)(10 _3 m) = 10“ 7 m 2 /s 


D 


— = 0.1 Pe or 
D 


D y ,D z = 0.1(10 7 m 2 /s) = 10 8 m 2 /s 


Retardation Coefficient: 

Lehman et al. [11] gives: 

log (K 0w ) = 2.75 for Atrazine 

■'■K 0w = 10 2 ' 75 = 562 

Using Karikhoff et al. [10] relationship: 

K d = /3fK 0w where /? = 0.41 cm 3 /g and /= 10 -4 
Then 


K d = 0.41 cm 3 /g (10“ 4 )(562) = 0.023 cm 3 /g 
and 

R = 1 + — K d = 1 +6 A- (0-023 cm 3 /g) = 1.14 
8 cm 5 

Even with a fairly sorptive organic compound, the 
retardation coefficient in rock is not much different 
from 1.0. Eq. 44 gives a maximum at y - 0, z = 0, 
x-U t/R. Thus, 


Ch¬ 


at 

88(7it/R) 3 ^ 2 ^D x D y D z 


and 


X max = -jy= °- 88 x 10 " 4 ( m / s ) f W 

We will indicate spread by 4er, which corresponds to 
95% of the total mass of the cloud, for a Gaussian 
distribution like this equation provides. 

For 1-D diffusion: er 2 = 2 Dt/R _ 

For 3-D diffusion: o 2 = | Dt/R o = ^ 2 ^ R 
and 

A x /2PjjR 


3 
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Table 4 Estimated concentration over time and space with 
transport through a uniformly porous sandstone 


Time 

^max 

(g/m 3 ) 

(m) 

4cr x 

(m) 

4cr y 

(m) 

1 hr 

4.0 x 10 s 

0.32 

0.04 

0.014 

1 day 

3.4 x 10 6 

7.6 

0.17 

0.054 

1 month 

2.1 x 10 4 

229 

0.90 

0.29 

1 year 

500 

2,800 

3.1 

0.98 

2 years 

180 

5,500 

4.4 

1.39 

10 years 

16 

27,000 

9.9 

3.13 


Chemicals in the Environment, Dispersive Transport. 
Table 5 Estimated concentration over time and space with 
transport through a sandstone media with 1 m 
heterogeneities 


Time 

^max 

(g/m 3 ) 

/(max 

(m) 

4 o- x 
(m) 


1 hr 

1.25 

0.32 

1.3 

0.4 

1 day 

.011 

7.6 

5.5 

1.7 

1 month 

6.6 x 10^ 5 

229 

29 

9.2 

1 year 

1.57 x 10“ 6 

2,800 

99 

31 

2 years 

5.6 x 10^ 7 

5,500 

140 

44 

10 years 

5.0 x 10^ 8 

27,000 

320 

99 


4 a z = 4 Gy = 


4^/2 Dy t/R 
3 


Note that in Table 4, the concentrations at 1 h, 
1 day, and 1 month are above the initial Atrazine 
concentration (1,000 g/m 3 ). This is one problem with 
Dirac delta boundary conditions because they initially 
have no volume, only mass. At greater elapsed time, 
however, the inaccuracies of the Dirac delta solution 
have a minimal impact on the resulting concentration. 

However, virtually all media have heterogeneities of 
high and low porosity regions, as well as lenses that 
form around cracks. If we assume that the spacing of 
these regions is a mean of 1 m, instead of the 1 mm 
grain size, then our estimated dispersion coefficients 
are increased by a factor of 10 3 . Applying the above 
equations to these parameters results in a reduction in 
Qnax by a factor of 3.2 x 10 4 and an increase in both 4a 
values by a factor of 32. The resulting estimates of the 
pertinent parameters are given in Table 5. These values 
are likely to be more realistic for transport through 
a groundwater aquifer. 

The unknown dispersion coefficient is not uncom¬ 
mon in groundwater transport problems. It is typically 
one of the parameters fitted to measurements in 
groundwater transport. 


Example 3: Drinking water pollution by trichloroeth¬ 
ylene (steady state groundwater transport with lateral 
dispersion). A military ammunition plant in Arden 
Hills, Minnesota, used trichloroethylene (TCE) as a 


metal cleaning solvent for many years. Trichloroethy¬ 
lene is currently believed to be a carcinogen. Unaware 
of the hazardous nature of TCE, plant personnel placed 
the waste grease and TCE in a trench to burn (the 
grease) or soak into the ground (the TCE) and disap¬ 
pear from sight for many years. What was not known, 
however, is that they were placing the TCE into an 
aquifer that surfaces near the armory. Four kilometers 
downstream, the City of New Brighton used this aqui¬ 
fer as a source of municipal water supply. What is the 
expected TCE concentration in the New Brighton water 
supply and what should be done in the adjacent cities? 
Is the TCE plume sufficiently captured by the New 
Brighton wells? The following conditions were approx¬ 
imated from available data: 

Supply of TCE = 100 kg/day 
U= 1.6 x 10 -5 m/s 
Aquifer thickness, H = 30 m 
R = 1.0 in the aquifer for TCE. 

Aquifer porosity, s = 0.3 
New Brighton extraction, Q = 0.25 m 3 /s 
Drinking water recommended limit for TCE = 
5 ng/L (5 x 10“ 3 g/m 3 ) 

As a first assumption, we will assume that the New 
Brighton well was located in the center of the plume, 
compute the capture zone, and then the concentrations 
within this capture zone. The capture zone is given by: 

Q , x 

Y = —— = 1740 m (45) 
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At a velocity of 1.6 x 10 -5 m/s, the 4 km distance 
would be covered in 8 years, which is short compared to 
the ~40 years of dumping TCE. We will therefore 
assume that the system is at steady state. We have 
these boundary conditions: 

At x = 0, M = lOOkg/day = 1.2 g/s 
At y - oo, C = 0 
At x = oc, C = 0. 


Mr = 2 sH 


Cdy 


Y/2 


Using the rough approximations of Eqs. 35 and 36, 
we get 


A. = 0.0026m 2 /s 


The solution with these boundary conditions is: anc j 


M (~ Uy2 \ 

Hs(4tzxUD,) 1/2 CXP V 4 D t x) 


with a capture zone mean of 



Cdy 


where Y is the width of the capture zone, and a leakage 
from the capture zone, M L of 


D t = 2.6X 10 -4 m 2 /s. 

Note that Eq. 36 gives D L = 0.0026 m 2 /s with 
a 67% confidence interval of between 0.064 and 
1 x 10 -4 m 2 /s, or 1.4 orders of magnitude. This vari¬ 
ation would need to be considered in any preliminary 
analysis of this problem. 

The solution for this application is given in Fig. 9 at 
various lateral distances from the peak concentration. 
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Chemicals in the Environment, Dispersive Transport. Figure 9 

Predicted concentration of trichloroethylene in groundwater plume versus distance for the ammunitions plant 
release - City of New Brighton case study 
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The capture zone mean is 0.42 g/m 3 , or almost 
100 times the recommended limit, which would raise 
concern in New Brighton. The leakage from the capture 
zone at x = 4,000 m for this scenario is computed to be 
47 g, or sufficient mass to result in a concentration of 
9 x 10 -4 g/m 3 for a similar capture zone. It is possible, 
then, that almost all of the plume was captured by the 
City of New Brighton, so the problem may not cover 
a wider area than the immediate downstream cities. 
This, at least, is one positive result of the low transverse 
dispersion of groundwater plumes. 

Example 4: Determination of D L in a river. A tanker 
car carrying a solvent derailed on a bridge and fell 
into the Nemadji river, Wisconsin. The forensic inves¬ 
tigation team has a computational model that will 
simulate the spill, if some coefficients are determined, 
including D L and t r . The most cost-effective means 
of determining these parameters would be to per¬ 
form a conservative tracer pulse test and adjust the 
parameters from discharge on the day of the tracer 
test (30 m 3 /s) to discharge on the day of the spill 
(70 m 3 /s) with some predictive equations that have 
been developed. 

The location of the pulse input is identified as x = 0. 
The measured tracer concentrations and other relevant 
data are given in Fig. 10. From this data, determine the 
D l and t r parameters on the day of the test for the reach 
from x = 8,000 m to x = 35,000 m. 

Because the variance of the tracer curve grows lin¬ 
early with distance after x = /, we can make the 
following statements: 

A t r — t r2 t r] 

and 

Ao 2 t = a 2 t \ 2 - a 2 ^ 

and finally, 

Ao 2 2Di 2Di A t r 

At] UAx Ax 2 

At x= 8,000 m, 

\ Ct dt Y] Ct At 
tri = J ~Cdt ~ £ CAt 


Using all of the At values set equal to 0.1 h, At can be 
cancelled out of the equation, and 


.30hg/nr 
0.67 g/m 3 


Also, 


J Ct 2 dt 
1 “ J Cdt 


2 ^ J2 Ct 2 _ 2 

rl “ £ CAt rl 


or, again cancelling the equal At’s: 


O'? Il = 


2.59 h 2 g/m 3 
0.666 g/m 3 


- (1.94 h) 2 = 0.125 h 2 


We have assumed that the location 8,000 m down¬ 
stream of the pulse injection will be out of the “mixing” 
region, as specified by Eq. 39. We are now ready to 
check this assumption with the parameters of the 
stream flow at x = 8,000 m. First, U = Q/A = 8 m 3 /s/ 
10.8 m 2 = 0.74 m/s, and from the chapter on Turbulent 
Transport: 


s y = (0.6 =b 0.3) u*h 


(46) 


where 


w* = yjghS = ^/(9.8m/s 2 )(0.6m) (10 3 ) — 0.077m/s 
Then, 


= (0.6 =b 0.3) (0.077m/s)(0.6m) 
= 0.028 ± 0.014 m 2 /s 


and Eq. 41 gives, 


0.2 (0.74m/s) (30 m) 2 

X, =- 2 --—A-t -= 4760 m 

0.028 m 2 /s 


With the approximations in Eq. 39, x = 8,000 m will 
be assumed sufficient to begin our determination of 
Dl, especially since that is one of the few access points 
(a bridge) into this reach of the river. 

We will now perform a similar calculation on the 
tracer cloud at x = 35,000 m. 


tr 2 = 11.54 hr 

2| 14 hr 2 g/m 3 

a *' 2 ~ 0.105 g/m 3 


(11.54 hr) 2 = 0.162 hr 2 


c 


Ax = 35, 000 m — 8, 000 m = 27,000 m 
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HWY 35 BRIDGE 

W = 30 m X = 8000 m M=500g A = 10.78 m 2 S = 10“ 3 h = 0.6m 



CNTY RD C BRIDGE 



Chemicals in the Environment, Dispersive Transport. Figure 10 

Tracer measurements taken at the Highway 35 and County Road C bridges 
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Now, 

_ A(7 2 Ax 2 _ (0.162 - 0.125) hr 2 (27,000 m) 2 
~ 2 A l 2 _ 2 (11.54 - 1.94) 3 hr 3 


only a rough approximation, however, which can be 
inaccurate at the trailing edge of the chemical cloud. 

Future Directions 


or, 

Dl = 15,200m 2 /hr = 4.2m 2 /s 

We can use Eq. 37 to adjust our dispersion coeffi¬ 
cient from the 8 m 3 /s with 0.6 m mean depth on the day 
of the tracer test to the 3 m 3 /s with 0.4 m mean depth 
that existed as the river discharge on the day of the spill: 

Dl = 0.011 (Q/Wh) 2 W 2 /(^/ghSh) (47) 

and then, assuming that the slope does not change and 
that the banks are fairly steep, such that dA-W dh , 
and assigning the subscripts t and 5 to indicate tracer 
and spill: 


As computational power of our computers continues 
to improve, less attention will be paid to dispersive 
transport in the environment. The merging of velocity 
profiles and mixing to make a dispersion coefficient 
will be of less value when a more exact solution is 
available on desktop computers. The simplicity of the 
analysis, however, still will make the dispersion equa¬ 
tions valuable in developing an understanding of mixing 
and transport problems in environmental settings. 
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m 2 /3m 3 /! 
= 4.2 — 1 ' 
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8m 3 /sy \0.4m 
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In addition, t r can be adjusted as well: 
At 


. Q t h s , X1 8m 3 /s 0.4m 

At, = (11-54 - 1.94) h- i- - 

Q s h t 3m 3 /s0.6m 


so 


A t r$ = 17 h 

The use of an empirical relation, such as Eq. 37, to 
adjust parameters for discharge is more accurate than 
simply using the equation itself, because the coefficient 
and other variables that can have substantial uncer¬ 
tainty are eliminated from the equation. 

Conclusions 

Dispersive transport is handled in a manner similar to 
turbulent diffusion and diffusion, with almost the same 
equations and solutions. The primary difference is that 
a dispersion coefficient is one or two orders of magni¬ 
tude greater than turbulent diffusion coefficients and 
about 10 orders of magnitude greater than diffusion 
coefficients. Assuming that the dispersion coefficient 
will result in a Gaussian distribution of a chemical is 


1. Taylor Gl (1953) Dispersion of soluble matter in solvent 
flowing slowly through a tube. Proc R Soc Lond Ser A 219:186 

2. Taylor Gl (1954) The dispersion of matter in turbulent flow 
through a pipe. Proc R Soc Lond Ser A 223:446 

3. Elder JW (1959) The dispersion of marked fluid in turbulent 
shear flow. J Fluid Mech 5:544 

4. Crank J (1975) The mathematics of diffusion, 2nd edn. Oxford 
University Press, Oxford 

5. Gulliver JS (2007) An introduction to chemical transport in the 
environment. Cambridge University Press, Cambridge 

6. Freeze RA, Cherry JA (1979) Groundwater. Prentice-Hall, 
Englewood Cliffs 

7. Koch DL, Brady JF (1985) Dispersion in fixed beds. J Fluid Mech 
154:399 

8. Gelhar L, Welty C, Rehfeldt KR (1992) A critical-review of 
data on field-scale dispersion in aquifers. Water Resour Res 
28(7):1955 

9. Fisher HB (1973) Longitudinal dispersion and turbulent mixing 
in open-channel flow. Ann Rev Fluid Mech 5:59 

10. Karikhoff SW, Brown DS, Scott TA (1979) Sorption of hydro- 
phobic pollutants on natural sediments. Water Res 13:241 

11. Lehman WJ, Reehl WF, Rosenblatt DH (1990) Handbook of 
chemical property estimation. American Chemical Society, 
Washington, DC 

Books and Reviews 

Fisher HB, List JE, Koh RCY, Imberger J, Brooks NH (1979) Mixing in 
inland and coastal waters. Academic, San Diego 

Kreyszig E (1982) Advanced engineering mathematics, 4th edn. 
Wiley, New York 

Levenspiel O (1962) Chemical reaction engineering. Wiley, 
New York 


C 









2048 


c 


Chemicals in the Environment, Turbulent Transport 


1 Chemicals in the Environment, 
Turbulent Transport 

John S. Gulliver 

Department of Civil Engineering, University of 
Minnesota, Minneapolis, MN, USA 

Article Outline 

Glossary 

Definition of Turbulent Transport in the Environment 
Introduction 

Mass Transport Equation with Turbulent Diffusion 
Coefficients 

Prandtl’s Mixing Length Hypothesis for Turbulent Flow 

Example Applications 

Conclusion 

Future Directions 

Bibliography 

Glossary 

Diffusion The spreading of fluid constituents through 
the motion inherent to atoms and molecules. 
Diffusion coefficient A coefficient that describes the 
tendency of molecules to spread a constituent mass 
Dirac delta An impulse of a given quantity (mass) that 
occurs over an infinitely short time or space. 
Kinematic viscosity The fluid viscosity divided 
by the fluid density, resulting in units that are 
similar to a diffusion coefficient, or length squared 
per time. 

Laminar flow Flow that has no turbulent eddies, 
where the fluid flows in laminas and diffusion cre¬ 
ates the mixing of the fluid. 

Prandtl’s mixing length The mean length that the 
turbulence in the flow will transport mass, momen¬ 
tum, or energy. 

Reynolds number The ratio of inertial to viscous 
forces, resulting in a meaningful velocity times a 
meaningful distance divided by kinematic viscosity. 
Turbulent diffusion The mixing of fluids through 
turbulent eddies created by convection. 

Turbulent diffusion coefficient A coefficient that 
comes from the multiplication of two turbulent 
velocities of the flow, divided by density of the 


fluid. The coefficient’s location in the mass trans¬ 
port equation is similar to diffusion coefficients, 
and the units are similar; so it is called a “turbulent 
diffusion coefficient.” 

Definition of Turbulent Transport in the 
Environment 

It is fairly safe to state that, except for flow through 
porous media, the environment experiences turbulent 
flow. To emphasize this point, the constriction of a 
water flow or airflow that would be required will be 
considered to have the other option, laminar flow. 

An experimentally based rule of thumb is that lam¬ 
inar flow typically occurs when the pipe Reynolds 
number, Vd/v , is less than roughly 2,000, or when an 
open-channel Reynolds number, Vh/v, is less than 
roughly 500, where Vis the cross-sectional mean veloc¬ 
ity, d is the pipe diameter, v is the kinematic viscosity of 
the fluid, and h is the channel depth. The diameter or 
depth that would not be exceeded to have laminar flow 
by these experimental criteria is given in Table 1 . 

Table 1 shows that with the boundary conditions 
present in most environmental flows, that is, the earth’s 
surface, ocean top and bottom, river or lake bottom, 
etc., turbulent flow would be the predominant condi¬ 
tion. One exception that is important for interfacial 
mass transfer would be very close to an interface, 
such as air-solid, solid-liquid, or air-water interfaces, 
where the distance from the interface is too small 
for turbulence to occur due to the high viscous 


Chemicals in the Environment, Turbulent Transport. 
Table 1 Maximum diameter or depth to have laminar flow, 
with the transition Reynolds number for a pipe at 2,000 


Water (»>= 10 6 m 2 /s) 

Air (v ~ 2 x 10 

" 5 m 2 /s) 

V (m/s) 

D (m) 

Mm) 

Mm) 

10 

2 x 1(T 4 

5 x 1(T 5 

0.004 

3 

7 x 10“ 4 

1.5 x 10“ 4 

0.014 

1 

0.002 

0.0005 

0.04 

0.3 

0.007 

0.0015 

0.14 

0.1 

0.02 

0.005 

0.4 

0.03 

0.07 

0.015 

1.4 

0.01 

0.2 

0.05 

4.0 
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dissipation. Because turbulence is an important source 
of mass transfer, the lack of turbulence very near the 
interface is also significant for mass transfer, where 
diffusion once again becomes the predominant trans¬ 
port mechanism. 

Introduction 

What is turbulent flow? The simple illustration of a 
free-surface flow given in Fig. 1 is used to describe 
the essential points of the turbulence phenomena. 
Turbulent open-channel flow can be described with 
a temporal mean velocity profile which reaches 
a steady value with turbulent eddies superimposed 
upon it. These turbulent eddies are continually moving 
about in three dimensions, only restricted by the 
boundaries of the flow, such that they are eliminated 
from the temporal mean velocity profile, u in Fig. 1. It 
is this temporal mean velocity profile that is normally 
sketched in turbulent flows. 

There will also be a temporal mean concentration. 
If there is a source or sink in the flow, or transport 
across the boundaries as in Fig. 1, then the temporal 


mean concentration profile will eventually reach 
a value such as that given in Fig. 1. This flux of com¬ 
pound seems to be from the bottom toward the top of 
the flow. Superimposed upon this temporal mean con¬ 
centration profile will be short-term variations in 
concentration caused by turbulent transport. The con¬ 
centration profile is “flatter” in the middle of the flow 
because the large turbulent eddies that transport mass 
quickly are not as constrained by the flow boundaries 
in this region. Now, if a concentration-velocity probe is 
placed into the flow at one location, the two traces of 
velocity and concentration versus time would look 
something like that shown in Fig. 2. 

Turbulent diffusion is thus not really diffusion, but 
the mixing of chemicals through turbulent eddies cre¬ 
ated by convection. Turbulent diffusion is thus a form 
of convection. Although it has the appearance of diffu¬ 
sion in the end, that is, random mixing similar to 
diffusion, the causes of diffusion and turbulent diffu¬ 
sion are very different. Since the end products are 
similar, diffusion coefficients and turbulent diffusion 
coefficients are often simply added together. 


c 



Chemicals in the Environment, Turbulent Transport. Figure 1 

Turbulent eddies superimposed on a temporal-mean velocity and temporal-mean concentration profiles (From 
Gulliver [1]) 




Chemicals in the Environment, Turbulent Transport. Figure 2 

Time traces of typical measurements of velocity and concentration in a turbulent flow 
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It is convenient to divide the velocity and con¬ 
centration traces into temporal mean values and fluc¬ 
tuating components: 


taking the temporal mean of each term (e.g., the entire 
equation). Then, the mean value of a fluctuating com¬ 
ponent will be equal to zero, or 


U = U + li 

1 

(1) 

L 

c = c + c' 

(2) 


where u is the temporal mean velocity at a point loca¬ 
tion, u' is the fluctuating component of velocity 
(variable over time), C is the temporal mean concen¬ 
tration at a point location, and C' is the fluctuating 
concentration component of concentration which is 
also variable over time. Formal definitions of u and C 
are as follows: 


At 



0 


and 


At 



0 


(4) 


DC _ d(C + C) 
dt dt 


dC dC' _ dC 
~di + ~dt “ ~dt + ° 


(6) 


Equation 6, the change of a temporal mean over 
time, may seem like a misnomer, but it will be left in to 
identify changes inC over a longer time period than At. 
Continuing, 


dC _ d(C + O) _ dC dC _ dC 
dx dx dx dx dx 


dC _ d{C + C') _ dC dC _dC 

dy dy dy + dy dy 

dC _ d(C + CQ _ dC dC _dC 

dz dz dz dz dz 

However, the temporal mean value of two fluctuat¬ 
ing components, multiplied by each other, will not 
necessarily be zero: 

VC ^ iiC' (10) 


where At is long compared to the time period of the 
oscillating components. 


Mass Transport Equation with Turbulent 
Diffusion Coefficients 


In this section the most common equations for dealing 
with mass transport in a turbulent flow will be derived. 
Beginning with the mass transport equation developed 
in the entry “ ► Transport in the Environment,” 


dC d(uC) d(vC) d(wC) 
dt dx "I” dy "I” dz 


d (^dC 


dx V dx 


d 


dy 


( dC\ d / 
{ D ay) + dz\ 1 


dC\ 


= ^\D—)+—iD—)+—lD—)+S 


dz J 


(5) 


the temporal mean of the entire equation will be taken 
and eventually one will end up with an equation that 
incorporates turbulent diffusion coefficients. 

In a turbulent flow field, Eq. 5 is difficult to apply 
because C, u , v, and w are all highly variable functions 
of time and space. Osborne Reynolds [2] reduced the 
complexities of applying Eq. 5 to a turbulent flow by 


This is similar to a least-square regression, where 
the mean error is zero, but the sum of square error is 
not. The x-component of our convective transport 
terms will be dealt with first: 


uC — (u T + d) — uC T uC f T u!C T u f C f 

(ii) 

Three of the four terms in Eq. 11 may be reduced to 


something known: 

uC = uC (12) 

UC = 0 (13) 

u i C = 0 (14) 

but, the fourth term will take some additional consid¬ 
eration, because it is not equal to zero: 

VC ^ 0 (15) 

By inference, the following can be written for all 

three convective transport terms: 

uC =uC + u’C’ (16) 
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vC=vC+v'C' (17) 

and 

wC=wC-\-w'C' (18) 

Finally, applying continuity (w + v + iv = 0) to 
Eq. 5 and taking the temporal mean results of Eqs. 6, 
7, 8, 9, 16, 17, and 18 


dC 

dt 


dC _dC 

l -h V - 

dx dy 


dC 

t — 

dz 


d 


d 


d 


- — u'C' - — v'C' - — w'C' + — 
ox dy dz ox 

d f dC\ d f dC\ „ 

+ TT- + -7T I D—~ +S 


dy\ dy J dz\ dz 



(19) 


Eq. 19 from left to right, we see an unsteady term, three 
mean convective terms, the three “unknown” terms, 
the diffusive terms and the source/sink rate terms. 
The “unknown” terms are the only possibility to 
describe turbulent diffusion. 

In the late nineteenth century, Boussinesq [3] prob¬ 
ably went through something similar to the thought 
process described above. The end result was the 
Boussinesq eddy diffusion coefficient 


c 


— u'C' = s x 

- 7 C = By 

— w'C' — s z 


dC 

dx 

dC 

dy 

dC 

dz 


(20a) 

(20b) 

(20c) 


where the turbulent convective transport term can be 
moved to the right-hand side, because the concentra¬ 
tion distribution that results from these terms looks 
similar to diffusion. 

With this temporal mean process, we have reduced 
the terms for which we will have difficulty defining 
boundary conditions in turbulent flow fields from 
seven in Eq. 5 to three in Eq. 19. We will now deal 
with these three terms. 

The diffusion equation is a useful and convenient 
equation to describe mixing in environmental flows, 
where the boundaries are often not easily defined. It 
also lends itself to analytical solutions and is fairly 
straightforward in numerical solutions. Although an 
alternative technique for solutions to mixing problems 
is the mixed cell method described in the entry 
“► Chemicals in the Environment, Dispersive Trans¬ 
port,” there are complications when applied to multiple 
dimensions and to flows that vary with space and time. 
Finally, we are comfortable with the diffusion equation, 
so we would prefer to use that to describe turbulent 
mixing if possible. 

Therefore, let us consider the following thought 
process: if the end result of turbulence, when visualized 
from sufficient distance, looks like diffusion with seem¬ 
ingly random fluctuations, then we should be able to 
identify the terms causing these fluctuations in Eq. 19. 
Once identified, they can be related to a “turbulent 
diffusion coefficient” that describes the diffusion 
caused by turbulent eddies. Looking over the terms in 


where s y , and s z are the turbulent (or eddy) diffusion 
coefficients, with units of m 2 /s similar to the (molecu¬ 
lar) diffusion coefficients. 

Then Eq. 19 with Eqs. 20a, 20b, and 20c becomes 


dC 

~dt 


dC _dC 


dx 


d 

dy 


in¬ 


dy 

N dC 


dC 
) - 

dz 

d 

+ ¥z 


d_ 

dx 


. dC 
{D + 8x) d^ 


. dC 
(D + £2 ) — 


f 5 

( 21 ) 


Turbulent diffusion is created by the flow field, 
which can vary with distance. Hence, turbulent diffu¬ 
sion coefficient cannot be assumed constant with dis¬ 
tance. Removing that assumption leaves turbulent 
diffusion coefficient inside of the brackets. 


Character of Turbulent Diffusion Coefficients 

A turbulent eddy can be visualized as a large number of 
differently sized rotating spheres or ellipsoids. Each 
sphere has sub-spheres, and so on until the smallest 
eddy size is reached. The smallest eddies are dissipated 
by viscosity, which explains why turbulence does not 
occur in narrow passages: there is simply no room for 
eddies that will not be dissipated by viscosity. 

The cause of the rotation is shear forces created by 
solid boundaries or variations in velocity lateral to the 
primary flow direction. A buoyant plume of smoke or 
steam, for example, will have a temporal mean velocity 
profile develop laterally to the plume, as the rising 
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plume mixes with the ambient air. Turbulent eddies are 
formed by this velocity gradient, and can be seen at the 
edge of the smoke or steam plume. The magnitude of 
turbulent diffusion coefficients is primarily dependent 
upon the scale of turbulent eddies and the speed of the 
eddy rotation. As illustrated in Fig. 3, a large eddy will 
have greater eddy diffusion coefficient than a small 
eddy because it will transport a compound (or solute) 
farther in one rotation. Likewise, a faster spinning eddy 
will have a larger eddy diffusion coefficient than one 
which is the same size but spinning more slowly 
because the solute simply gets there faster. These two 
facts provide meaning to the following observations: 

1. The largest scale of turbulence is roughly equal to the 
smallest overall scale of the flow field. This maybe seen 
in comparing the size of eddies at the edge of the 
smoke or steam plume to the width of the plume. 

2. The rotational eddy velocity is roughly propor¬ 
tional to the velocity gradient times the eddy scale. 

3. Eddy size decreases near boundaries to the flow 
field. Since the eddy size is zero at a solid boundary, 
and often close to zero at a fluid density interface 
(like an air-water interface), the turbulent eddy size 
has to decrease as one approaches the boundaries. 
In addition, since the flow cannot go through 
a boundary, the largest eddy size cannot be greater 
than the distance from the center of the eddy to the 
boundary. 

4. Turbulent diffusion occurs because turbulent 
eddies are transporting mass, momentum, and 


O Distance an eddy 
encompasses 
increases £ 

e large >e small 

O Speed at which 
it spins also 
increases £ 

£ fast > £ slow 

Chemicals in the Environment, Turbulent Transport. 
Figure 3 

Character of turbulent diffusion coefficients (From 
Gulliver [1]) 




energy over the eddy scale at the rotational velocity. 
This transport rate is generally orders of magnitude 
greater than the transport rate due to molecular 
motion. Thus, when a flow is turbulent, diffusion 
is normally ignored because s > > D. The exception 
is very near the flow boundaries, where the eddy 
size (and turbulent diffusion coefficient) decreases 
to zero. 

Thus, what influences the velocity and scale of 
eddies? For the most part, it is the velocity gradients 
and scale of the flow. Velocity gradients are the change 
in velocity over distance. If we have a high velocity, we 
typically have a large velocity gradient somewhere in 
the flow field. At solid walls, for example, the velocity 
must go to zero. Thus, the large velocity difference results 
in large velocity gradients, which results in faster spinning 
eddies and a larger turbulent diffusion coefficient. This 
process is illustrated in Fig. 4. 

The scale of the flow field is also important because 
the larger eddies perform most of the transport. The small 
eddies are always there in a turbulent flow, and their 
existence is important for local mixing. It is the large 
eddies, however, that are the most responsible for 
transport, as illustrated in Fig. 5. 

The four observations, listed above, were enough 
for Ludwig Prandtl [4] to hypothesize a simple 
model for describing turbulent transport that works 
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Figure 4 

Eddy formation at the edge of a jet issuing into a tank 
illustrates the importance of velocity gradients in eddy 
diffusion coefficient (From Gulliver [1]) 
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Chemicals in the Environment, Turbulent Transport. 
Figure 5 

Large and small eddies in an open-channel flow. The large 
eddies perform most of the top-to-bottom transport 
(From Gulliver [1]) 



z-€/2 
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Figure 6 

Illustration of the relationship between velocity profile, 
turbulent eddies, and mixing length 


c 


surprisingly well, considering the complexity of turbu¬ 
lent flow. 

Prandtl's Mixing Length Hypothesis 
for Turbulent Flow 

Prandtl’s mixing length hypothesis was developed for 
momentum transport, instead of mass transport. The 
end result was a turbulent viscosity, instead of a 
turbulent diffusivity. However, since both turbulent 
viscosity and turbulent diffusion coefficient are prop¬ 
erties of the flow field, they are related. Turbulent 
viscosity describes the transport of momentum by tur¬ 
bulence, and turbulent diffusivity describes the trans¬ 
port of mass by the same turbulence. Thus, 

£* = iUfx/p, s y = Vty/p, and e 2 = pjp 

where [i tx , ji ty , and [i tz are the turbulent viscosity in the 
x, y, and z directions. Now, for the x-component of 
momentum ( pu ), the Boussinesq approximation is 


— du 

- p u’u’ = p tx — 

(22) 

— du 
-p V ' U '=p ty - 

(23) 


(24) 


Let us consider the fully developed velocity profile 
in the middle of a wide-open channel, with x-, y-, and 
z-components in the longitudinal, lateral, and vertical 
directions, respectively. It is fully developed because 
du/dx is close to zero. The fact that it is a wide channel 


means that du/dy also is very small in the middle. 
From Eqs. 22 and 23, we can see that the turbulent 
transport of momentum in the x- and y-directions will 
be small because the gradients are small. Equation 24 
indicates that there will be a net turbulent transport of 
momentum in the z-direction. 

_ r)j2 

-wV = 8 z — ^0 (25) 

Now, half of the w' values will be positive, and the 
other half will be negative. We will use this criterion to 
divide them into two parts: 

w'u' = w'u' + + w'u' (26) 

where w'u' + has a value when w' is positive and is equal 
to zero when w' is negative. w'u'~ has a value when 
w' is negative and is equal to zero when w' is positive. 
Consider the cases when w' is positive. Then Eq. 26 
becomes 

w'u' = w'u' + + 0 (27) 

Let us assume that an eddy of length L is pulling 
a blob of fluid upward, as illustrated in Fig. 6. On 
average, the blob will have an x-component of velocity 
equal to u(z — L/ 2), where z is the location where u' is 
to be estimated. Thus, the eddy pulls up, on average, the 
u value that is at z - LI2. This will become the deviation 
from the temporal mean velocity at location z: 

u — u — utt u{z — L/2) — u(z) = - (m(z — L) — u(z)) 

( 28 ) 
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Equation 28 is a relation for a difference in velocity, 
which can be written as a velocity gradient times 
a distance: 


Prandtl also made another assumption in this 
region, that w'u' could be approximated by a con¬ 
stant equal to the mean wall shear stress, or 


du (L 


u = — 


dx\2 


Velocity 

= velocity 

X 

difference 

gradient 


Then, 

—J—T+ W' 

\u(z — L) — u(z)\ « 

w' du 

w'u' «— 
2 

2 dz 


(29) 

distance 


the development is similar for w'u' : 


: W'U' 


w' du 
2 dz 


Now combining Eqs. 26, 30, and 31 gives 


- du 

w'u' = —w'L— 
dz 


(30) 


(31) 


(32) 


Because turbulent eddies tend to be close to spher¬ 
ical in shape: 


\w\~\u\ 

and from Eq. 29: 

, duf 
w — L—— 
dz 


(33) 


(34) 


If we substitute Eq. 34 into Eq. 32, and then substi¬ 
tute the result into Eq. 24, we get 


-" = E 'S = £! iiS 


du^ 2 


(35) 


or 


s z = L 1 


du 

dz 


(36) 


Equation 36 is PrandtEs mixing length hypothesis , 
and it works well, considering that the basis for the 
equation is so empirical. However, Eq. 36 does present 
a challenge for us that mixing length, L, still needs to be 
specified. Measurements have shown us the following: 

1. Near a wall, L = kz , where k is von Karmans 
constant [5] and is very close to 0.4, and z is the 
distance from the closest wall. 


-w’u’ = x /p = ul (37) 

Then, eliminating w'u’ from Eqs. 35 and 37 
results in the well-known logarithmic velocity profile: 



(38) 


where u * is the shear velocity at the wall, t is the wall 
shear stress, and z 0 is an integration constant, often 
called the dynamic roughness. Table 2 provides some 
typical dynamic roughness lengths for atmospheric 
boundary layers. Applying Eq. 36 to 37 results in an 
equation for e z in this region: 

s z = ku*z (39) 

2. Very near a wall (approaching the laminar sublayer 
where the turbulence is so small that it is eliminated 
by the viscosity of the fluid), that is, for zu*/v < 35, 
L~/[6], 

Making the same assumption that u'w' is 
approximately equal to wall shear stress, this rela¬ 
tion for L results in the following relation for veloc¬ 
ity profile very near the wall: 

~ = P— (40) 

U^f. U^f. * z 


Chemicals in the Environment, Turbulent Transport. 
Table 2 Dynamic roughness lengths, z 0l for typical atmo¬ 
spheric surfaces (Turner [7]) 


Surface Type 

z 0 ( m ) 

Urban 

1.-3 

Forest 

1.3 

Deciduous forest in winter 

0.5 

Desert shrubland 

0.3 

Wetland 

0.3 

Cropland (summer) 

0.2 

Cropland (winter) 

0.01 

Grassland (summer) 

0.1 

Grassland (winter) 

0.001 

Water 

-0.0001 
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Equation 40 is not used in mass transport 
calculations near a wall or interface because the 
unsteady character of mass transport in this region 
is very important, and Eq. 39 is for a temporal mean 
velocity profile. 

3. Away from a wall, where the closest wall does not 
influence the velocity profile, L is a function of 
another variable of the flow field (Prandtl [8]). 
For example, consider the jet mixer given in 
Fig. 4. In this case, the mixing length, L, is a function 
of the width of the jet or plume. As the jet/plume 
grows larger, the value of L is larger. 

Here, it is easier to simply give the experimental 
relation for eddy diffusivity: 

g z — (3 u m2LX b (41) 

where b is the width of the mixing zone, [3 is a 
constant, and w max is the maximum velocity in the 
jet at the given location, x. 


m* = 0.01 Uio (8 + 0.65 Uio ) 1/2 (44) 

which indicates that as the waves get larger at high 
wind speeds the boundary roughness effect upon u * 
increases by the factor (8 + 0.65 Hio) 1 ^ 2 , where U\q is 
given in m/s. 

Then, 

s z ( m 2 /s) = 0.01 k U\oz(8 + 0.65 iqo )^ 2 = 0-32 z 

when z is given in meters. Now, the diffusion coefficient 
of water vapor in air is calculated to be D = 2.6 x 
10 -5 m 2 /s. Then, the elevation at which the diffusivity 
of water vapor would equal eddy diffusivity in this case 
would be, 

0.32z = 2.6 x 10- 5 

or, 

z = 8 x 10 5 m = 0.08 mm = 80 jum 


c 


Figure 7 gives some relationships for eddy diffu¬ 
sion coefficient profiles under different conditions that 
will be handy in applications of turbulent diffusive 
transport. 

Example Applications 

Example 5.1: Profile of eddy diffusion coefficient. 

Estimate the eddy diffusivity profile for a wind velocity 
of 18 m/s measured at 10 m over a large lake (Fig. 8), 
and calculate the elevation above the water surface 
where s z = D for water vapor. 

There is only one assumption needed: 

1. The wind fetch is sufficient so that U io is influenced 
by shear at the water surface (10 m is inside the 
boundary layer of the lake surface at this point). 

Then, mixing length theory may be used with 
momentum transport to derive: 


and, 

s z = k u*z (43) 

Now, Wu [9] has provided the following equation 
from a fit of field data: 


Thus at z = 80 pm elevation above the water surface, 
eddy diffusivity will be equal to the diffusivity of water. 
A similarly small elevation would result for almost any 
environmentally relevant compound. We can thus see 
that both s and D need to be considered simultaneously 
in Eq. 20 only very close to surfaces in turbulent flow, 
where e approaches the diffusion coefficient. Otherwise, 
diffusivities can be ignored in solving turbulent flow 
transport problems, since s +D is essentially equal to s. 

Example 5.2: Concentration profile of suspended sed¬ 
iment in a river (assuming e z is constant). We will 
apply Eq. 20 to solve for the concentration profile of 
suspended sediment in a river, with some simplifying 
assumptions. Suspended sediment is generally consid¬ 
ered similar to a solute, in that it is a scalar quantity in 
Eq. 20, except that it has a settling velocity. We will also 
change our notation, in that the bars over the temporal 
mean values will be dropped. This is a common proto¬ 
col in turbulent transport, and will be followed here for 
conformity. Thus, if an eddy diffusion coefficient,e, is 
in the transport equation, 

u means u 

v means v 
w means w and 
C means C 
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(m 3 /s) 


'^"5 




t> 




b(x) 




£ y = pu m b = constant 


c Jetlike flows (not close to a boundary) 



depth = h 

d Depth averaged transverse turbulent diffusion in a river 
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Profiles of eddy diffusion coefficient for various types of applications (From Gulliver [1]) 
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Figure 8 

Velocity profile over a large lake 


Then, Eq. 20 becomes 


dC d 
Vs dz dz 


dC 

( D + 8 2 ) — 


(45) 


where we have not yet applied assumption 4. We can 
move the settling velocity into the partial term: 


d(-VsC) 

dz 


d_ 

dz 


,dC 

CD + ez ) — 


(46) 


and since both sides of Eq. 46 are a gradient with 
respect to z, the terms inside of the gradients must 
also be equal: 


— v s C — (D + s z ) 


dC 

dz 


(47) 



C, 


Equation 47 is converted to an ordinary differential 
equation because all variables are only a function of z. 
Now, we will deal with assumption 4. Fig. 7 gives the 
equation developed by Rouse [10] for s z : 

s z = k u*z(l — z/h) (48) 

where u * is the shear velocity at the bottom of the 
channel, or 

W* = y/xT p (49) 


C 
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Figure 9 

Lateral and longitudinal cross sections of a typical river 


where t is the shear stress at the wall. For a fully devel¬ 
oped open-channel flow in a wide channel, the follow¬ 
ing relation is easily derived: 

w* = yfghS (50) 


throughout the remainder of this entry. Fig. 9 gives 
a longitudinal and lateral cross section of our river. 
We will make the following assumptions: 

1. The flow is steady over the long term, so that 
dC/dt= 0. 

2. The flow is fully developed, such that any gradient 
with respect to x is equal to zero (dC/dx =0). 

3. The river can be divided into a series of longitudinal 
planes with no significant interaction, such that v = 
0 and 8 y = 0 (this is the assumption of the stream- 
tube computational models). 

4. The vertical eddy diffusivity, 8 Z , is a constant value. 

Assumptions 3 and 4 are more difficult to justify. 
The solute will have a vertical velocity, w = — v s , 
where v s is the settling velocity of the suspended 
sediment. 


This derivation can be found in a text on fluid 
mechanics or open-channel flow. Assumption 4 states 
that 8 Z = e z for all values of z, where 8 Z is the depth 
average, or 

h h 

8 Z = - j 8 z dz — J z(l — z/h)dz = 0.067 u*h 

0 o 

(51) 

where h is the depth of the stream. The term 8 Z is almost 
always much greater than D in a turbulent flow. Thus, 

D + 8~ z = 8~ z 

Now, substituting these equations into Eq. 47 
results in 

_dC 

8 z - r + v s C = 0 
dz 
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We will solve this by separating variables, 

— ~—dz 
C s z 

integrating, and taking both sides of the solution to the 
power of e : 

C = p x e^ z 

Now, we need a boundary condition to determine 
Pi. This is difficult with suspended sediment profiles. 
We can develop a fairly good estimate of the distribu¬ 
tion of suspended sediment once we have a known 
concentration at some location in the flow field. In 
the sediment transport field bed load and suspended 
load are often discussed. The relation between the two, 
and some experience and measurements of both simul¬ 
taneously, can be used to predict an equivalent 
suspended sediment concentration at the bed. Then, 
the relevant boundary condition is 



C/Co 


Chemicals in the Environment, Turbulent Transport. 
Figure 10 

Suspended sediment concentration profile for Example 2 


1. At 

z = 0, C = Cq. 

where C 0 is the concentration that has been deter¬ 
mined from the bed load-suspended load relation¬ 
ship. Applying this boundary condition gives Pi = 
C 0 , and our solution is 

C = Co e~ v ^ 

The result is illustrated in Fig. 10. This problem can 
also be solved without assumption 4 [10]. 

Example 5.3: Concentration of organic compounds 
released into the air by an industrial plant (applica¬ 
tion of the product rule to error function solutions). 

There is some concern about the emissions from the 
adhesives produced in an industrial plant. Specifically, 
the town of Scream Hollow is 1 km away from the 
plant, where citizens have begun to complain about 
odors from the plant and of headaches. One culprit, 
aside from a haunting, may be the release of Acrolein, 
C 3 H 4 O, a priority pollutant that is an intermediary of 
many organic reactions. The average release from the 
200 m x 200 m x 10 m plant sketched in Fig. 11 is 
assumed to be 20 g/h. If the wind is blowing directly 
toward Scream Hollow, at 3 m/s measured at 3 m 
height, with a dynamic roughness of 0.2 m for the 
farmland, what concentrations will the Scream Hollow 
inhabitants experience? Is this above the EPA threshold 
limit of 0.1 ppm(v)? 

We will need to make some assumptions to formu¬ 
late this problem. They are: 

1 . The Acrolein release is distributed over the most 
downwind plane of the building. With the impor¬ 
tant concentration being 1 km away, this is not 
a bad assumption. Then, the Acrolein will be 


Town 
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Illustration of toxic chemical release into the atmosphere, with the wind blowing toward a town 
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released over the plane that is 200 m x 10 m. If 
20 g/h = 0.0056 g/s are released into a wind moving 
at 3 m/s, the initial concentration is 


C 0 = 


0.0056 g/s 


3 m/s(200 m)(10 m) 


= 9.3 x 10" 7 g/m 3 


2. We will use a cross-sectional mean velocity of 
U = u at 3 m height, or U= 3 m/s. 

3. We will use s z = s y = 8 Z at 3 m height. 

4. We will not consider any of the source or sink terms 
for Acrolein. 


up at y - — Ay and a step down at y - Ay. We will also 
use the product rule (Example 3, ► Transport in the 
Environment) to indicate that the solution to our 
governing equation for the y- direction should be mul¬ 
tiplied times the solution in the z-direction. Then the 
solution can be given as 


C 

Co 


Po + 


p 1 er f 


( (z + Az) \ 


+ Pi erf 


( (z~Az) \ 

[y^/uj 


X 


Pi erf 


( (r+Ay) ) 


+ p A erf 


( (y-Ay) ) 

\y^y/u) 


C 


We will also set up the coordinates so that (x,y,z) = 
(0,0,0) occurs on the ground at mid-plant width, and 
will orient the wind in the x-direction. 

With these assumptions, the governing equation 
becomes 


TT dC _ d 2 C _ d 2 C 

fa~ ey W +ez ^ r 

The boundary conditions are 


(52) 


1. At (x,y,z) = (0, — 100 m => 100 m, 0 => 10 m), C = C 0 . 

2. As x => oo, y => oo, or z => oo, C => 0. 

3. Zero mass flux at z = 0. 


These boundary conditions, illustrated in Fig. 12, 
will give us a concentration front, but in two dimen¬ 
sions. In addition, we have a zero flux condition that 
will require an image solution. We will use the solution 
of Example 5 in the entry “► Transport in the Environ¬ 
ment” to develop a solution for this problem. The 
solution, before applying boundary conditions, was 



Now, we need an image to the concentration front 
about the z = 0 plane. In the y-direction we have a step- 


z 
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Figure 12 

Illustration of the coordinate system for Example 5 


where Ay = 100 m and Az- 10 m. 

Now, to see if our boundary conditions can be 
satisfied with the form of the solution: 


@ x => oo, C => 0. Thus P 0 = 0. 

@ z => oo, C => 0. Thus - —P 2 - 

=> oo, C => 0. Thus P 3 = ~P 4 . 

@ x => 0, and (y, z) = (0,0), C/C 0 = 1. 

With the last boundary condition, Eq. E5.5.3 
becomes 

i = (Pi-PiM-P4) 

or, 

1 = 2 PiX 2P 3 
or, 

l = ip 2 x 2P 4 

Finally, @ x => 0 and (y,z) = (0,dz), C/C 0 = Vi. Thus 

Pi = V2- 

Applying this last boundary condition results in 
P 3 = Vi, p 2 = —V 2 , and p 4 = —Vi. Thus, the solution to 
Eq. 52 is 


C 

Co 


1 

2 

x 


erf 


/ (z + Az) \ 
\y/4e z x/Uj 


erf 


( (y + Ay) ) 

\y^z/u) 


- erf 


(z — Az) ^ 

j 


— erf 


( (7-AyA l 

{ya^/uj j 


(53) 


Now, if we use Az= 10 m , Ay - 100 m, U= 3 m/s, 
the only remaining parameter to find is s. Using Eq. 43 
given in Example 1: 

8 Z =8 y = KU*Z (43) 
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Note that the logarithmic boundary equation can 
be written as 



contains many features of the diffusive mass 
transport equation, and can be solved by similar 
techniques. 

4. PrandtEs mixing length is a relatively accurate sim¬ 
plification for many turbulent flows. 


where z 0 is the dynamic roughness, assumed to be 0.2 m 
for the crop land between the plant and Scream 
Hollow. Then, 


u 3 m/s 

£n(z/z 0 ) ~ £n(J 


1.1 m/s 


and 

s z - 0.4(l.lm/s)(3m) = 1.3 m 2 /s 


Future Directions 

The future for turbulent transport in the environment is 
in the direction of computational mass transport. This 
requires a simultaneous fluid dynamics-mass transport 
solution. On typical environmental scales, the compu¬ 
tational power of our computers still must be advanced 
to solve these large problems while resolving the scale of 
the smallest turbulent eddies. Direct numerical simula¬ 
tion cannot deal with the scale of these problems, and 
large eddy simulation cannot keep both the scale and 
grid refinement required. 


If we now plug all of the parameters for the industrial 
plant into Eq. 53, we get C = 0.25 pg/m 3 = 2.5 x 
10 -7 g/m 3 . In terms of ppm(v), we will use p air = 
1.2 g/m 3 , and the molecular weights of air and Acrolein 
of 29 and 56 g/mole, respectively. Then, 

c _ 2.5 x 10 ~ 7 g/m 3 MWair 

Pair MWqh 4 0 

2.5 x 10 -7 g/m 3 20 g/mole 

1.2 g/m 3 56 g/mole 

7 moles C 3 H 4 O 

= 1.08 x 10- 7 - , . 

mole air 

This is right at the threshold for continuous expo¬ 
sure, and the pollution from the plant should be inves¬ 
tigated in more detail. 

Conclusions 

1. Although turbulent diffusion is a convection trans¬ 
port, and not a diffusive transport, the result looks 
similar to diffusion, and can be described by 
a turbulent diffusion coefficient. 

2 . Most environmental flows are turbulent. The 
exceptions are flow through porous media and 
flows that are very close to an interface. 

3. Reynolds averaging and the Boussinesq assumption 
result in a turbulent transport equation that 
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Glossary 

C0 2 Carbon dioxide is the predominant greenhouse 
gas. Anthropogenic C0 2 emissions are created 
largely by the combustion of fossil fuels. 

GCM General circulation model, a term commonly 
used to describe climate models maintained at 
large research centers. 

GHG GHG is a greenhouse gas such as C0 2 and meth¬ 
ane. These gases contribute to global warming by 
capturing some of the outgoing infrared radiation 
before it leaves the atmosphere. 

IPCC The Intergovernmental Panel on Climate 
Change was formed in 1988 by the World Meteo¬ 
rological Organization and the United Nations 
Environmental Program. It reports research on cli¬ 
mate change. Their assessments are closely watched 
because of the requirement for unanimous 
approval by all participating delegates. 

Sustainability The most widely quoted definition of 
sustainability and sustainable development is that 
by the Brundtland Commission of the United 
Nations in 1987: “sustainable development is devel¬ 
opment that meets the needs of the present without 
compromising the ability of future generations to 
meet their own needs.” 


Definition of the Subject 

Climate change is one of many dynamic processes 
impacting global water sustainability. Other processes 
(e.g., change in population size and location, economic 
development and land use, aging infrastructure, and 
changing social values) also have major influences on 
water sustainability and must be considered along with 
climate change in a holistic approach to global water 
resources management [ 1 ]. 

Climate change has the potential to affect many 
public and private sectors in which water managers 
play an active role, including water availability, water 
quality, flood risk reduction, ecosystems, coastal areas, 
navigation, other key industrial sectors such as power 
and energy, food and beverage. These changes may 
have adverse or positive impacts on one or more sec¬ 
tors. Any or all of these changes could occur gradually 
or abruptly [ 1 ]. 

Increases in temperature can affect the amount and 
duration of snow cover, which in turn, can affect timing 
of streamflow. Glaciers are expected to continue 
retreating, and many small glaciers may disappear 
entirely. Peak streamflow may move from late spring 
to early spring/late winter in those areas where snow- 
pack is important in determining water availability. 
Changes in streamflow have important implications 
for water and flood management, irrigation, and plan¬ 
ning. If supplies are reduced, off-stream users of water 
such as irrigated agriculture and in-stream users such 
as hydropower, fisheries, recreation, and navigation, 
could be most directly affected [2, 3]. 

Besides more well-known impact of climate change 
on water quantity and water supply, climate change will 
affect ambient water quality. In its 2009 National Water 
Program Strategy: Response to Climate Change , the US 
Environmental Protection Agency (USEPA) notes that 
it expects more waterbodies will be labeled “impaired” 
even if pollution levels remain stable, because warmer 
air temperatures will result in warmer water, which 
will hold less dissolved oxygen and therefore be more 
likely to foster harmful algal blooms and change the 
toxicity of some pollutants. Likewise, projected 
increases in intense rainfall and extreme water events 
would wash more nutrients, pathogens, and toxins into 
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waterbodies. Such changes may significantly deterio¬ 
rate aquatic ecosystem health in various areas over the 
next 50 years [4]. 

The objective of this entry is to link climate 
change and water sustainability with a global view, 
from both public/government and private/business 
perspectives. In addition, this entry summarizes 
future research needs for global water sustainability 
in the context of climate change based on literature 
review. 

Introduction 

Definition of Global Water Sustainability 

Sustainability has become a wide-ranging term that can 
be applied to almost every facet of life on Earth, from 
local to a global scale and over various time periods. 
There are certain conceptual foundations that have 
been developed to aid in understanding sustainability. 
The importance of the 1987 Brundtland Commission 
definition is recognized “sustainable development is 
development that meets the needs of the present without 
compromising the ability of future generations to meet 
their own needs,” which relies on maintaining equity 
between generations to help define terms. Beyond this, 
however, systems analytic concepts are embraced to 
sharpen the definition. For example, the Brundtland 
definition assumes that future generations will need 
the same mix and share of resources that are enjoyed 
at present. This may not be so if technology changes 
to require less of X and more of Y than is used at 
present [5, 6]. 

Sustainability has been defined as a property of 
a system that can be maintained in a certain state 
indefinitely. As used with regard to the environment, 
it refers to the potential longevity of human cultures 
that are supported by critical external factors, such as 
agriculture, forestry, and fisheries, all of which have 
a water component. History includes many examples 
of earlier societies that have eventually died out, 
sometimes as a result of their failure to successfully 
relate to their environment. In modern terms, a case of 
use of resources by increasing demand can be seen, 
and a fixed supply provided by a single world. The 
challenge is to determine how to create long-term 
sustainability [5, 6]. 


Daly [7] suggests the following three operational 
rules defining the condition of sustainability: 

- Renewable resources such as fish, soil, and ground 
water must be used no faster than the rate at which 
they regenerate. 

- Nonrenewable resources such as minerals and fossil 
fuels must be used no faster than renewable sub¬ 
stitutes for them can be put into place. 

- Pollution and wastes must be emitted no faster than 
natural systems can absorb them, recycle them, or 
render them harmless. 

The short way to describe sustainability requires 
that human activity, at a minimum, only uses nature’s 
resources at a rate at which they can be replenished 
naturally. 

Definition of Climate Change 

Climate is the average weather in a given place, usually 
over a period of more than 30 years. While the weather 
can change in just a few hours, climate changes occur 
over longer timeframes. Climate is defined not only by 
average temperature and precipitation, but also by the 
type, frequency, and intensity of weather events such as 
heat waves, cold waves, storms, floods, and droughts. 
Climate has natural year-to-year variations, and 
extremes in temperatures and weather events have 
occurred throughout history. 

Climate change refers to any significant change 
in measures of climate (such as temperature, precipi¬ 
tation, or wind) lasting for an extended period 
(decades or longer). Climate change might result 
from natural factors and processes or from human 
activities [8]. 

The term “climate change” is often used inter¬ 
changeably with the term global warming. Global 
warming refers to an average increase in the tempera¬ 
ture of the atmosphere near the Earth’s surface, which 
can contribute to changes in global climate patterns. 
However, rising temperatures are just one aspect of 
climate change. The phrase “climate change” is grow¬ 
ing in preferred use to “global warming” because it 
helps convey that there are changes in addition to rising 
temperatures [9]. 

The Earth’s climate depends on the balance between 
the amount of energy received from the sun and the 
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amount of energy that is absorbed or radiated back into 
space. Natural influences can alter how much heat is 
reflected or absorbed by the Earth’s surface, including 
changes in the sun’s intensity, volcanic eruptions, and 
multiyear climate cycles. Human activities such as 
deforestation and the production of greenhouse gases 
also affect this balance. These alterations, in turn, affect 
climate on local, regional, and global scales [8]. 

Key Characteristics of Water Sustainability Solutions 

Water sustainability solutions are characterized by cer¬ 
tain systems analysis properties that may indicate the 
overall stability of the system. It is important for sus¬ 
tainability that solutions to problems should be viable 
for long durations; that the system should not be 
required to undergo extreme change in short periods 
of time to reach the desired solution; that the solution 
sought is often a compromise of available options, and 
not some extreme case; and, is not overly complex. 
Even with these precautions, the most successful solu¬ 
tions often possess the additional property of revers¬ 
ibility. Only actual experience can tell if the right track 
is being followed, and if not, changes should be made. 
Whether the geographic scale is a local water problem 
or a national system, these rules tend to produce sus¬ 
tainable solutions [6, 10]. 

Water sustainability solutions are characterized by 
certain systems analysis properties that may indicate 
the overall stability of the system. It is important for 
sustainability 

- That solutions to problems should be viable for 
long durations 

- That the system should not be required to undergo 
extreme change in short periods of time to reach the 
desired solution 

- That the solution sought is often a compromise of 
available options and not for some extreme case 

- And is not overly complex 

One can imagine far more concepts than can be 
captured as indicators. Some are quantitative, but 
others qualitative and not readily measured. Therefore, 
some limited number of indicators has to be chosen. 
Too many will overwhelm the policy-making process 
and may paralyze decision-making. Too few will fail to 


describe issues in sufficiently comprehensive manner. 
There is no single set of criteria for what makes a good 
indicator. However, perhaps a simple checklist will 
make it possible to at least begin this difficult selection 
process [6]. 

Solutions to water sustainability problems are char¬ 
acterized by properties that may indicate the overall 
stability of the system. Selecting the right indicator is a 
complicated process. Some indicators are quantitative, 
but others are qualitative and not readily measured. 
Choosing the number of indicators to track is 
a delicate balance; too many will overwhelm the pol¬ 
icy-making process, while too few will fail to describe 
issues sufficiently [6]. Table 1 is based on the US 
General Accounting Office (GAO) Report “Environ¬ 
mental Indicators: Better Coordination Is Needed to 
Develop Environmental Indicator Sets That Inform 
Decisions” [11]. 

Climate Change Indicators by USEPA 

In 2010, USEPA has published a report entitled “ Cli¬ 
mate Change Indicators in the United States” to help 
readers interpret a set of important indicators to better 
understand climate change. The report presents 24 
indicators, each describing trends in some way related 
to the causes and effects of climate change. The indica¬ 
tors focus primarily on the United States, but in some 
cases global trends are presented in order to provide 
context or a basis for comparison. The indicators span 
a range of time periods, depending on data availability. 

For each indicator, this report presents one or more 
graphics showing trends over time; a list of key points; 
and text that describes how the indicator relates to 


Climate Change and Global Water Sustainability. 
Table 1 Ten criteria used to select indicators most 
frequently cited by GAO Survey Respondents [11] 


Measurable 

Data quality 

Relevant 

Importance 

Appropriate geographic 

Appropriate temporal 

scale 

scale 

Understandable 

Data comparability 

Data available 

Trend data available 
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climate change, how the indicator was developed, and 
any factor that might contribute to uncertainty in the 
trend or the supporting data. 

USEPA selected the 24 indicators presented in this 
report from a broader set of 110 indicators, many of 
which were identified at an expert workshop in 2004 on 
climate change indicators convened by the National 
Academy of Sciences and funded by USEPA. The indi¬ 
cators in this report were chosen using a set of screen¬ 
ing criteria that considered usefulness, objectivity, data 
quality, transparency, ability to show a meaningful 
trend, and relevance to climate change. 

All of the indicators selected for this report are 
based on data that have been collected and compiled 
by following rigorous protocols that are widely 
accepted by the scientific community. Various govern¬ 
ment agencies, academic institutions, and other orga¬ 
nizations collected the data. 

Global Water Sustainability and Climate Change 

Outlook of Global Water Sustainability in Year 2050 

A recent report [12] entitled “Vision 2050” by World 
Business Council for Sustainable Development 
(WBCSD) addresses three questions: What would 
a sustainable world look like in 2050? What are the 
pathways and solutions for achieving sustainability? 
What are the roles business can play in ensuring more 
rapid progress toward that world? 

As “Vision 2050” has perceived, it should be strived 
to build a world in which around 9 billion people are 
not just living on the planet, but living well and within 
the limits of the planet in 2050 [12]. By “living well,” 
WBCSD is describing a standard of living where 
people have access to and the ability to afford educa¬ 
tion, health care, mobility, the basics of food, water, 
energy and shelter, and consumer goods. By “living 
within the limits of the planet,” WBCSD means living 
in such a way that this standard of living can be 
sustained with the available natural resources and 
without further harm to biodiversity, climate, and 
other ecosystems. 

Vision 2050 seeks to provide a common under¬ 
standing so leaders can make the decisions that deliver 
the best outcomes possible for human development 
over the next 4 decades. The guiding principle is an 
attempt to help leaders across governments, businesses, 


and civil society avoid repeating mistakes of the past 
such as making decisions in isolation that result in 
unintended consequences for people, the environment, 
and planet Earth. It is also a platform for ongoing 
dialogue, so the important questions that need to be 
answered can be raised continuously in order to make 
progress in this uncharted territory. 

Although distinct, the elements also show the inter¬ 
connectedness of issues such as water, food, and 
energy - relationships that must be considered in an 
integrated and holistic way, with trade-offs that must 
be understood and addressed. 

The top three critical pathways listed in “Vision 
2050” include: 

- Addressing the development needs of billions of 
people, enabling education and economic empow¬ 
erment, particularly of women, and developing rad¬ 
ically more eco-efficient solutions, lifestyles, and 
behavior 

- Incorporating the cost of externalities, starting with 
carbon, ecosystem services, and water 

- Doubling of agricultural output without increasing 
the amount of land or water used 

The “Vision 2050” or similar documents provide an 
outlook on how a sustainable world would like in 2050 
and a forward-looking perspective on strategic path¬ 
ways toward global water sustainability. The following 
sections discuss the interlinkage between global water 
sustainability and climate change from both public/ 
government and private/business perspectives. 

Global Water Sustainability and Climate Change 
from Public Perspective 

The most recent report by the Intergovernmental Panel 
on Climate Change states that global warming will lead 
to “ changes in all components of the freshwater system ,” 
and concludes that “ water and its availability and qual¬ 
ity will be the main pressures on, and issues for, societies 
and the environment under climate change” 

The future effects of climate change on water 
resources in the United States and other parts of the 
world will depend on trends in both climatic and non- 
climatic factors. Evaluating these impacts is challenging 
because water availability, quality, and streamflow are 
sensitive to changes in temperature and precipitation. 
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Other important factors include increased demand for 
water caused by population growth, changes in the 
economy, development of new technologies, changes 
in watershed characteristics, and water management 
decisions. 

All regions of the world show an overall net negative 
impact of climate change on water resources and fresh¬ 
water ecosystems. Areas in which runoff is projected to 
decline are likely to face a reduction in the value of the 
services provided by water resources. The beneficial 
impacts of increased annual runoff in other areas are 
likely to be tempered in some areas by negative effects of 
increased precipitation variability and seasonal runoff 
shifts on water supply, water quality, and flood risks [2] . 

In addition to the typical impacts on water man¬ 
agement, climate change introduces an additional ele¬ 
ment of uncertainty about future water resource 
management. Water resources in the United States are 
heavily managed and supplies are scarce in some 
regions of the country. Strategies have been developed 
and continue to evolve to address these issues. Imple¬ 
mentation of adaptation measures, such as water con¬ 
servation, use of markets to allocate water, and the 
application of appropriate management practices will 
have an important role to play in determining the 
impacts of climate change on water resources [13, 14]. 


seasonal timing of precipitation). Generally speaking, 
in areas where precipitation increases sufficiently, net 
water supplies may not be affected or they may even 
increase. In other areas where precipitation remains the 
same or decreases, net water supplies would decrease. 
Where water supplies decrease, there is also likely to be 
an increase in demand, which could be particularly 
significant for agriculture, which is the largest con¬ 
sumer of water, and also for municipal, industrial, 
and other uses. 

Increases in temperature can affect the amount and 
duration of snow cover, which in turn, can affect timing 
of streamflow. Glaciers are expected to continue 
retreating, and many small glaciers may disappear 
entirely. Peak streamflow may move from late spring 
to early spring/late winter in those areas where snow- 
pack is important in determining water availability. 
Changes in streamflow have important implications 
for water and flood management, irrigation, and plan¬ 
ning. If supplies are reduced, off-stream users of water 
such as irrigated agriculture and in-stream users (such 
as hydropower, fisheries, recreation, and navigation) 
could be most directly affected [2]. 

In summary, the cause-effect impact analysis rep¬ 
resents four major chains of causation that may be 
expected to result from global warming [15]. 


c 


Water Availability Generally, increases in the Earth’s 
surface temperature will increase evaporation from the 
oceans and land, leading to more overall precipitation. 
However, this additional precipitation will not be dis¬ 
tributed evenly, and shifting storm patterns will likely 
cause some areas to experience more severe droughts. 
Scientists have suggested that extreme weather events 
such as storms, floods, and hurricanes will likely also 
become more intense. There is natural variability in the 
intensity and frequency of such events, however, so care 
must be taken to determine whether observed trends 
reflect long-term changes in the Earth’s climate system. 

Changes in temperature, precipitation patterns, 
and snowmelt can have impacts on water availability. 
Temperature is predicted to rise in most areas, but is 
generally expected to increase more in inland areas and 
at higher latitudes. Higher temperatures will increase 
loss of water through evaporation. The net impact on 
water supplies will depend on changes in precipitation 
(including changes in the total amount, form, and 


- First, as temperatures rise, it is expected that sea 
levels will rise due to warmer ocean temperatures 
and melting of land ice such as glaciers. 

- Next, warmer overall temperatures are expected to 
produce two important changes in seasonal condi¬ 
tions over most of the continental United States. 
Warmer and shorter winters are expected. 

- Third, warmer and drier summers are expected. 

- Lastly, warming is expected to accelerate and amplify 
the functioning of the hydrologic cycle to produce, 
among other things, more intense rainfall events. 

Water Quality Higher water temperatures and 
changes in the timing, intensity, and duration of precip¬ 
itation can affect water quality. Higher temperatures 
reduce dissolved oxygen levels, which can have an effect 
on aquatic life. Where streamflow and lake levels fall, 
there will be less dilution of pollutants. However, 
increased frequency and intensity of rainfall will produce 
more pollution and sedimentation due to runoff [2]. 
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Flood magnitudes and frequencies will very likely 
increase in most regions, mainly a result of increased 
precipitation intensity and variability. Increasing tem¬ 
peratures are expected to intensify the climate’s hydro - 
logic cycle and melt snowpacks more rapidly [2]. 
Flooding can affect water quality, as large volumes of 
water can transport contaminants into water bodies 
and also overload storm and wastewater systems. 

Higher temperatures, particularly in the summer, 
earlier snowmelt, and potential decreases in summer 
precipitation could increase risk of drought. The fre¬ 
quency and intensity of floods and droughts could 
increase, even in the same areas. 

Sea level rise may also affect freshwater quality by 
increasing the salinity of coastal rivers and bays and 
causing saltwater intrusion, movement of saline water 
into fresh ground water resources in coastal regions. 

Changes in water quality could have implications 
for all types of uses. For example, higher temperatures 
and changes in water supply and quality could affect 
recreational use of lakes and rivers or productivity of 
freshwater fisheries. Certain species of fish could find 
temperatures too warm and migrate to more northern 
or higher altitude locations where water is cooler. 

Case Studies: Lower Low Flows Expected from Climate 
Change and Its Impact on Water Quality Climate 
change is expected to make low flows more common 
in some areas, resulting in less contaminant-dilution 
capacity and higher pollutant concentrations. In many 
arid or semiarid areas (places with overall decreased 
runoff), water quality deterioration will be even more 
pronounced. 

Cruise et al. [16] assessed current and future water 
quality conditions in the US Southeast. Results showed 
that in many southeastern areas, streamflow was 
projected to decline over the next 30-50 years, exacer¬ 
bating water quality problems. Between 2020 and 2039, 
water quality conditions might worsen because of drier 
late springs, summers, and falls. 

Changes in low flows could affect Total Maximum 
Daily Load (TMDL) implementation and water quality 
compliance. Eheart et al. [17] showed that the assumed 
climate change scenarios could substantially affect low- 
flow standards in the US Midwest. For example, a 25% 
decrease in mean precipitation cut critical low flows by 
63% (without irrigation) to 100% (with irrigation). 


The frequency of single low-flow violations increased 
several-fold with irrigation. Likewise, the frequency of 
multiple low-flow violations in a 3-year period rose 
from about 20% to nearly 100% as climate change 
became more severe. The paper also noted that existing 
TMDLs for rivers and streams may become obsolete 
due to worsening low-flow events in the future. 

The extra pressures that climate change may place 
on waterbodies could increase treatment costs; how¬ 
ever, the magnitude of such costs greatly depends on 
the response elsewhere in the watershed. A 2003 study 
by USEPA’s Office of Research and Development [18] 
examined how climate change could affect the cost of 
water quality compliance and TMDL implementation. 
It focused on 235 publicly owned treatment works 
(POTWs) discharging to impaired streams or rivers 
due to organic enrichment and low dissolved oxygen 
in the Great Lakes region. These POTWs will probably 
be required to meet TMDLs more stringent than cur¬ 
rent discharge levels, and the effect of climate change 
on streamflow could result in even more stringent 
discharge limitations. This screening study suggests 
that even a small reduction in the design flow would 
have significant cost implications, that is, holding all 
other factors constant, climate change may make it 
significantly more costly to meet water quality goals 
in the Great Lakes region. 

A related study assessed the potential costs of 
reducing TMDL-driven effluent limits for biochemical 
oxygen demand (BOD) to maintain ambient water 
quality due to climate change for the water industry 
in England and Wales under the European Union’s 
Water Framework Directive [19]. Under the climate 
change impact, the low flow within receiving waters 
collecting water treatment facility effluent is expected 
to further decline. This study found that costs were 
highly sensitive to increases in stringency of effluent 
limits and the changes in design flow due to climate 
change factors added significantly to the costs of 
TMDL implementation and overall water quality 
management. 

Water Resources Planning Under 

Uncertainty Traditionally, water resource managers 
have accumulated significant experience successfully 
planning and operating their systems in the face of 
uncertainty regarding future hydrology, weather, 
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available water supply, and projected water demand. In 
recent years, however, it has become clear that today’s 
planning approaches may no longer be adequate. 
Dynamic population and economic growth have 
placed increasing pressure on water supplies and 
concerns about environmental impacts have grown [ 1 ]. 

In addition, the potential for climate change has 
confronted water managers with an additional novel 
challenge. While in any particular service area the pre¬ 
cise impacts of climate change for temperature, pre¬ 
cipitation, evaporation, storm water capture, and 
imported supplies remain deeply uncertain, it has 
become increasingly clear that past weather and hydro- 
logical records no longer provide an accurate guide to 
future weather and hydrology. 

The linkage between the rate of temperature rise 
and the timing and location of these other more com¬ 
plex effects on Earth systems is where the science is 
most challenged and the uncertainties are greatest [ 1 ]. 
Many effects may be expected to advance gradually as 
a function of gradual temperature rise, and yet the 
same phenomena may be subject to stochastic influ¬ 
ences that could produce extreme events much sooner 
than implied by the gradual trend. 

Climate change presents a much greater degree of 
uncertainty than is typically encountered in facilities 
and operations planning. How do you prepare for 
changes of unknown magnitude on an unknown time¬ 
line? There is a great temptation to wait and see if 
climate science can be improved to provide a better 
basis for action. But, it would be a mistake to just 
ignore climate change and resume a business as usual 
approach to planning. 

The uncertainty in climate forecasts is magnified 
many times by the complexities of tracing climate 
effects through the subsequent hydrologic and environ¬ 
mental processes that produce the changed operating 
conditions that wastewater and storm water agencies 
will have to face [15]. 

The familiar risk management paradigm consists of 
three steps: (1) risk identification, (2) risk assessment/ 
characterization, and (3) risk management. The appli¬ 
cation to climate change adaptation planning is quite 
unique. It is necessary to first take the problem apart 
and examine it piece by piece to perform a thorough 
risk identification analysis. The possible impacts of 
increasing temperatures are far reaching when all the 


secondary effects on hydrologic and environmental 
processes are taken into account. 

Near-Term and Long-Term Climate Risks for Water 
Infrastructure Climate variations can directly or 
indirectly affect many aspects of human society - in 
both positive and disruptive ways. For example, 
warmer temperatures might reduce heating costs and 
improve conditions for growing some crops; yet, 
extreme heat can cause illness or death among 
vulnerable populations. Precipitation can replenish 
water supplies and nourish crops, but intense storms 
can damage property, cause loss of life and population 
displacement, and temporarily disrupt essential 
services such as transportation, telecommunications, 
and energy and water supplies. 

Climate change certainly involves some near-term 
threats to existing assets that will call for protective 
measures to sustain them. However, the current infra¬ 
structure configuration (including water infrastruc¬ 
ture) may not be the most sustainable choice for 
the long term. Future generations may need a differ¬ 
ent type of infrastructure that is designed from the start 
to be more sustainable in a changing environment, 
involving nontraditional types of investments such as 
decentralized treatment, green infrastructure, water¬ 
shed buffers, and wetlands [15]. 

The prospect of climate change sometimes evokes 
a misperception that the sky is falling and leads people 
to skip right over the risk characterization step and 
begin evaluating adaptation options as though every¬ 
thing is happening at once. It is prudent to first under¬ 
take a vulnerability analysis to assess how soon the 
impacts may materialize at a level that represents 
a meaningful threat to existing or planned facilities 
and operations in order to define the most appropriate 
adaptive responses. In some instances, adaptation may 
become a multigenerational strategy where the respon¬ 
sibility of current water managers is limited to laying 
the right planning foundation to enable selection and 
implementation of the best adaptations by future gen¬ 
erations [15]. 

Climate Change Indicators Relating to Global Water 
Sustainability Environmental indicators are a key 
tool for evaluating existing and future programs and 
supporting new decisions with sound science. In the 
years to come, the indicators in this report will provide 


c 




2068 


c 


Climate Change and Global Water Sustainability 


data to help the federal agencies decide how best to use 
its policy making and program management resources 
to respond to climate change. Ultimately, these 
indicators will help government agencies evaluate the 
success of their climate change efforts. 

USEPA selected the 24 indicators presented in this 
report from a broader set of 110 indicators, many of 
which were identified at an expert workshop in 2004 on 
climate change indicators convened by the National 
Academy of Sciences and funded by USEPA. The indi¬ 
cators in this report were chosen using a set of screen¬ 
ing criteria that considered usefulness, objectivity, data 
quality, transparency, ability to show a meaningful 
trend, and relevance to climate change [8]. 

The climate change indicators are divided into five 
categories: 

- Greenhouse Gases : The indicators characterize the 
amount of greenhouse gases emitted into the 
atmosphere through human activities, the con¬ 
centrations of these gases in the atmosphere, and 
how emissions and concentrations have changed 
over time. 

- Weather and Climate : This entry focuses on indica¬ 
tors related to weather and climate patterns, includ¬ 
ing temperature, precipitation, storms, droughts, 
and heat waves. These indicators can reveal long¬ 
term changes in the Earth’s climate system. 

- Oceans : The world’s oceans have a two-way rela¬ 
tionship with climate. The oceans influence climate 
on regional and global scales, while changes in 
climate can fundamentally alter certain properties 
of the ocean. The indicators in this category exam¬ 
ine trends in ocean characteristics that relate to 
climate change, such as acidity, temperature, heat 
storage, and sea level. 

- Snow and Ice : Climate change can dramatically alter 
the Earth’s snow- and ice-covered areas. These 
changes, in turn, can affect air temperatures, sea 
levels, ocean currents, and storm patterns. The indi¬ 
cators in this category focus on trends in glaciers; 
the extent and depth of snow cover; and the freezing 
and thawing of oceans and lakes. 

- Society and Ecosystems: Changes in the Earth’s cli¬ 
mate can affect public health, agriculture, energy 
production and use, land use and development, and 
recreation. Climate change can also disrupt the 


functioning of ecosystems and increase the risk of 

harm or even extinction for some species. 

Among the major categories of climate change indi¬ 
cators mentioned above, oceans and snow and ice are 
closely related to water sustainability, besides weather 
and climate. Summary finding and more detailed dis¬ 
cussions are given as follows based on USEPA [8]. 

The oceans and the atmosphere interact constantly, 
both physically and chemically, exchanging heat, water, 
gases, and particles. This relationship influences the 
Earth’s climate on regional and global scales. It also 
affects the state of the oceans. 

Covering nearly 70% of the Earth’s surface, the 
oceans store vast amounts of energy absorbed from 
the sun and move this energy around the globe through 
currents. As greenhouse gases trap more energy from 
the sun, the oceans will absorb more heat, resulting in 
an increase in sea surface temperatures, rising sea levels, 
and possible changes to ocean currents. These changes 
will very likely lead to alterations in climate patterns 
around the world. For example, warmer waters pro¬ 
mote the development of more intense storms in the 
tropics, which can cause property damage or loss of life. 

The oceans are also a key component of the Earth’s 
carbon cycle. Over geological time, much of the world’s 
carbon has come to reside in the oceans, either within 
plants and animals or dissolved as carbon dioxide. 
Although the oceans can help lessen climate change 
by storing a significant fraction of the carbon dioxide 
that human activities emit into the atmosphere, 
increasing levels of dissolved carbon dioxide can 
change the chemistry of seawater and harm certain 
organisms. These effects, in turn, could substantially 
alter the biodiversity and productivity of ocean 
ecosystems. 

Changes in ocean systems generally occur over 
much longer time periods than in the atmosphere, 
where storms can form and dissipate in a single day. 
The interactions between ocean and atmosphere occur 
slowly, over many years, even decades. For this reason, 
even if greenhouse gas emissions are stabilized tomor¬ 
row, it will take many more years, decades or centuries, 
for the oceans to adjust to the climate changes that have 
already occurred. 

Even if greenhouse gas emissions are stabilized 
tomorrow, it will take many more years - decades or 
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centuries - for the oceans to adjust to the climate 
changes that have already occurred. 

Example climate change indicators related to 
oceans include: 

- Sea Surface Temperature: The surface temperature 
of the world’s oceans increased over the twentieth 
century. Even with some year-to-year variation, the 
overall increase is statistically significant, and sea 
surface temperatures have been higher during the 
past 3 decades than at any other time since large- 
scale measurement began in the late 1800s. 

- Sea Level Rise: When averaged over all the world’s 
oceans, sea level has increased at a rate of roughly 
six-tenths of an inch per decade since 1870. The rate 
of increase has accelerated in recent years to more 
than an inch per decade. Changes in sea level rela¬ 
tive to the height of the land vary widely because the 
land itself moves. Along the US coastline, sea level 
has risen the most relative to the land along the 
Mid-Atlantic coast and parts of the Gulf Coast. 
Sea level has decreased relative to the land in parts 
of Alaska and the Northwest. 

- Ocean Acidity: The ocean has become more acidic 
over the past 20 years, and studies suggest that the 
ocean is substantially more acidic now than it was 
a few centuries ago. Ocean acidification already is 
degrading seawater quality and adversely affecting 
marine ecosystems. A 2004 report on ocean acidifi¬ 
cation by the United Kingdom’s Royal Society esti¬ 
mates that pH will average 7.65 by 2100. Rising 
acidity is associated with increased levels of carbon 
dioxide dissolved in the water. Changes in acidity 
can affect sensitive organisms such as corals. 

- In 2007, environmental group in the United States 
filed petitions asking eight coastal states to address 
the effects of climate change on ocean water quality 
(such as ocean acidification), list certain segments 
of the Atlantic and Pacific Oceans as impaired for 
pH, and establish TMDLs for carbon dioxide. This 
is a creative approach to reducing air emissions of 
carbon dioxide. USEPA formally responded in early 
2009, setting out a public process to evaluate ocean 
acidification’s effects on water quality and deter¬ 
mine whether the current pH criterion should be 
modified. If USEPA strengthens the pH criterion for 
oceans, then states will have to follow suit because 


the Clean Water Act requires state standards to be at 
least as protective as USEPA ones. Moreover, states 
are required to designate waterbodies that do not 
meet water quality standards as “impaired” and 
take action to limit their pollution. Stronger water 
quality standards for pH could translate into mea¬ 
sures that regulate pollution from carbon dioxide, 
the primary cause of ocean acidification. 

Furthermore, climate change can dramatically alter 
the Earth’s snow- and ice-covered areas. Unlike other 
substances found on the Earth, snow and ice exist 
relatively close to their melting point and can change 
from solid to liquid and back again. As a result, 
prolonged warming or cooling trends can result in 
observable changes across the landscape as snow and 
ice masses shrink or grow over time. 

Changes in snow and ice cover, in turn, affect air 
temperatures, sea levels, ocean currents, and storm 
patterns. For example, melting polar ice caps add 
fresh water to the ocean, increasing sea level and pos¬ 
sibly changing currents that are driven by differences in 
temperature and salinity. Because of their light color, 
snow and ice reflect more sunlight than open water or 
bare ground, so a reduction in snow cover and ice 
causes the Earth’s surface to absorb more energy from 
the sun. 

Changes in snow and ice could not only affect 
communities and natural systems in northern and 
Polar Regions, but also have worldwide implications. 
For example, thawing of frozen ground and reduced sea 
ice in the Arctic could affect biodiversity on local and 
global scales, leading to harmful effects not only on 
polar bears and seals, but also on migratory species 
that breed or feed in these areas. These same changes 
could affect human societies in several ways, such as by 
compromising food availability. Reduced snow cover 
could diminish the beneficial insulating effects of snow 
for vegetation and wildlife, while also affecting water 
supplies, transportation, cultural practices, travel, and 
recreation for millions of people. 

Example climate change indicators related to snow 
and ice include: 

- Glaciers: Glaciers in the United States and around 
the world have generally shrunk since the 1960s, 
and the rate at which glaciers are melting appears 
to have accelerated over the last decade. Overall, 
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glaciers worldwide have lost more than 2,000 cubic 
miles of water since 1960, which has contributed to 
the observed rise in sea level. 

- Snow Cover : The portion of North America covered 
by snow has generally decreased since 1972, 
although there has been much year-to-year variabil¬ 
ity. Snow covered an average of 3.18 million square 
miles of North America during the years 2000- 
2008, compared with 3.43 million square miles dur¬ 
ing the 1970s. 

- Snowpack : Between 1950 and 2000, the depth of 
snow on the ground in early spring decreased at 
most measurement sites in the western United 
States and Canada. Spring snowpack declined by 
more than 75% in some areas, but increased in a 
few others. 

Global Water Sustainability and Climate Change 
from Business Perspective 

Business Risks of Water and Climate Change 

Despite a growing consensus among climate experts 
that freshwater is one of the resources most vulnerable 
to long-term climate change, there is little awareness 
and discussion about the potential consequences for 
businesses and their shareowners. For all of its impor¬ 
tance, to sustain the fast-growing global population 
and to ensure future prosperity, few companies and 


investors are thinking strategically about the profound 
business risks that will exist in a world where climate 
change is likely to exacerbate already diminishing water 
supplies [20]. Climate change is likely to exacerbate 
water availability and quality, which will have a wide 
range of implications for business (Table 2). 

It should be noted that climate change will likely 
exacerbate all three types of risks: physical, reputa¬ 
tional, and regulatory [20]. 

Physical Risks Water scarcity directly affects business 
operations, raw material supply, intermediate supply 
chain, and product use in a variety of ways. Declines or 
disruptions in water supply can undermine industrial 
and manufacturing operations where water is needed 
for production, irrigation, material processing, 
cooling, washing, and cleaning. Drought attributable 
in significant part to climate change is already causing 
acute water shortages in large parts of Australia, Asia, 
Africa, and the United States. 

Water quality risks are often overlooked but may 
have significant financial implications. The quality of 
process water is critical in many industrial produc¬ 
tion systems, and contaminated water supply may 
require additional investment and operational costs 
for pretreatment. 

Water scarcity, changes in precipitation patterns, 
and glacier melt caused by climate change directly 


Climate Change and Global Water Sustainability. Table 2 Business impacts from climate change and water sustain¬ 
ability (Adapted from [20, 21]) 



Climate change impact on 
water sustainability 

Business impacts could include 

1 

Increasing water demand 

- Higher costs for water 

- Regulatory caps for water use 

- Conflicts with local communities and other large-scale water users 

- Growing demand for water-efficient products and technologies 

2 

Water scarcity and unsustainable 
supply 

- Impacts on future growth and license to operate 

- Decreased amount of water available for business activities 

- Operational disruptions and associated financial loss 

- Increased costs for water 

3 

Declining water quality 

- Increased costs for pretreatment to obtain desired water quality 

- Increased costs for wastewater treatment to meet more stringent regulations 

- Regulatory restrictions for specific industrial activities and investments 

- Increased responsibility (and costs) to implement community water 
infrastructure and watershed restoration projects to mitigate reputational risks 

- Increased health costs for employees in the countries that are impacted 
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affect power generation, curtailing hydro-based power 
production, and also impacting any power plants that 
run steam turbines. Whether powered by coal, natural 
gas, or nuclear energy, electricity generation relies on 
having an adequate supply of cooling water. Businesses 
that depend on highly reliable energy from these power 
sources will be at risk. 

Physical water resource constraints make compa¬ 
nies more susceptible to reputational risks. Declines in 
water availability and quality can increase competition 
for clean water. In water-scarce regions, tensions can 
arise between businesses and local communities, 
particularly in developing countries where local 
populations often lack access to safe and reliable drink¬ 
ing water. Community opposition to industrial water 
withdrawals and perceived or real inequities in use can 
emerge quickly and affect businesses profoundly. 

Reputational Risks Reputational risks increase as 
people become more aware of their rights to access 
water. The concept of “access to clean water as a 
human right” is gaining more recognition globally. 
However, international and local businesses may find 
themselves using copious amounts of water in regions 
where people lack sufficient water to meet basic needs. 

Growing awareness around the ecological impacts 
of water withdrawal and discharge increases both rep¬ 
utational and regulatory risks. Healthy freshwater eco¬ 
systems are an essential part of local communities and 
livelihoods, not only by serving as a source of clean 
drinking water, but also by providing cultural, social, 
aesthetic, and economic value. As a result, significant 
water withdrawal or wastewater discharge, regardless of 
the extent of actual impacts on the neighboring com¬ 
munities or ecosystems, inevitably increases the risk of 
conflict with local communities. Moreover, the vulner¬ 
ability of ecosystems is expected to increase due to 
climate change. 

Regulatory Risks Physical and reputational pressures 
affecting water availability and wastewater discharge 
can result in more stringent water policies. Water 
scarcity, coupled with increased concern among local 
communities about water withdrawals, will put pressure 
on local authorities and policymakers to consider water 
reallocations, regulations, and development of water 
markets that cap usage, suspend permits to draw water, 
and lead to stricter water quality standards. 


Water scarcity will increase water prices. Among 
other factors, water scarcity is driving shifts toward 
full-cost pricing aimed at providing economic incen¬ 
tives for efficient water use. In many places, artificially 
low water prices are rising as subsidies are phased out. 

Utilities in industrial countries are increasingly 
implementing tiered rate system, where water users 
pay more for increased consumption. These tariff 
structures are specifically designed to encourage com¬ 
mercial and industrial users to use water efficiently. 

Water-intensive products and services face increased 
sociopolitical risks. As water scarcity becomes a serious 
problem in many parts of the world, there may be 
corollary pressure, both regulatory and reputational, 
on products that require a significant quantity of 
water. Products and services that require large 
amounts of water or energy to produce or to use may 
be phased out by law, lose market share to less water¬ 
intensive products, or cause reputational damage for 
the company. 

Taken together, this means that businesses will face 
vastly increased uncertainty about the availability and 
quality of their water supplies. One of the strongest 
conclusions in the latest IPCC report is that “ climate 
change will challenge the traditional assumption that 
past hydrological experience provides a good guide to 
future conditions .” Therefore, it will become increas¬ 
ingly crucial for businesses to incorporate climate 
change factors when assessing and managing their 
water risks. 

Example of Climate Change Impact on Industrial 
Sectors For most industry sectors, the largest portion 
of their water footprint is embedded in the production 
of raw materials such as food crops, fibers, and metals. 
Many companies’ raw material production lies far 
upstream from direct operations. As a result, they typ¬ 
ically fail to assess water-related risks in this segment of 
their value chains, according to Ceres and Pacific Insti¬ 
tute (2009). Severe drought, flooding, or changes in 
precipitation patterns due to climate change can 
decrease agricultural yields and quality, which may 
increase input costs. In addition, water scarcity and 
increased competition for freshwater resources can 
change pricing structures or subsidies for irrigation 
water for crop or livestock production, which can also 
drive up costs. 
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Beverage Sector Potable water is the primary and 
most important ingredient for the majority of 
beverage products, making beverage companies’ direct 
operations especially vulnerable to water availability 
and quality concerns. Beverage manufacturing requires 
high-quality source water, putting the water use of this 
industry in direct competition with local populations 
and their drinking water needs. Large water withdrawals 
can result in controversies in regions where water is 
scarce and local populations lack access to affordable 
drinking water. 

In cases of severe shortages or contamination of 
drinking sources, bottling and manufacturing facilities 
risk shutdowns, as PepsiCo and Coca-Cola bottlers expe¬ 
rienced in Kerala, India. Public concern about clean 
water access is impacting bottled water sales. Consumer 
demand is soaring in developing countries, but falling 
in large developed countries such as the United States. 

Food Sector The industry’s most significant water- 
related exposure is in raw material production (i.e., 
precipitation and irrigation needed to grow food and 
maintain pastureland for grazing). Many of the world’s 
croplands are in semiarid areas that are expected to 
become drier due to climate change. 

Among the regions at risk is the High Plains aquifer 
that provides water for 27% of the irrigated land in the 
United States and 70-90% of the irrigation water for 
three of the country’s top producing grain states - 
Texas, Kansas, and Nebraska. Human-induced stresses 
on the aquifer have already resulted in water table 
declines greater than 100 ft in some areas. 

Increased temperature and dry weather due to cli¬ 
mate change will raise water requirements for livestock 
whose numbers are growing as global demand for meat 
increases. As water problems become more severe, the 
impacts of agriculture generally will draw even more 
attention. 

Higher water temperature due to climate change 
may increase the concentration and variety of water¬ 
borne pathogens. As a result, food supplies may face 
greater risk of contamination and recall, disrupting the 
supply of ingredients for food manufacturers and nega¬ 
tively affecting consumer confidence in food products. 

Electric Power/Energy Sector The electric power 
industry requires a consistent supply of water. 
Thermoelectric-power withdrawals account for 48% of 


total water use, 39% of total freshwater withdrawals for 
all categories in the United States [22]. Furthermore, 
heated discharges from power plants have a harmful 
effect on water quality and local ecosystems, which is 
only exacerbated as water levels drop. 

Hydropower plants are at increased risk due to 
decreases or changes in water flow caused by climate 
change. Hydropower generation is likely to be most 
directly affected by climate change because it is sensi¬ 
tive to the amount, timing, and geographical pattern of 
precipitation and temperature. For example, in parts of 
the United States such as Alaska and the Rocky Moun¬ 
tains, natural water storage in snowpack and glaciers 
has been reduced. Climate change may also increase 
evaporation rates of reservoirs in arid parts of the 
United States such as Lake Mead and Lake Powell on 
the lower Colorado River [23]. 

The sector has significant water-related risks 
embedded in raw material supply. Extraction and 
processing of fossil fuels require large volumes of 
water. Water scarcity or a change in water supply due 
to climate change may disrupt fuel supply or increase 
the cost of fuel for power generation. 

What Businesses Can Do to Manage Water-Climate 
Risks Strategically The impacts of companies’ water 
use vary greatly depending on local hydrological, social, 
economic, and political factors. Unlike greenhouse gas 
emissions, which have the same impacts regardless of 
where the emissions are generated, one unit of water is 
not equal to another. The same amount of water with¬ 
drawn in an arid urban area versus a rural wet region 
has completely different impacts and associated risks 
for companies. The absolute volume of water used is 
often less important than the timing of its use. Because 
of water’s regional and timing-specific character, com¬ 
panies need to convert water footprint data into actual 
water impacts and risks. 

As mentioned earlier, three main types of water- 
related risks include: 

1. Physical risks: Local hydrological conditions (poten¬ 
tial shortage risks, water quality risks, flooding risks, 
and possible impacts of climate change on future 
water supply and demand) 

2. Regulatory risks: Socioeconomic conditions as they 
relate to water (trends in regional demand, local 
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water governance capacity, and regional water pric¬ 
ing) and potential regulatory costs 

3. Reputational risks: Potential impacts of a 
company’s water withdrawal or wastewater dis¬ 
charge on local communities and ecosystems, and 
disparities or inequities in local and regional 
water access 

To evaluate and effectively address water and climate 

change risks, business and private sectors can take the 

following actions, based on a synthesis of [20, 21, 24]. 

1. Measure the company’s water footprint (i.e., water 
use and wastewater discharge) throughout its entire 
value chain, including suppliers and product use. 
Companies can only mange what they measure, so 
in order to accurately assess risks and opportuni¬ 
ties, a first step for companies is to conduct a 
comprehensive and integrated water and carbon 
accounting. By aligning measurement of water and 
carbon/energy, businesses can identify how the three 
are interlinked, providing key basic information for 
developing a holistic management strategy. 

2 . Assess physical, regulatory, and reputational water 
risks associated with climate change. Explicit atten¬ 
tion should be paid to understanding any potential 
competing demands the company may have for 
water and energy. If possible, companies should 
also seek to align their water and climate risk assess¬ 
ments. Regardless, having a detailed understanding 
of local water conditions, including hydrological, 
social, economic, and political factors, can give 
companies room to anticipate and plan for a wide 
range of scenarios. 

For key areas of operation and sourcing in high- 
risk regions, contingency plans should be developed 
to respond to risks such as decreasing water avail¬ 
ability and quality, higher water prices, extreme 
hydrologic events, and local economic develop¬ 
ment. Potential climate-related impacts should be 
explicitly considered in contingency plans. Contin¬ 
gency plans should also include demand-side and 
supply-side strategies. For example, conservation 
and efficiency improvement measures or process 
modifications that reduce or eliminate water use 
will decrease the company’s dependency on fresh¬ 
water supplies. Supply-side measures include col¬ 
laboration with local authorities and stakeholders 


to improve local water security and participation in 
integrated water management or ecosystem resto¬ 
ration to protect local water sources. 

3. Integrate water and climate issues into strategic 
business planning and governance structures. 
When developing water management plans, com¬ 
panies will need to consider and integrate the 
potential impacts of climate change on water sup¬ 
plies and water quality. Climate-related impacts on 
water should also be considered when making 
a range of strategic business decisions from factory 
design and siting to new product development. 
Companies should also evaluate the energy impli¬ 
cations of water management plans and strategies 
and seek combined and integrated solutions to 
water and energy. 

4. Engage key stakeholders as a part of water and 
climate risk assessment, long-term planning, and 
implementation activities. When developing a cor¬ 
porate water and climate change management plan, 
managers can benefit from sharing information with 
employees, investors, customers, local communities, 
and other key stakeholders in order to gain valuable 
feedback. Through early and continuous engage¬ 
ment with concerned stakeholders, companies can 
better understand, anticipate, and respond to emerg¬ 
ing issues and expectations. Open dialogue with 
water providers and local communities may also be 
helpful in preventing and reducing the risk of future 
water and climate change-related disputes or disrup¬ 
tions. Such discussions may also identify pivotal 
inputs that help prioritize action steps. 

5. Disclose and communicate water performance and 
associated risks. Companies should publicly report 
management activities and key metrics on their 
water and energy performance. This information 
can help shareholders and stakeholders assess how 
companies are addressing their water and climate 
change risks. Such metrics are also a useful tool for 
engaging employees across the enterprise. 

6 . Seek opportunities for collective action, including 
public and private partnership. Because water 
is a shared resource, its management can be 
a sensitive social, cultural, and environmental 
issue, particularly in times of drought and water 
restrictions. Companies can rarely achieve the best 
water management outcomes on their own, as most 
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solutions to water supply, quality, and sanitation 
issues require comanagement approaches involving 
collective action and partnerships. Such approaches 
can give companies competitive advantages 
through alignment of their corporate water strate¬ 
gies with public policy goals and multi-stakeholder 
initiatives. These activities can include collabora¬ 
tions with local communities, NGOs, and govern¬ 
ment agencies, as well as peer-to-peer collaborations 
with other businesses. 

The United Nations, in concert with national gov¬ 
ernments, has established a set of goals, known as the 
Millennium Development Goals. One of these goals is 
to reduce by half the number of people worldwide 
without access to clean water or adequate sanitation 
by 2015. There is a growing recognition that businesses 
are well positioned to play a role in achieving these 
goals. The public, in turn, has growing expectations 
that the private sector, often perceived as complicit in 
global water threats, should do its part regionally and 
internationally to address these challenges. Especially 
in regions under high water stress, or where substantial 
populations lack safe and affordable water, there is 
increased pressure for companies to work with local 
stakeholders, including water agencies, community 
groups, and other industry water users to share and 
manage limited resources more equitably and 
efficiently. 

In addition, The CEO Water Mandate established in 
2007 is one of the most comprehensive and visible 
cross-sectoral, public-private partnerships on water. 
It represents both a call-to-action and a strategic frame¬ 
work for responsible water management by business. It 
is voluntary in nature, but is built around six core areas 
of responsibility with which its endorsers must commit 
to and demonstrate improvement: direct operations, 
supply chain and watershed management, collective 
action, public policy, community engagement, and 
transparency. 

Future Directions 

Climate change has the potential to have adverse or 
positive impacts on many sectors in which water 
resource managers play an active role, including water 
availability, water quality, floods, ecosystems, coastal 
areas, power and energy, and other industrial sectors [ 1 ]. 


Data Needs for Climate Change Projection 

Water resources planning involves making assump¬ 
tions about future plausible hydroclimatic conditions 
(e.g., temperature, precipitation, and river flows). The 
choice of information supporting these assumptions is 
affected by a changing climate. While science is not 
capable of predicting the exact magnitude of the 
changes, there are methods to characterize the range 
of possible changes [1]. 

The instrumental record has historically been vital 
in planning for both norms and extremes, as it provides 
information on the conditions under which water 
resource systems may operate. Given a changing cli¬ 
mate, it may be appropriate to evaluate the system 
response for a range of hydroclimatic variability wider 
than in the historical record. 

Paleoclimate information and stochastic modeling 
can be useful for developing climate scenarios that 
include a wide range of potential hydroclimatic condi¬ 
tions. The expanded variability may allow a more 
robust evaluation of planning alternatives, particularly 
when there is concern that study outcomes and deci¬ 
sions may be sensitive to climate assumptions. 

Current expectations about future climate may 
indicate a need to supplement historical climate infor¬ 
mation. Planning assumptions might instead be related 
to projections of future temperature and precipitation. 
This can be accomplished using a multitude of 
approaches, although a best approach has yet to be 
determined. Incorporating climate projections gener¬ 
ally involves surveying available climate-projection 
information, deciding what portion to relate to plan¬ 
ning assumptions, and conducting the intervening 
impact analysis on natural and social systems. These 
results are then related to planning assumptions about 
supplies, demands, and operating constraints. In many 
planning studies, assumptions are developed relative to 
a stationary reference climate. 

Those engaged in planning need to share informa¬ 
tion, plan together, and collaboratively modify 
existing policies and procedures to ensure efficient 
and effective solutions. The exchange of information, 
resources, best practices, and lessons learned across 
jurisdictional lines and among different groups of 
stakeholders is a key element of successful adaptation 
planning [25]. 
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Adaptive Management Approaches for Decision- 
Making 

Water resource managers make decisions in the face of 
uncertainty associated with varying space and time 
scales, and they make assumptions about supplies, 
demands, weather, climate, and operational constraints 
at those scales. However, current planning approaches 
have generally assumed that future climate conditions 
will be similar to the historical record, an assumption 
that may be suspect if climate is changing. A robust 
decision criterion supports selection of plans that will 
perform well over a wide range of possible future sce¬ 
narios, although uncertainties will remain no matter 
how future scenarios are generated. A sequential deci¬ 
sion process allows alternative courses of action, given 
different possible future conditions, and can be 
changed as new information becomes available. 

Adopting alternatives that perform well over a wide 
range of future scenarios could improve system flexi¬ 
bility. These approaches are not mutually incompatible 
and can be used in conjunction with current water 
management planning methods that primarily employ 
cost-benefit analysis and sensitivity analysis. 

Adaptive management is an approach where deci¬ 
sions are made sequentially over time and allows 
adjustments to be made as more information is 
known. Incorporation of adaptive management can 
build in flexibility and reevaluation of decisions that 
evolve over time in response to new information. The 
use of multiple scenarios in the context of robust and 
adaptive planning will enhance decision-making, par¬ 
ticularly if the scenarios span a wide range of possible 
outcomes. 

Adaptation options include operational, demand 
management, and infrastructure changes. Because cli¬ 
mate has such a large effect on water resource system 
design and operations, it is apparent that climate 
change would translate into modified design and oper¬ 
ational assumptions for determining resource supplies, 
system demands, system performance requirements, 
and operational constraints. Several water manage¬ 
ment or system development options might be consid¬ 
ered to facilitate adaptation under climate change, 
including operational, demand management, and 
infrastructure changes. Options should be evaluated 
across appropriate spatial areas; specific options will 


vary from system to system, as will the preference 
among these options. 

Business action on climate change has primarily 
focused on managing the risks and opportunities asso¬ 
ciated with emerging regulations and changing market 
demands. But as recognition grows that some climate 
impacts are already occurring, and many more are 
likely inevitable, companies are beginning to develop 
adaptation plans to complement existing climate 
strategies. 

Incorporating the Concept of Non-stationarity 

Non-stationarity presents many challenges to the water 
resource managers. The assumption of stationarity has 
been used throughout the development of infrastruc¬ 
ture for managing water resources. Many other types of 
analyses are also integrally linked to stationary concepts 
(e.g., hydrologic statistics such as flood frequency, low- 
flow frequency, and flow duration, and design criteria). 
Widely accepted methods for incorporating non- 
stationarity do not yet exist. While research proceeds, 
efforts can be made to bring analyses up to date, so that 
they use the most recently available data. 

Climate change is expected to make certain aspects 
of water availability and water quality estimation more 
uncertain. For example, “ stationarity ; the idea that nat¬ 
ural systems fluctuate within an unchanging envelope of 
variability, is dead and should no longer serve as a central, 
default assumption in water-resource risk assessment and 
planning,” according to US Geological Survey and 
National Oceanic and Atmospheric Administration sci¬ 
entists in a February 6, 2008, Science article [26]. 

Nonstationary hydrologic variables can be modeled 
stochastically to describe the temporal evolution of 
their probability distributions, with uncertainty esti¬ 
mates. For example, water management professionals 
could define plausible planning futures based on cli¬ 
mate projection data and then expand the set of such 
futures via stochastic modeling. However, such down- 
scaled projections only should be used in water quality 
management programs when the planning horizon 
extends at least several decades into the future and the 
projections contain information reliable enough for 
planning purposes. 

Using such projection techniques and data implies 
that the reference period underlying the stochastic 
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model is relevant to the planning period. If there is a lot 
of uncertainty about how a particular factor might 
affect a decision that could produce undesirable out¬ 
comes, then stakeholders should conduct activities to 
reduce that uncertainty. 

Changes to planning and analysis that better 
accommodate non-stationarity will improve water 
management. Collaboration in all these activities may 
allow more rapid results and improved communica¬ 
tion both within the water resources community and to 
other stakeholders. 

For example, if non-stationarity is incorporated 
into water quality management, water management 
professionals should adopt a more innovative approach 
for accepting the growing list of impaired waters that 
will require a TMDL because of climate change. They 
should assess the TMDL, its implementation plan, and 
the effectiveness of both under projected climate- 
change scenarios and related management responses. 
If u stationarity should no longer serve as a central, default 
assumption in water-resource risk assessment and plan¬ 
ning ,” then trade-offs need to be made between ambi¬ 
ent water quality simulations based on long-term 
historical data and those based on stochastic models 
of such data. By better understanding how to define 
critical conditions and worst-case scenarios, water 
management professionals can formulate appropriate 
strategies to control point and nonpoint source pollu¬ 
tion to meet TMDL standards. 

Meanwhile, TMDL and National Pollutant Dis¬ 
charge Elimination System (NPDES) permit regula¬ 
tors will need to adapt by considering the long-range 
implications of climate change on waterbody impair¬ 
ment and revising permit limits and guidance 
accordingly. Some water quality standards and pol¬ 
lutant-specific goals may have to change to provide 
the environmental protection needed in an altered 
climate. Ongoing studies should give policymakers 
the information and tools needed to adapt cost- 
effectively [4]. 

Opportunities for Future Research 

There are significant gaps in knowledge, monitoring, 
and practice that limit incorporation of climate change 
considerations into water resources planning and man¬ 
agement. Research and monitoring are both needed to 


fill knowledge gaps and set up advances in planning 
capabilities. Although neither will eliminate all uncer¬ 
tainties, they will provide significant improvements in 
understanding the effects of climate change on water 
resources, including quantity and quality, and in eval¬ 
uating associated uncertainties and risks required for 
more informed decision-making [1]. 

Many uncertainties remain with respect to climate 
change and its potential impacts on water resources. 
Major gaps must be bridged to achieve success in 
addressing the effects of climate change. Additional 
research may narrow uncertainties, provide informa¬ 
tion in more useable forms, or develop more robust 
strategies for incorporating uncertainty into decision¬ 
making. However, additional research will not 
eliminate uncertainty entirely, and robust and flexible 
operations and planning approaches can be used to 
help manage water resources effectively despite this 
uncertainty. 

Recognize that adaptation must happen at local and 
regional levels. Climate changes and their associated 
impacts vary greatly from location to location. 
Although national and international action is essential, 
many important decisions about how best to manage 
systems affected by climate change are made at local 
and regional levels. 

At press time, there was no “best” procedure for 
considering projected climate information in either 
class of long-range decision-making. Until there is less 
uncertainty or time reveals which methods best char¬ 
acterize future conditions, an array of reasonable 
approaches exist. 

However, many of the prioritized gaps address 
themes common to multiple-resource management 
issues [1]: 

- Developing downscaled climate-projection infor¬ 
mation at finer spatial-temporal resolutions than 
what is already available (e.g., sub-monthly and less 
than 1/8° resolution), for more variables, and using 
methods that do not rely on “stationarity” between 
the relation of large-scale atmospheric circulation 
and local land-surface climate 

- Improving the ability to assess and anticipate 
region-specific impacts of climate change on land 
cover and ecosystems and potentially land-cover 
feedbacks on climate 
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- Improving the ability to assess the potential 
response of regional precipitation and hydrologic 
extremes to global climate change 

- Improving the understanding of how social systems 
mediate climate change impacts and establish 
expectations for water resource system perfor¬ 
mance and reliability, which would presumably 
need to be preserved under a changing climate 

Climate change must be quantified with respect to 
the myriad of other natural and cultural issues that face 
the water managers. Sound water management is built 
on long-term hydrological and meteorological moni¬ 
toring networks that provide sound, accurate, timely, 
and consistent data that can be used readily to develop 
and assess decision-making tools needed to quantify 
uncertainty, forecast change, and create the multiphase, 
multilevel climate scenarios that will provide reason¬ 
able and relevant management. Changes to planning 
and analysis that better accommodate non-stationarity 
will improve water management. Collaboration in all 
of these activities may allow more rapid results and 
improved communication, both within the water 
resources community and to other stakeholders. 

In summary, climate change is one of many dynamic 
processes impacting water resources management. A 
holistic approach must be considered on climate change 
impact on water resources management. Adaptation ser¬ 
vices are emerging as governments, businesses, and com¬ 
munities worldwide are recognizing the need to address 
current and potential climate change impacts. The jour¬ 
ney toward global water sustainability needs to include 
the consideration of climate change impact in the plan¬ 
ning and implementation over a long period of time. 
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Glossary 

Climate change Changes in the statistical distribution 
of weather over periods of time that range from 
decades to millions of years. It can be a change in 
the average weather or a change in the distribution 
of weather events around an average (e.g., greater or 
fewer extreme weather events). 

Emerging contaminants Synthetic or naturally occur¬ 
ring chemicals or microorganism that is not com¬ 
monly monitored in the environment but has the 
potential to enter the environment and cause 
known or suspected adverse ecological and (or) 
human health effects. 

Endocrine disrupting compounds Exogenous sub¬ 
stances that act like hormones in the endocrine 
system and disrupt the physiologic function of 
endogenous hormones. 

Potable water Water of sufficiently high quality that 
can be consumed or used without risk of immediate 
or long-term harm. 

Water quality The physical, chemical, and biological 
characteristics of water. It is a measure of the con¬ 
dition of water relative to the requirements of one 
or more biotic species and or to any human need or 
purpose. It is most frequently used by reference to a 
set of standards against which compliance can be 
assessed. 

Water reuse The practice of using wastewater or 
reclaimed water from one application for another 
application. The deliberate use of reclaimed water 
or wastewater must be in compliance with applica¬ 
ble rules for a beneficial purpose. 

Definition of the Subject 

Because of the rapid economic development and the 
subsequent population growth and urbanization, the 
freshwater supply in some regions of the world is 
experiencing a shortage due to increasing demand 
and decreasing replenishment. The water quality in 
these regions is often deteriorating as a result of heavy 
pollution and lack of adequate wastewater treatment. It 
is also realized that climate change and the consequent 
impact to drought and precipitation conditions may 
have severed this shortage. This has put pressure on 
water utilities to adopt new technologies to treat and 
deliver water resources that were previously not 
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considered to be amenable for such purpose, for exam¬ 
ple, desalination and water reclamation. For metropol¬ 
itan areas that are not in close vicinity of ocean, water 
reclamation and reuse has become an increasingly 
attractive option to supplement existing water 
resources and to meet consumer demand. However, 
recent research has suggested potential health risk asso¬ 
ciated with water reuse for increased concentrations of 
pollutants that are not effectively removed by treatment 
processes. Such concern is especially heightened with 
the discovery of a group of “emerging contaminants” 
that are previously unknown or not commonly moni¬ 
tored in the water environment. With the rising public 
awareness and the potential of regulation of these con¬ 
taminants by governmental agencies, research has been 
conducted to study the effects of anthropogenically 
driven processes, such as climate change, on the envi¬ 
ronmental concentrations of these compounds. This 
article is a review of such efforts. In addition, two case 
studies are discussed in detail to demonstrate the 
impacts of climate change on water use patterns and 
the subsequent effect on water quality. 

Introduction 

Population Growth/Urbanization 

Since the industrial revolution, the world’s urban pop¬ 
ulation has experienced significant growth, largely due 
to the growing demand for concentrated labors in 
factories and buildings. This growth is more significant 


during the twentieth century, during which time the 
world’s urban population increased more than tenfold, 
while rural population increased about twofold [1]. 
Today, half the world’s population lives in urban cen¬ 
ters, compared to less than 15% in 1,900 [1]. This 
growth trend is expected to continue and possibly 
accelerate in the foreseeable future (Fig. 1). However, 
the rate of growth varies greatly among different 
regions of the world. After the World War II, the 
increase of urban population happened mostly in 
Africa, Asia, and Latin America, while Europe and 
North America have seen more people moving out of 
the city centers (Fig. 2). This trend of urbanization, in 
addition to the increasing importance of large cities, 
was a result of the transformations of national econo¬ 
mies and employment patterns, as more employment 
opportunities were created in urban-based industry 
and services. Virtually all nations experienced an 
employment shift away from agriculture, and most of 
the growth in those working in industry and services 
were in urban areas. 

The trend toward more urbanized societies and the 
growing number of people living in large cities has very 
large implications for freshwater use and wastewater 
management. On a global scale, about 3,800 cubic 
kilometers of water per year are withdrawn to meet 
societal needs. The total available volume of renewable 
freshwater is several times more than is needed to 
sustain the current world population. However, popu¬ 
lation centers can access only about 31% of the 
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Estimated and projected urban population in the world (adapted from [1]) 
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Climate Change Impacts on Emerging Contaminants. Figure 2 

Comparative distribution of the world's urban population, 1950-2000 (adapted from [1]) 


renewable water because of geographical constraints 
and seasonal variations [2]. In addition, although agri¬ 
culture remains the largest user of freshwater resources, 
the water demands from city enterprises and con¬ 
sumers have become increasingly important. At the 
same time, there has also been a rapid growth in the 
contribution of city-based enterprises and consumers 
to water pollution. As a result, many major cities have 
had to draw freshwater from increasingly distant water¬ 
sheds, as local surface and groundwater sources no 
longer meet the demand for water, or as they become 
depleted or polluted [2]. 

The main water-related urban challenge in low- and 
middle-income nations is to provide adequate access to 
stable and safe water supply for drinking and sanita¬ 
tion. Currently, about one-third of the world s popula¬ 
tion lives in countries that face moderate to severe 
water shortages [3], In order to meet the need of 
increasing urban population over the next century, 
the quantity of available water must increase by 
25-60% from current level. However, as indicated by 
United Nations, urban populations have grown more 
rapidly than the capacity of developing countries to 
manage them, including putting into place institu¬ 
tional structures to ensure good provision for water, 
sanitation, and wastewater management within each 
locality [4]. This discrepancy will only become more 


significant with the continuing process of industriali¬ 
zation and urbanization in these countries. 

Climate Change 

There is empirical evidence of impacts on water 
resources from global warming. Land-based and 
mountain glaciers have generally experienced a world¬ 
wide retreat and thinning during the last century. Nota¬ 
bly, glacier melting has considerably accelerated on 
a global basis during recent years [5, 6]. The mean 
mass balance decrease that took place during the period 
1990-1999 was three times greater than that of the 
previous decade [7]. The Intergovernmental Panel on 
Climate Change (IPCC), in cooperation with new part¬ 
ners, has begun to address this issue in addition to their 
more traditional focus on greenhouse gases and tem¬ 
perature changes [8]. 

On the other hand, global warming should also 
enhance extreme events like floods and droughts 
[9-14]. However, the increasing occurrence and 
magnitude of drought periods associated with intense 
evaporation would favor water scarcity and hydraulic 
stress. Consequently, the lower water renewal (through 
melting glacier or rainy periods) due to climate change 
directly jeopardizes freshwater availability, particularly 
when water withdrawal (through irrigation, industrial 
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or domestic use) is simultaneously increased by 
anthropogenic activities. 

Global weather patterns are major controlling fac¬ 
tors in the climate and water balance for both south¬ 
west United States and northeast Australia. In 
southwest United States, one of the major factors in 
determining the water balance is total winter precipi¬ 
tation that falls as snow in the Rocky Mountains and 
the subsequent meteorological factors that influence 
the timing and rate of snowmelt. The subsequent run¬ 
off has a direct influence on water yield to the Colorado 
River, which directly or indirectly supplies water 
to meet potable or irrigation needs for more than 
30 million people. In contrast the water budget for 
northeast Australia, specifically the subtropical region 
of southeast Queensland is determined by a combina¬ 
tion of the global influence of the Southern Oscillation 
Index (El Nino/La Nina - ENSO) and more regional 
factors that influence the frequency and magnitude of 
precipitation in the coastal catchments (e.g., orographic 
rainfall as well as other influences including sea breezes, 
land use/vegetation cover influencing transpiration, 
and the availability of moisture in the atmosphere). 

A number of studies over the last 20 years have 
suggested that there will be a decrease in runoff over 
the Southwestern United States because of global 
warming [15-18]. The Colorado River will be greatly 
impacted by this decrease as most of its water comes 
from snowmelt and subsequent runoff. The decrease 


Lake Mead 



will be caused by increasing temperatures, evaporation, 
and decreasing precipitation. The statistical/empirical 
studies, as well as climate model studies of the last few 
years all show a decrease in runoff to the Colorado 
River [19-22]. The estimates of runoff reduction from 
these studies are remarkably similar, and range between 
10% and 30% over the next 30-50 years (Fig. 3). Given 
the agreement about both size and timing of runoff 
reduction, it is important to examine its impact on 
regional population in terms of water supply and 
water quality. In its recent report on Colorado River 
Basin water management, the National Academy of 
Sciences notes future potential problems with availabil¬ 
ity of water in the region. It calls for a comprehensive 
analysis of water needs and uses in the region. 

Southeast Queensland has a subtropical coastal cli¬ 
mate that is influenced by occasional monsoonal 
weather patterns, resulting in cyclones and tropical 
rain depressions providing large amounts of precipita¬ 
tion over several days or weeks. However, the climate is 
generally dominated by regional influences associated 
with weather patterns that provide southeast winds and 
sea breezes in summer and lighter westerly winds in 
winter. This results in winters being predominantly dry 
periods with the majority of precipitation being deliv¬ 
ered in summer storms with hot dry weather periods 
between storms. Although there has always been large 
inter annual variability in precipitation, the water sup¬ 
ply for this region has always relied on its storage 
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Total reservoir storage in Lakes Mead and Powell (million acre feet) as a function of lake surface elevation above mean sea 
level (feet) (adapted from [80]) 






2082 


Climate Change Impacts on Emerging Contaminants 



capacity being large enough to meet demand between 
the large tropical storms, which usually return the 
reservoirs to full supply. 

Climate change predictions for southeast Queens¬ 
land only predict a small net reduction in total annual 
average precipitation. However, the pattern of delivery 
for this precipitation is likely to have a large impact on 
both total water yield from the catchments and trans¬ 
port of emerging contaminants. The impact of climate 
change is predicted to increase the period between large 
storms, with longer more severe droughts between 
events as well as increase the severity of the tropical 
storms when they do occur. This will potentially result 
in more mobilization of pollutants, including emerging 
contaminants. 

The combined effect of changes in the frequency 
and magnitude of large storms and precipitation events 
in general is expected to reduce the total water yield to 
reservoirs, but also increase the pollutant loads deliv¬ 
ered to reservoirs during large events. The lack of any 
large precipitation/flow events between 2000 and 2008 
and long periods of rainless days between small precip¬ 
itation events resulted in the worst drought in the last 
century and very low soil moisture readings in the 
catchments. 

Water Reuse 

The need for a dependable water supply is crucial, 
particularly for large cities with limited water resources 
in the surrounding area. However, as the population 
continues to grow, coupled with the impact to existing 
water supply by climate change, more and more cities 
are in need of supplemental sources of water. New 
sources may include water transfer from neighboring 
regions, internal water sources developed by optimiz¬ 
ing the city’s water use, water conservation, and water 
reclamation and reuse. San Diego County in California 
is an example of projected changes in the distribution 
of water sources in an urban setting. Currently, about 
90% of the municipal water supply is obtained from 
a wholesale water provider. However, water demands 
are projected to increase by 50%, as a result of growing 
development and urbanization [23]. Increased use of 
reclaimed water and desalination of seawater will have 
to make up the difference. At inland locations or where 
desalination would cost too much, reclaimed water 


may contribute even more significantly to the overall 
water supply. It may be used for irrigation or industry, 
and it can replenish groundwater and surface water 
resources. 

As the size of cities and their population expand, 
the quantity of wastewater increases proportionally. 
Wastewater treatment has gained much advance during 
twentieth century due to rapid growth of urban popu¬ 
lation and the consequent needs for sanitation and 
clean environment. Historically, wastewater was col¬ 
lected from industrial and residential discharge and 
went through a series of physical, chemical, and bio¬ 
logical treatment before discharging into receiving 
water or for land disposal. However, over the past few 
decades, people have recognized the potential for 
recovering and reusing water from wastewater. The 
advances in wastewater treatment technology, such as 
membrane filtration, also helped to make this potential 
a reality by providing end water with better quality. 
These processes can now effectively remove biodegrad¬ 
able material, nutrients, and pathogens so the treated 
waters have a wide range of potential applications 
(Table 1). 

Over the past 2 decades, the amount of municipal 
wastewater recovered for reuse has increased through¬ 
out the world [24]. The technique of water reuse is 
being applied in many countries including the United 
States, Mexico, Germany, Mediterranean and Middle 
Eastern countries, South Africa, Australia, Japan, 
China, and Singapore. The primary incentives for 
implementing water reuse are augmentation of water 
supplies and/or pollution abatement. Water reuse can 
be categorized as “direct” or “indirect,” depending on 
whether the reclaimed water is used directly or mixed 
with other sources. On an international scale, direct 
non-potable water reuse is currently the dominant 
mode for supplementing public water supplies for irri¬ 
gation, industrial cooling water, river flow augmenta¬ 
tion, and other applications [25]. 

On the other hand, unplanned, indirect potable 
water reuse, through wastewater effluent disposal to 
streams, rivers, and groundwater basins, has been an 
accepted practice around the world for centuries. 
Communities situated at the end of major waterways 
have long histories of producing potable water from 
river water sources that have circulated through mul¬ 
tiple cycles of withdrawal, treatment, and discharge. 
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Table 1 Potential application for reclaimed water 


Application settings 

Examples 

Urban use 

Unrestricted 

Landscape irrigation (parks, 
playgrounds, school yards), fire 
protection, construction, 
in-building uses (toilets, air 
conditioning) 

Restricted 

Irrigation of areas where public 
access is infrequent and 
controlled (golf courses, 
cemeteries, residential, 
greenbelts) 

Agriculture irrigation 

Food crops 

Crops grown for human 
consumption and consumed 
uncooked 

Nonfood crops 

Fodder, fiber, seed crops, 
pastures, commercial nurseries, 
sod farms, commercial 
aquaculture 

Recreation use 

Unrestricted 

No limitations on body contact 
(lakes and ponds used for 
swimming, snowmaking) 

Restricted 

Fishing, boating, and other 
noncontact recreational 
activities 

Environmental use 

Artificial wetlands, enhanced 
natural wetlands, and sustained 
stream flows 

Groundwater 

recharge 

Groundwater replenishment, 
saltwater intrusion control, and 
subsidence control 

Industrial reuse 

Cooling system makeup water, 
process waters, boiler feed 
water, construction activities, 
and washdown waters 

Potable reuse 

Blending with municipal water 
supply (surface water or 
groundwater) 


Similarly, riverbeds or percolation ponds may recharge 
underlying groundwater aquifers with wastewater- 
dominated water, which, in turn, is withdrawn by 
down-gradient communities for domestic water 


supplies. The safeguards for this unplanned indirect 
potable reuse are often carried out by advanced water 
treatment technologies. Planned indirect water reuse 
involves linking the discharge of treated wastewater 
with potential downstream water uses. For example in 
Los Angeles County, Calif., a groundwater recharge 
program has reclaimed water for indirect potable use 
since 1962 [26, 27], and in northern Virginia, the 
discharge of highly treated municipal wastewater by 
the Upper Occoquan Sewage Authority to the upper 
reaches of the Occoquan Reservoir has been in effect 
since 1978. Virginia’s reservoir serves as the principal 
drinking water supply source for approximately one 
million local residents [28]. 

Recent documents from World Health Organiza¬ 
tion (WHO) [29] and the United States Environmental 
Protection Agency (EPA) [30] address the state-of-the 
art aspects and future trends in water use, both of 
which predict increased development and use of the 
above-mentioned practice to augment water supply 
sources in order to meet demands. The WHO guide¬ 
lines for wastewater reuse first published in 1995 are 
being updated with a planned release date of 2006 [29] . 
According to water reuse surveys, the best water reuse 
projects in terms of economic viability and public 
acceptance are those that substitute reclaimed water 
in lieu of potable water for use in irrigation, environ¬ 
mental restoration, cleaning, toilet flushing, and indus¬ 
trial uses [31, 32]. 

Water Quality 

While most of the studies have addressed the impact of 
climate change on water availability, recent research 
programs tend to focus on water quality. Indeed, the 
declining quality of surface and groundwater resources 
could imply the declining quality of drinking water 
with potential risk for human health [11]. In this con¬ 
text, the direct and indirect impact of climate change 
on emerging contaminants is particularly considered. 

Water Reuse and Emerging Contaminants The 

term emerging contaminants refers to a group of het¬ 
erogeneous substances that have been recently detected 
in the environment [33, 34]. They usually enclose 
pharmaceuticals and personal care products (PPCPs), 
hormones, endocrine disrupting compounds (EDCs), 
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flame retardants and algal toxins. However, the word 
“emerging” indicates that these substances are recently 
monitored but it does not necessarily imply that their 
occurrence in water is new [33, 35]. For example, some 
emerging substances occur in water for decades but 
they could not be detected in the past due to the 
technological limits. 

Although reclamation and reuse of wastewater is 
beneficial for communities to supplement existing 
surface or ground water supply, the increasing public 
awareness and regulation over water quality pose great 
challenge to the water reuse industry. Over the last half 
of the twentieth century, the increasing availability of 
chemical products for industrial, medical, and house¬ 
hold uses resulted in subtle but consistent changes in 
the characteristics of the wastewater that is produced 
in urban environments. As a result, pharmaceuticals 
and endocrine disrupting compounds (EDCs) have 
been detected in many water bodies around the world 
and are now considered ubiquitous wastewater con¬ 
taminants [36, 37]. Undoubtedly, the major contri¬ 
butor of such widespread contamination is municipal 
wastewater discharge. In many cases, surface water with 
some degree of wastewater influence is used as source 
waters for drinking water treatment facilities leading to 
the presence of these compounds in finished drinking 
water. Thus, the propensity for surface water or drink¬ 
ing water contamination could grow with the increase 
of water reuse applications, unless appropriate controls 
or planned water supply systems are implemented. 

Snyder et al. [38, 39] surveyed the occurrence of 
36 pharmaceuticals and EDCs in the source and fin¬ 
ished drinking water of 20 US DWTPs (Table 2). Twelve 
compounds were detected in at least 50% of the source 
water samples: atrazine, caffeine, carbamazepine, 
DEET, gemfibrozil, ibuprofen, iopromide, meprobam¬ 
ate, naproxen, phenytoin, sulfamethoxazole, and TCEP. 
Median concentrations of detected pharmaceuticals 
and EDCs in source drinking water were low and usu¬ 
ally less than 10 ng/L, except for atrazine (28 ng/L), 
caffeine (27 ng/L), fluorene (13 ng/L), galaxolide 
(28 ng/L), metalochlor (15 ng/L), musk ketone 
(16 ng/L), and TCEP (13 ng/L). It is important to 
note that median concentrations of fluorene, 
galaxolide, and musk ketone were biased by low fre¬ 
quencies of detection. Eight compounds were detected 
in at least 50% of the finished drinking water samples: 


atrazine, caffeine, carbamazepine, DEET, ibuprofen, 
iopromide, meprobamate, and phenytoin. Median 
concentrations of detected pharmaceuticals and EDCs 
in finished drinking water were also less than 10 ng/L, 
except for atrazine (29 ng/L), caffeine (23 ng/L), 
metalochor (86 ng/L), and triclosan (43 ng/L). Again, 
median concentrations of metalochlor and triclosan 
were biased by low frequencies of detection. 

Recent studies in environmental toxicology and 
pharmacology have suggested that long-term health 
risks may be associated with chronic exposure to trace 
chemical compounds, such as PPCPs, pesticides, and 
disinfection by-products (DBPs) in reclaimed water 
[36, 40-42]. Recently, many of these compounds have 
been classified as endocrine disrupters that can inter¬ 
fere with the functioning of natural hormones. Expo¬ 
sure to endocrine disrupters can impact the growth of 
amphibians, fish, and other wildlife at levels as low as 
parts per trillion [43]. 

Understanding the prevalence of these compounds 
and how to control them in the context of wastewater 
treatment is an active area of research. A number of 
articles have investigated the fate of trace hormones 
and pharmaceuticals through water treatment pro¬ 
cesses [38, 39, 44-55]. The ability of a particular treat¬ 
ment process to remove organic contaminants depends 
mostly on the structure and concentration of the 
contaminant. Chemical oxidation (during drinking 
water applications), biological removal/transformation 
(during wastewater applications), or nanofiltration/ 
reverse osmosis are the water treatment technologies 
most responsible for pharmaceutical and EDC 
removal, though the operational parameters of the 
process (e.g., oxidant dose and contact time) will also 
determine the degree of attenuation of a particular 
contaminant. 

Climate Change and Harmful Algal Blooms With 
optimal environmental conditions such as nutrient 
concentration and temperature, the photosynthetic 
biomass contained in surface water tends to multiply 
faster. In fact, spring streaming usually reloads water 
resources with nutrients (N and P), then summer pro¬ 
vides favorable water temperature (20-25°C) and light 
intensity for an optimal photosynthesis. Consequently, 
most of algae and cyanobacteria (also known as blue- 
green algae) reach a maximum growth rate. In some 
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Climate Change Impacts on Emerging Contaminants. Table 2 Concentrations of pharmaceuticals and EDCs in the US 
source and finished drinking water (adapted from [38, 39]) 


Source (n = 20) Finished (n = 20) 



MRL (ng/l) 

Max. 

Med. 

# 

Max. 

Med. 

# 

Acetaminophen 

1.0 

9.5 

1.6 

7 

< MRL 

< MRL 

- 

Androstenedione 

1.0 

1.9 

1.9 

1 

< MRL 

< MRL 


Atrazine 

1.0 

570 

28 

17 

430 

29 

15 

Caffeine 

1.0 

87 

27 

14 

83 

23 

12 

Carbamazepine 

1.0 

39 

3.1 

18 

5.7 

2.8 

11 

DEET 

1.0 

28 

6.9 

20 

30 

5.1 

18 

Erythromycin 

1.0 

3.5 

2.2 

8 

1.3 

1.3 

1 

Estrone 

1.0 

1.4 

1.2 

2 

2.3 

1.7 

2 

Fluorene 

1.0 

13 

13 

1 

< MRL 

< MRL 

- 

Galaxolide 

1.0 

30 

28 

3 

< MRL 

< MRL 

- 

Gemfibrozil 

1.0 

11 

4.8 

13 

6.5 

4.2 

5 

Hydrocodone 

1.0 

1.9 

1.9 

1 

< MRL 

< MRL 

- 

Ibuprofen 

1.0 

24 

4.2 

16 

32 

3.8 

13 

lopromide 

1.0 

46 

7.6 

14 

31 

6.5 

13 

Meprobamate 

1.0 

16 

5.9 

16 

13 

3.8 

15 

Metalochlor 

1.0 

170 

15 

7 

160 

86 

4 

Musk ketone 

1.0 

17 

16 

3 

17 

17 

1 

Naproxen 

1.0 

16 

2.2 

10 

8 

8 

1 

Oxybenzone 

1.0 

7.4 

2.9 

4 

1.1 

1.1 

1 

Phenytoin 

1.0 

13 

3.2 

18 

6.7 

2.3 

14 

Progesterone 

1.0 

1.1 

1.1 

1 

1.1 

1.1 

2 

Sulfamethoxazole 

1.0 

44 

8.1 

17 

< MRL 

< MRL 

- 

TCEP 

- 

66 

13 

15 

19 

5.5 

7 

Triclosan 

1.0 

30 

1.9 

6 

43 

43 

1 

Trimethoprim 

1.0 

2.3 

2.2 

3 

1.3 

1.3 

1 


cases their excessive growth leads to the appearance of a 
thin green layer at the water surface, a phenomenon 
called bloom and illustrated in Fig. 4a. 

Nowadays, the occurrence of cyanobacterial 
blooms in surface water is a growing environmental 
and public health concern. Indeed, some blooms can be 
potentially harmful according to the prevailing species. 
For example some cyanobacteria can perform the 
biosynthesis of a wide range of substances called 


cyanotoxins. Based on their toxicological properties, 
these ones are usually considered as hepatotoxins 
(acting on the liver), neurotoxins (acting on the ner¬ 
vous system) and dermatotoxins (acting on the skin). 
Humans are potentially exposed to cyanotoxins 
through recreational waters but also through drinking 
water produced using a contaminated resource. For 
instance, several intoxications have been reported 
after exposure to the widespread hepatotoxin 
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b Microcystin-LR c Cylindrospermopsin 


Climate Change Impacts on Emerging Contaminants. Figure 4 

Occurrence of algal bloom in a french pond (a) and structure of 2 cyanotoxins (b and c) 


microcystin-LR illustrated in Fig. 4b. Most of them 
were limited to gastro-enteritis [56], but more than 
50 patients died in the Brazilian dialysis center of 
Caruaru [57-59]. 

A few studies indicate that climate change should 
directly increase the frequency and magnitude of 
cyanobacterial blooms [60-62]. On the one hand, 
global warming would allow the bloom to occur 
sooner. However, the blooms are more intense 
when they occur sooner, as indicated in Fig. 5 [62]. 
On the other hand, global warming would allow the 
bloom to collapse later. Consequently, the amount of 
cyanobacteria in water should be higher the year after. 
However, the blooms are more intense when the initial 
number of cells is high, as indicated in Fig. 5. In addi¬ 
tion, one could also suppose that the increasing floods 
associated with climate change should bring higher 


amount of nutrients in surface waters, which would 
also favor the intensity of algal blooms. On the con¬ 
trary, the statistical data are not yet sufficient to con¬ 
clude on the impact of climate change on the frequency 
of blooms. This could be attributed to the lack of 
specific database for algal blooms monitoring. Indeed, 
such phenomenon is not necessarily reported to the 
administration in charge of the environment, and their 
management strongly varies according to the country. 

Climate change may also influence the distribu¬ 
tion of cyanotoxins. For example, the hepatotoxin 
cylindrospermopsin illustrated in Fig. 4c was mostly 
reported in Australia [63-65], New Zealand [66], 
Thailand [67, 68], and in southern states of the USA, 
like Florida [69] . Therefore, it has long been considered 
a tropical toxin [70]. However, since a few years, 
cylindrospermopsin has been characterized in 
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Climate Change Impacts on Emerging Contaminants. Figure 5 

Impact of climate change on harmful algal blooms (adapted from [62]) 


Julian day 
Bloom More Intense 


temperate countries, such as Germany [70-73] and 
France [74, 75], and even in boreal environments, like 
in Finland [76]. One possible explanation for this 
change in cylindrospermopsin distribution might be 
the global warming but there is not yet scientific evi¬ 
dence to confirm this hypothesis. 

Case Studies 

Two reservoirs within subtropical zone were chosen to 
study the impact of climate change on emerging con¬ 
taminants, where recycled wastewater is or will be used 
to supplement the water supply: Lake Mead in south¬ 
west United States and Lake Wivenhoe in northeast 
Australia. Since these water supplies provide raw drink¬ 
ing water to millions of people, there is a need to 
understand the various contaminant loads received by 
each reservoir - to ensure public health and public 
acceptance. The main impacts that require quantifica¬ 
tion and management are: (1) changes in water chem¬ 
istry that can be attributed to an increase in the 
proportion of recycled water relative to natural surface 


water flow; and (2) processes that mobilize pollutants 
during extreme storm events. Climate change has the 
potential to affect both of these scenarios. 

Lake Mead, United States 

Lake Mead is a man-made reservoir residing on the 
Lower Colorado River, about 25 miles from Las Vegas, 
Nevada, a major metropolitan area in southwest United 
States. It was established by the construction of Hoover 
Dam in 1920s. Lake Mead and the lower Colorado 
River serve the water needs for more than 30 million 
Americans, or roughly 10% of the US population [77]. 
Las Vegas has been one of the fastest growing US cities 
over the last decade, and its current population is 
approximately two million. Yet, Las Vegas is located in 
one of the most arid climates on earth, with only 8 cm 
of annual rainfall and more than 2 m of evaporation 
per annum. Lake Mead serves as both the source for the 
area’s drinking water and the sink for its wastewater. 
Approximately 900 million gallons per day (MGD) of 
water is withdrawn during peak summer months for 
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potable use, industrial use, and irrigation. Approxi¬ 
mately half of that water is returned to Lake Mead as 
treated wastewater via the effluent-dominated Las Vegas 
Wash (the remainder is lost to groundwater or evapora¬ 
tion). Though the flow of the Wash is small, historically 
averaging just 1.4% of the total flow into Lake Mead, it 
mainly consists of (85%) treated wastewater. There are 
no other major urban areas discharging wastewater into 
the Colorado River or Lake Mead. 

During the twentieth century, the average flow of 
the Colorado River dropped by a factor of 2, from 
approximately 1.9 x 1,013 L (15 million acre-ft) in 
1900, to approximately 8.6 x 1,012 L (7 million acre-ft) 
in 2000 (USGS 2004). Barnett et al. [78] concluded that 
60% of the observable climate-related trends (includ¬ 
ing snow-pack on the western Rocky Mountains, which 
serves as the source of the Colorado River) between 
1950 and 1999 were attributed to human-induced cli¬ 
mate change. Moreover, researchers predict that this 
trend will continue in the coming decades further 
straining water resources in the American Southwest 
[79-81]. The link between insufficient water supplies 
and slowing patterns of surface water flow partially 
attributed to climate change has been observed in the 
US Great Plains [82]. 

In 1996, the US Geological Survey issued the first 
report in the United States indicated that fish below 



wastewater outfalls showed evidence of impact to estro¬ 
genic compounds [83]. Since 1997, extensive monitor¬ 
ing of Lake Mead for pharmaceuticals and suspected 
EDCs has been underway [37, 84-86]. 

In 2000, the Colorado River Basin began to experi¬ 
ence prolonged drought. Between 2000 and 2008, the 
volume of Lake Mead dropped from 100% to less than 
50% its full capacity, and the lake had lost more than 
35 m in elevation. Figure 6 shows the Hoover Dam at 
full capacity as compared to the dam today highlight¬ 
ing the loss of lake elevation. During this time, the flow 
of the Las Vegas Wash and the concentrations of phar¬ 
maceuticals and EDCs within the Wash did not signif¬ 
icantly change [87]. The net effect was that there was 
less dilution of wastewater and an apparent increase in 
concentrations of wastewater contaminants (including 
atrazine, carbamazepine, DEET, diazepam, diclofenac, 
dilantin, estradiol, estrone, ethynylestradiol, fluoxetine, 
gemfibrozil, meprobamate, naproxen, progesterone, 
sulfamethoxazole, testosterone, and trimethoprim) in 
source drinking water (Fig. 7). 

Similar trends are observed for both nitrate and 
conductivity. Nitrate concentrations in the Las Vegas 
Wash did not change at a rate statistically different 
from zero (p = 0.4) between January 1, 2003 and 
December 31, 2007. Nitrate concentrations decreased 
in the Colorado River at a rate of 0.00007 mg L -1 d -1 



Climate Change Impacts on Emerging Contaminants. Figure 6 

Photographs of Lake Mead at full capacity in 1983 and again following drought in 2007 - Lake Mead was at full capacity as 
recently as 2000 (adapted from [87]) 
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Climate Change Impacts on Emerging Contaminants. Figure 7 

Summed concentrations of seventeen pharmaceuticals, endocrine disrupting compounds (EDCs), and other 
wastewater contaminants in Las Vegas' source drinking water as well as the volume of Lake Mead between March 2003 
and December 2007 (adapted from [87]) 


(p < 0.0001), yet increased in source drinking water at 
a rate of 0.0002 mg L -1 d -1 (p < 0.0001) over this same 
time period. Conductivity in the Las Vegas Wash did 
not change at a rate statistically different from zero 
(p = 0.03) between January 1, 2003 and December 31, 
2007. Yet, conductivity decreased in the Colorado River 
at a rate of 0.04 pS cm -1 d -1 (p < 0.0001) and increased 
in source drinking water at a rate of 0.05 pS cm -1 d -1 
(p < 0.0001) over this same time period. As was the 
case with concentrations of pharmaceuticals and EDCs, 
the increased loading associated with the slight increase 
in flow of the Las Vegas Wash over this time period does 
not alone account for the concomitant increase in 
nitrate and conductivity in source drinking water. 
This is especially true when one considers that histor¬ 
ically, source drinking water is largely composed of 
water from the Colorado River [77], where concentra¬ 
tions of nitrate and conductivity actually decreased 
over this time period. Thus, the discrepancy between 
loading and source water concentration is not 
explained by a hydrological explanation of a system at 
steady state. The increase in pharmaceutical and EDC 


concentrations, nitrate concentrations, and conductiv¬ 
ity in source drinking water is due to the decline in 
volume of Lake Mead between January 1, 2003 and 
December 31, 2007. During this time, the volume of 
Lake Mead declined from approximately 65% capacity, 
to almost 50% capacity (Fig. 6) at an average rate of 
0.00007 d -1 (p < 0.0001). 

Lake Wivenhoe, Australia 

Lake Wivenhoe is the major drinking water supply for 
over a million people in the city of Brisbane, in sub¬ 
tropical Queensland, Australia. Unlike Lake Mead, 
which receives the majority of its water from snowmelt 
from the Rocky Mountains, Lake Wivenhoe receives all 
its water from precipitation in its surrounding catch¬ 
ments (5,400 km 2 ). Lake Wivenhoe catchments have 
a high proportion of agriculture, resulting in significant 
loads of pollutants into the lake during the large and 
highly episodic inflow events [88] . The reservoir itself is 
a eutrophic system with regular blooms of toxic algal 
species, in particular the cyanobacterium Cylindros- 
permopsis raciborskii [89]. Other land use in the 
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catchments includes both urban and industrial activi¬ 
ties. In recent years, considerable research effort has 
been focused on characterizing the occurrence of trace 
organic contaminants in the lake and its tributaries 
prior to the introduction of recycled water. 

A recent drought, described as the worst on record 
and a rapidly growing population in southeast Queens¬ 
land, Australia, have been placing pressure on surface 
water supplies. This region is the fastest growing in 
Australia with the population forecast to more than 
triple by 2026 with commercial and industrial demand 
for water also increasing [90]. The region has few 
accessible groundwater resources and until recently, 
has relied on surface reservoirs to store water for pota¬ 
ble treatment. The largest such reservoir is Lake 
Wivenhoe which was formed by damming the Brisbane 
River and is fed by the surrounding watershed as well as 
the Somerset Dam on an upstream tributary. The 
storage capacity of this reservoir is 1.2 x 109 m 3 and 
the major potable water treatment plant in the area 
is located on the Brisbane River downstream of 
Wivenhoe Dam [91]. 

There is now little scope for further large dams in 
the region, and recently a 1.25 x 105 m 3 /d capacity 
desalination plant was commissioned as an additional 
climate independent source of potable water [90]. 
A further initiative to address the potable water situa¬ 
tion in the region is the construction of a water recla¬ 
mation project that will be the largest in the Southern 
Hemisphere and the third largest in the world once 
completed [91]. This project takes treated wastewater 
from six existing wastewater treatment plants in the 
South-East Queensland area. These plants all practice 
biological nutrient removal and deliver disinfected water 
of secondary standard to advanced water treatment 
plants (AWTPs) [90, 92]. The AWTPs adopt a dual 
membrane approach, using micro filtration and reverse 
osmosis, followed by advanced oxidation (H 2 0 2 /UV) 
and finally stabilization and disinfection with chlorine. 

A monitoring campaign of the reclaimed water 
produced by the AWTPs was carried out from May to 
November 2008. It was undertaken for disinfection by¬ 
products (DBPs), pharmaceuticals and personal-care 
products (PPCPs), and pesticides together with other 
organic micropollutants as well as inorganic and 
microbial analytes that were detected in source 
wastewaters [92]. Over 300 chemicals were monitored 


for in secondary treated effluent with 114 organic 
chemicals detected. On this basis, 114 chemicals were 
monitored in AWTP effluent, with 99 not detected and 
15 chemicals detected and quantified at least once. 
Of the detected compounds, bromodichloromethane, 
dibromochloromethane, chloroform, dichoroacetic 
acid, and NDMA (N-Nitrosodimethylamine) are cate¬ 
gorized as DBPs. Paracetamol (N-(4-hydroxyphenyl) 
ethanamide), salicylic acid, and DEET (N,N-Diethyl-m- 
toluamide) are categorized as PPCPs. Triclopyr ([(3,5,6- 
Trichloro-2-pyridinyl)oxy] acetic acid) and dalapon 
(2,2-dichloropropionic acid) are pesticides (subclass 
herbicides), and cholesterol and caffeine are consumer 
products categorized under “others”. The remaining 
detected compounds, 4-t-octylphenol, bisphenol A, and 
nonylphenol are regarded as good indicators of industrial 
contribution to wastewater [92]. Due to their weak 
estrogen-like activity, they are considered an environ¬ 
mental hazard. 

Results showed the sporadic occurrence of the 
herbicides atrazine and simazine, triclopyr, 2,4- 
dichlorophenoxyacetic acid (2,4-D) and Dichlorprop 
(2,4 DP), and PPCPs such as caffeine, DEET, salicylic 
acid, and carbamazepine (Table 3). Industrial organics 
included 4-t-octylphenol, nonylphenol, and bisphenol A. 
Typically water concentrations were close to the 
detection limit of 10 ng/L. Possible sources of these 
chemicals in the catchments include urban sources 
(e.g., WWTPs, unsewered homes, stormwater runoff, 
landfills) and agricultural activities (e.g., weed control). 

A wider range of chemicals were detected in samples 
from flow events with water concentrations typically 
exceeded levels measured in the lake. Bisphenol A was 
typically detected at the highest concentrations at 
most sites with water concentrations ranging up to 
>1,400 ng/L at Cooyar Creek and over half the sites 
exceeding 500 ng/L. The water concentrations of 4-t- 
octylphenol ranged from 28 ng/L to 500 ng/L. The 
highest concentrations were recorded at O’Reilly’s 
Weir and Cooyar Creek. Caffeine, DEET, and salicylic 
acid were detected in at least half of the samples, typ¬ 
ically between 10 and 30 ng/L. On some occasions 
DEET was measured at higher concentrations, particu¬ 
larly in Cooyar Creek (290 ng/L) and Emu Creek 
(200 ng/L). Ibuprofen was measured at the high water 
concentrations in Coal and Esk Creeks (2,280 ng/L and 
2,270 ng/L, respectively). Herbicide profiles were 
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dominated by different chemicals depending on where 
the sample was collected 2,4-Dichlorophenoxyacetic 
acid (2,4-D) and 2,4-Dichlorophenoxybutyric acid 
(2,4-DB) were measured in a sample from the 
O’Reilly’s Weir at 1,000 ng/L each. Hexazinone was 
measured at 670 ng/L in a sample from Cooyar Creek 
and tebuthiuron was measured at 130 ng/L in Capt. 
Logan’s Creek. 

Overall, the monitoring of chemicals in flow events 
has shown that water concentrations in tributaries dur¬ 
ing such events can exceed levels in the lake by ten times 
or more. Clearly these inflows can have impacts on 
water quality, although dilution and attenuation of 
chemicals are likely to occur. For example, ibuprofen 
was not detected in samples collected from the lake 
in May 2008, after flow event monitoring detected at 
the chemical at concentrations exceeding 1 ug/L. 
This chemical is known to degrade relatively quickly 
in natural water systems. 

There was a significant rain event in May 2009, the 
bulk of which had fallen between May 19 and 21. The 
net result was that Lake Wivenhoe capacity rose from 
43% (5.25 x 10 5 ML) on May 18 to 63% (7.32 x 
10 5 ML) on May 27. Monitoring flow events of lower 
magnitude in early 2008 had shown that significant 
chemical loads could enter the lake from different 
tributaries hence it was possible that flow events of 
the scale that occurred in May 2009 could impact on 
lake water quality. 

Sampling was undertaken at three sites in Lake 
Wivenhoe on May 27, 2009. Samples were subjected 
to the same analysis as previously. Interestingly, the 
occurrence of herbicides and PPCPs remained sporadic 
and in most cases below detection limits. Levels of the 
industrial organics remained in the range detected dur¬ 
ing sampling from earlier in 2009 and no hormones 
were detected. These results indicated that even though 
flow events are capable of delivering a load of contam¬ 
inants to the lake, the combined effects of dilution and 
attenuation appear to result in minimal impact on 
water quality. 

Future Directions 

With increasing demand from population growth and 
dwindling availability due to urbanization and pollu¬ 
tion, water has become a more precious and limited 


resource in many areas of the world. For the past few 
decades, water industry has been searching for new 
means of producing fresh water, such as direct and 
indirect reuse of wastewater, to supplement existing 
and traditional sources. The advances in water treat¬ 
ment technologies, together with more efficient water 
management practices, have greatly improved reused 
water quality and expanded the acceptance and adop¬ 
tion of water reclamation and reuse across the industry. 
However, research has suggested that there is increasing 
concern regarding the safety of reused water to human 
and aquatic species, because of the detection of some 
emerging contaminants in water systems receiving 
wastewater discharge. Although the concentrations of 
these contaminants are usually low, there are evidences 
that they can cause adverse health conditions to popu¬ 
lation after long-term consumption. The increasing 
public awareness has pressured governmental agencies 
to consider regulating these contaminants, which in 
turn poses great challenges to water treatment facilities. 

Climate change as a result of human activities since 
industrial revolution has been suggested to associate 
with the changes in global water systems through 
a number of influencing factors, such as glacier melt¬ 
ing, changes in precipitation, and evaporation patterns. 
The impact of climate change to local water resources is 
especially significant for arid regions that depend on 
large water reservoirs to regulate and balance seasonal 
water supply. The intake water quality in local water 
treatment plants is greatly affected by wastewater dis¬ 
charge and runoff from surrounding catchments, 
which can be impacted by changes in precipitation 
and upstream water volume. More research is needed 
to understand the long-term effect of climate change 
on water quality, including the levels of regulated and 
emerging contaminants. Such knowledge will help 
water industry to prepare and make long-term plans 
to address water safety and meet future regulations. 
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In the early 1980s, Eugene F. Stoermer began informally 
using the term “anthropocene” to refer to a time in the 
history of the Earth in which humans introduce 
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impacts that at times rival natural processes in the 
global functioning of the planet [7]. While there is 
some uncertainty about when the anthropocene 
began (with some dating it as far back as 8,000 years 
ago), there is little doubt that humans are affecting the 
planet today, and one of the ways is through our impact 
on the Earth’s climate. 

Concern about the possible impact of mankind on 
climate dates back at least as far as 1896, when Svante 
Arrhenius [1] suggested that mankind might inadver¬ 
tently warm the planet by increasing the concentration 
of carbon dioxide, and the idea that man might have an 
impact on the Earth’s climate has been studied ever 
since. Concern about the matter has been increasing 
since the 1970s, and in 1995, a very large group of 
climate scientists, in an assessment produced by the 
Intergovernmental Panel on Climate Change [2], 
wrote that “ the balance of evidence suggests a discernible 
human influence on global climate” That statement, 
based on a careful evaluation of observational evidence, 
theory, and model studies, ignited a firestorm of com¬ 
ment and controversy. There is little doubt that the vast 
majority of climate scientists were concerned then, and 
study since that time has made the arguments that man 
is affecting climate even more compelling. In February 
of 2007, the IPCC released its fourth assessment [3], 
which found that u human actions are c very likely ’ the 
cause of global warming,” with “very likely” specifically 
defined to mean a 90% or greater probability of 
explaining an increase in average global temperatures 
over the last 100 years, with many other attendant 
changes. Some debate continues about these conclu¬ 
sions to this day, with a few scientists and more non¬ 
scientists indicating doubt that mankind is responsible 
for the changes, but almost no doubt that the warming 
is real. 

There are many reasons for skepticism about 
climate change: some scientific, some not. It is not my 
intention that this section provides much information 
about climate change itself. The assessment report, the 
“synthesis document,” and the “summaries” intended 
for policymakers, laypersons, and scientists from other 
fields [2-5] provide much more detail than could pos¬ 
sibly be included here. 

Instead, these entries are designed to introduce 
readers to the “methodologies” used in climate science. 
What are the tools used for? How is confidence gained 


in the use of those tools? It is impossible to provide 
a thorough review with the few entries, and few pages 
of each entry in this section, but it is hoped that they 
provide readers with an introduction to the science, 
and methodology, and pointers to places where more 
detail can be found. 

Climate science is a complex field. Climate is 
governed by processes that interact and operate on a 
vast array of time and space scales. The processes involv¬ 
ing radiative transfer, chemistry, and phase changes 
of water are most easily described at atomic and molec¬ 
ular scales; the influence of ice sheets, continents, and 
planetary scale circulations controlling the basic energy 
balance of the planet operate at continental scales; 
even planetary orbital and solar variations operating at 
millennial time scales cannot be ignored. Many of those 
processes are well understood; other less so. 

Models are the means that scientists use to express 
their knowledge about a process, or a set of processes 
using equations that can be solved on a computer. It is 
impossible to represent everything known about all 
the processes that govern climate in detail on today’s 
computers because it is simply beyond the computer 
power currently available to us, and approximations 
must be made. There are also many things scientists 
recognize that are “not known” that may be important 
in understanding climate change. It is hoped that these 
entries will provide hints on how climate models are 
constructed, how they are assessed and tested, and how 
they are used to provide insight into the changing 
climate of the past, and the possible changes that will 
take place in the future. 

Here is a brief description of the entries of this 
section, and the rationale for their inclusion. 

A broad overview of coupled climate system model¬ 
ing is provided in ► Coupled Climate and Earth Sys¬ 
tem Models by Gent. This entry introduces the 
components of a coupled model, how they work 
together, and the kind of scientific problems that they 
are designed to address. 

The individual component models are described in 
more detail in the next set of entries. Each of these 
component models can be used on its own to study 
a piece of the climate system by essentially “prescrib¬ 
ing” the behavior of the other components, or they 
can be used as a part of the bigger model to explore 
interactions. Bitz and Marshall describe treatments of 
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the cryosphere (ice and snow in the Earth System) in 
the entry ► Cryosphere, Modeling of; Yoon and 
Ma cover ocean processes in ► Oceanic General 
Circulation Models. Bacmeister discusses global atmo¬ 
spheric models from a weather modeling point of view 
in ► Weather Prediction Models, and Rasch discusses 
these models from a climate perspective in the entry 
► Atmospheric General Circulation Modeling. Niu 
and Zeng provide an introduction to land surface 
models in ► Earth System Model, Modeling the Land 
Component of. These models provide scientists 
with a picture of how physical components of the 
climate system respond to “forcing” from other 
components. 

It is also interesting to explore the interaction 
between the physical system and society. ► Integrated 
Assessment Modeling by Edmonds et al. describes 
“Integrated Assessment” modeling and climate change 
that begin to assess how humans affect climate and how 
climate affects humans. 

Some aspects of climate change are most easily 
explored at a very high resolution with more detail 
than can be afforded on today’s computer systems. 
This motivates the entry on ► Regional Climate 
Models by Leung. Methods used to assess uncertainties 
in climate change studies are outlined in ► Climate 
Change Projections: Characterizing Uncertainty Using 
Climate Models by Sanderson and Knutti. 

Climate varies on many time scales. The entry by 
Goddard on ► Climate Predictions, Seasonal-to- 
Decadal provides a very nice introduction to the issues 
of prediction of climate processes on multi-decadal 
time scales. The climate section concludes with an 
example by Wang and Lau of one way that the mankind 
may be influencing a very important feature of the 
Earth system that influences the lives of billions of 
people, the Indo-Asian monsoon (► Monsoon Sys¬ 
tems, Modeling of). 

It is hoped that scientists and engineers find the 
entries interesting, and that they stimulate additional 
study in this important topic in Sustainability Science. 
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Glossary 

Bayes’ Theorem A law in probability theory relating 
the probability of a hypothesis given observed evi¬ 
dence to the often easier to characterize probability 
of that evidence given the hypothesis. The theorem 
states that the conditional “posterior” probability 
of an event A given an event B is equal to the “prior” 
probability of A multiplied by the likelihood of 
B given A is true, normalized by the prior probabil¬ 
ity of B. 

Climate sensitivity The equilibrium global mean near 
surface air temperature response in Kelvin to 
a sustained doubling of the atmospheric carbon 
dioxide concentration. 

CMIP-3 The Coupled Model Intercomparison Project 
Phase 3, a set of coordinated model experiments 
using General Circulation Models from the world’s 
major modeling centers. 

Detection and attribution A process whereby spatial 
“fingerprints” associated with individual climate 
forcing factors (such as aerosol or greenhouse gas 
concentrations) are identified and used to quantify 
whether an observed change exceeds the range of 
natural internal climate variability (detection) and 
to attribute it to different causes, that is, different 
forcings (attribution). 

Empirical model A model based on fitting empirical 
data, and thus makes no attempt to justify its rep¬ 
resentations of the system with any physical basis. 

General circulation model (GCM) A three-dimensional 
mathematical model for the atmosphere and possibly 
the ocean, land, and sea ice. 

Initial condition ensemble A number of simulations 
using a single climate model, each with a small, 
unique perturbation to the initial state. 

Last glacial maximum (LGM) A period in the most 
recent ice age lasting several 1,000 years, peaking 


approximately 20,000 years ago at the maximum 
extent of the ice sheets. 

Lead time The period in between the time at which the 
forecast is made and the time to be forecasted. 
Multi-model ensemble (MME) A collection of struc¬ 
turally different models from a range of institutions 
used to perform a coordinated set of experiments. 
Parameter space The multidimensional domain cre¬ 
ated by considering the possible values of a number 
of parameters within a model. 

Perturbed physics ensemble A set of climate simula¬ 
tions generated by taking a single physical model 
and altering uncertain parameters within a range of 
plausibility. 

Prior probability (marginal probability) The proba¬ 
bility of an event before any additional data is 
considered in a Bayesian sense. 

Posterior probability The probability of an event 
after considering additional relevant evidence in 
a Bayesian sense. 

Systematic error The difference between a model sim¬ 
ulation and observations or a poorly represented 
process which is not reducible by parameter tuning. 

Definition of the Subject and Its Importance 

The atmosphere, ocean, land surfaces, and ice sheets of 
the Earth are highly complex and coupled systems, with 
physical laws which describe behavior from the micro¬ 
scopic to the planetary scale. General Circulation 
Models are computational analogs for these physical 
systems, which can be used to study how these systems 
might behave when boundary conditions are changed 
(e.g., by increasing the concentration of atmospheric 
greenhouse gases). 

Inherent in the design of such models are a myriad 
of choices when deciding which components of the 
system are to be modeled and how to represent pro¬ 
cesses which cannot be currently modeled explicitly. In 
order to have any confidence in the ability of our 
models to have value for simulating aspects of future 
climate change, it is necessary for those models to 
reproduce observable properties of the physical system. 
However, model errors in the simulation of the past or 
present are likely to be smaller than errors in future 
projections because model developers can use 
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observations and historical records in the development 
of their code. Additionally, some processes may not be 
observable or testable yet, because they might only take 
place in a warmer (or otherwise changed) world. 

One way to characterize at least some of the uncer¬ 
tainty in future projections is to produce an ensemble 
of climate simulations, each making different but rea¬ 
sonable assumptions about their representation of 
physical processes. 

In recent years, a number of groups in the interna¬ 
tional climate science community have produced Gen¬ 
eral Circulation Models of the earth system, each 
making different choices about model complexity, res¬ 
olution, and parameterization of processes which occur 
at scales finer than those resolved. By conducting coor¬ 
dinated experiments with each of these models, it has 
become possible to examine some of the effect that 
such choices have on uncertainty in future climate 
simulations. However, the sheer volume of data and 
range of models available from such an ensemble pre¬ 
sents a new challenge for the science to address: How 
can a spread of non-independent “best guesses” be 
combined to produce meaningful statements of uncer¬ 
tainty which are relevant to climate-related policy 
decisions? 

Introduction 

In 1979, Jule Charney chaired a committee on anthro¬ 
pogenic global warming, producing a report [1] pro¬ 
viding a brief overview of the state of the science of 
climate change. At the time, two General Circulation 
Models were available for consideration, one led by 
Syukuro Manabe and the other by James Hansen. The 
report produced an estimate for the climate sensitivity 
(the equilibrium global mean temperature change to 
a doubling of the atmospheric carbon dioxide concen¬ 
trations) based on the mean result of these two models. 
In comparing the predicted future climate of these two 
models, the report stated: 

► We conclude that the predictions of C0 2 -induced cli¬ 
mate changes made with the various models examined 
are basically consistent and mutually supporting. The 
differences in model results are relatively small and 
may be accounted for by differences in model charac¬ 
teristics and simplifying assumptions. 


This, in many ways, represents the first effort to 
combine multiple results from an ensemble of climate 
model simulations, and the statements made using 
those models are still relevant to ensemble modeling. 
A better understanding in the uncertainties in the sim¬ 
ulations and increased confidence can be claimed if an 
ensemble of somewhat independent models produces 
common features in its simulations, and if the origins 
of the differences between simulations may be traced 
back to physical characteristics. 

When presented with a range of simulations of 
future climate, one must make judgments on many 
levels on how that ensemble should be interpreted: 
How should model agreement, or lack of it, translate 
into a degree of confidence in the simulations? Should 
all models be treated equally, and if not then how 
should one distinguish between them? If some pro¬ 
cesses are absent from some or all of the simulations, 
how can the projections be updated to account for 
these “known and unknown unknowns”? Should each 
ensemble member be interpreted to be an estimate of 
the “truth” with some unknown error, or should the 
“true” earth system be considered as a potential mem¬ 
ber of the ensemble? Although some of these questions 
verge on the philosophical, the judgments made in 
answering them can have large effects on the results 
themselves that are obtained and the degree of uncer¬ 
tainty in those results. 

In recent years, there have been systematic efforts to 
explore and characterize uncertainty using large 
ensembles of increasingly complex models of the 
earth system. These model simulations have been coor¬ 
dinated and analyzed to help in characterizing climate 
change in a series of assessment reports for the Inter¬ 
governmental Panel on Climate Change (IPCC). In 
1990, 1995, 2001, and 2007, a selection of GCMs were 
assembled from various major modeling groups 
around the world to compare simulations of past, 
future, and other idealized scenarios of climate change. 
Through the successive decades, model complexity and 
scope have increased; the early GCMs of Manabe [2] 
and Hansen [3] modeled atmospheric dynamics and 
radiative transfer, with a simplistic representation of 
the hydrological cycle. By the time of the First Assess¬ 
ment Report of the IPCC [4] (FAR) in 1991, models 
included clouds, a land surface model, and prescribed 
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ice cover. For the Second Assessment Report [5] (SAR), 
some models also included a representation of the 
ocean and interactive sea ice. In the Third Assessment 
Report [6] (TAR), some models considered the effects 
of volcanic eruptions and aerosol emissions, with 
a fully dynamical representation of the oceans. By the 
time of most recent Fourth Assessment Report [7] 
(AR-4), some models were beginning to include an 
explicit representation of the carbon cycle in the earth 
system. Today’s models continue to model additional 
components of the earth system, such as interactive 
vegetation, dynamically resolved ice sheets, a coupled 
carbon-nitrogen cycle and full atmospheric chemistry. 
In parallel with all of these improvements, the last few 
decades have also seen a continued increase in model 
resolution. Where the models used in the FAR split the 
earth into cells as large as 500 km on a side, models for 
the AR4 can resolve at a scale of a few tens of kilometers. 

This entry focuses on the advantages and additional 
complexities which one must consider when studying 
a range of different model simulations. Rather than 
giving a comprehensive description of the results of 
the models assessed in the successive generations of 
the IPCC, this entry will discuss the added technical 
and conceptual challenges encountered when consid¬ 
ering the results of a range of non-independent models 
and how a range of simulations may be combined into 
best estimates and uncertainties for future climate 
evolution. 

Projection Uncertainty and the Need for 
Ensembles 

Empirical and Physical Models 

In 150 AD, Ptolemy devised a model of the motion of 
planets in the solar system by describing a system of 
concentric, geocentric circles (or “deferents”) on which 
were mounted smaller circles (“epicycles”) on which the 
planets themselves were mounted. This system thus 
had a large number of degrees of freedom (the diame¬ 
ters and speeds of rotation of each of the deferents and 
epicycles), which could be finely tuned to reproduce 
the motions of the bodies in the night sky. Such was the 
predictive power of this approach, that variations of 
this simple model were accepted until Copernicus’ 
heliocentric model was published in 1543. Although 
Copernicus’ model fits the established view of the 


universe more closely, both of these models were empir¬ 
ical in that they were not based on any physical princi¬ 
ples at that time. However, even without a physical basis, 
Galileo was able to validate the Copernican model by 
studying the phases of the planet Venus - which was only 
consistent with the heliocentric formulation. It was not 
until Newton’s law of universal gravitation that the 
model could be given a physical underpinning. 

Any model of a physical system is an approximate 
representation of the truth. It should be able to repro¬ 
duce some behavior of that system, and it might do this 
empirically like Ptolemy’s model or by explicitly simu¬ 
lating physical processes within the true system like an 
orbital system based upon Newtonian gravitation. 
A model, whether empirical or physical, cannot ever 
be validated in the strict sense of showing it to be 
a wholly correct representation of the true system; it 
can only be evaluated by reproducing some output not 
used in the tuning of the model itself. This was true of 
Galileo’s observation of the phases of Venus - informa¬ 
tion not used in the tuning of the Ptolemic model. 
However, any empirical model becomes very sensitive 
to changing boundary conditions. For example, if the 
mass of the Sun were to instantly double, the Coperni¬ 
can model of the solar system would be a very poor 
approximation of planetary motion, whereas a model 
based upon Newtonian mechanics would capture 
enough of the necessary physics to remain useful. 

These fundamental principles are relevant to meth¬ 
odologies for simulating the climate today. If the sim¬ 
plest, zero dimensional empirical model of the climate 
is taken to be: 

dV 

C—— = F' — XT' 

dt 

where T' is the global mean temperature difference 
from an equilibrium state, F' is the additional radiative 
forcing to the planet (i.e., the change in the top of 
atmosphere radiative balance caused by a forcing, e.g., 
increased C0 2 ), C is the effective heat capacity of the 
system, and X is the global sensitivity parameter. This 
equation has two free parameters, C and X which may 
be tuned such that the model can fit an observed past 
time series of F' and T\ that of the twentieth century, 
for example. The model can then be evaluated by 
predicting a previously unseen time period, such as 
the last glacial maximum. This evaluation, if successful, 
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would give more confidence in the model but would 
not necessarily make it trustworthy for a prediction of 
the future - where the boundary conditions are outside 
those seen in both the training and validation period. 

The added advantage of using a GCM to simulate 
future climate is that model simulations are in theory 
more trustworthy because they are based upon physical 
principles, which it is believed can reproduce observed 
climate by coupling underlying physical laws that 
are known to be true. However, this view is often 
overoptimistic; although some components of the 
modeled climate, such as the equations of motion in 
the atmosphere or the instantaneous radiative forcing 
due to a change in atmospheric carbon dioxide con¬ 
centrations are well understood and consistently 
implemented in different GCMs, there are other pro¬ 
cesses such as convection which cannot explicitly be 
resolved with current computing resources. These pro¬ 
cesses and their effects on the large scale climate must 
be approximated with uncertain parameters that must 
be estimated by tuning the model to reproduce some 
observed features of the climate. What this means, in 
practice, is that a GCM is neither only an empirical nor 
an explicitly physical model; it is a hybrid of the 
two where model developers face many arbitrary 
choices in parameterizing processes which cannot be 
explicitly resolved. The necessity for the tuning process 
reintroduces some of the problems encountered with 
an empirical model, with the possibility of false confi¬ 
dence in model performance by over-tuning the model 
to reproduce past climate. The ambiguity in these 
parameterizations justifies the existence of multiple 
models for the same purpose [8]; each of these models 
is seen as a plausible approximation of the climate 
system given the imperfect understanding, the uncer¬ 
tainties in observations and the computational 
constraints. 

Types of Uncertainty and the Need for Ensembles 

Although weather and climate simulations share some 
properties (sometimes they are conducted with the 
same model), the limiting uncertainties are very differ¬ 
ent. Climate represents the distribution of all possible 
states in which one expects to find a system, whereas 
weather is the specific evolution of the system from 
a given initial state. A model-based weather forecast is 


a prediction , in that the initial state of the simulation is 
as close as possible to observations and the absolute 
errors grow rapidly afterwards. In weather prediction, 
these errors occur as specific weather systems evolve 
from the initial state. Because the atmosphere is 
a chaotic system, very small errors in the estimate of 
the initial state can result in very large differences in the 
distributions of weather systems a few days later. Initial 
condition uncertainty is evaluated by repeating simu¬ 
lations with a range of slightly different initial condi¬ 
tions to form “Initial Condition Ensembles.” The 
spread of these ensembles initially grows rapidly but 
eventually saturates when the “memory” of the initial 
state is lost (this timescale is longer in the oceans, 
perhaps up to 10 years for North Atlantic ocean tem¬ 
peratures [9]). 

On decadal to century timescales, the mean and 
spread of an initial condition ensemble represents 
a projection of the future climate state, although this 
spread is only a small fraction of the total error (some¬ 
times known as “uncertainty of the first kind”). The 
second kind of uncertainty relates to the boundary 
conditions of the problem, some of which are naturally 
occurring such as the level of incoming solar radiation 
or volcanic activity, while others are dependent on 
anthropogenic factors such as the future emissions of 
greenhouse gases or aerosols. To address this uncer¬ 
tainty, one must perform a range of simulations 
using different plausible scenarios for changes in 
boundary conditions. The results of any simulation 
are therefore conditional on assumptions made about 
future human behavior. There is currently little real 
skill in forecasting future volcanic activity and chang¬ 
ing solar activity so simplistic but plausible scenarios 
for these quantities are often used (such as repeating 
past values). However, in most future scenarios these 
represent a relatively small fraction of the total 
anthropogenic climate forcing. 

Figure 1 shows the relative sources of error in a cli¬ 
mate model projection as a function of the lead time 
[11]. For lead times of less than a decade, the uncer¬ 
tainty in the initial state combined with chaotic 
error growth and natural patterns of variability are 
the dominant sources of error but on the scale of 
several decades or more, it is the future emissions 
scenario which dominates the uncertainty. Predictions 
on all timescales, however, are subject to model 
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Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 1 

A figure showing the fractional sources of uncertainty in 
a climate model projection as a function of time. The 
orange "internal variability" line shows the errors due to 
uncertain knowledge of the initial state of the system, 
while the dotted line shows the potential reduction in error 
if effort is made to assimilate ocean observations into the 
model at the start of the simulation. The green line shows 
the fractional error due to the unknown future emissions of 
climate altering gases, while the blue line shows the error 
due to the model imperfections. The boxes show where 
different types of uncertainty are dominant for a projection 
of future climate (Reproduced from [10]) 

uncertainties. These arise when a climate model con¬ 
tains parameterizations for unresolved or missing pro¬ 
cesses. Parameterizations take large-scale quantities 
resolved by the model, such as temperature, wind 
speed, and humidity, and relate them to unresolved 
processes, such as convective mass flux and cloud pro¬ 
files. Although these parameterizations are usually 
constructed from physical underpinnings and evalu¬ 
ated with observed data, they introduce some unavoid¬ 
able uncertainty when a range of parameter values 
might be physically plausible. GCMs are often subject 
to a tuning of parameter values to reproduce features 
of the observed climate, but with tens or hundreds 
of uncertain parameters this process is time consum¬ 
ing and can yield multiple solutions because of the 
computational cost, a systematic tuning of all param¬ 
eters is unfeasible. 


One method of quantifying the parameter uncer¬ 
tainty problem is to construct a “Perturbed Physics 
Ensemble” (PPE) using a single GCM. This process 
has been attempted using several major climate models 
[11-13] and involves taking a subset of important 
unknown parameters within the GCM and perturbing 
them within the bounds of physical plausibility. Such 
experiments might perturb, for example, a parameter 
which states the necessary humidity required for the 
formation of cloud. By varying this parameter, one can 
change dramatically how the model distributes clouds 
both in the present day and the future. These changes 
can affect the strength of global feedbacks which can 
change, for example, the amount of warming that the 
model predicts for a given rise in greenhouse gases. An 
example of a PPE is the “climateprediction.net” exper¬ 
iment [13], which used idle time in volunteer’s com¬ 
puters to perform perturbed simulations of future 
climate. Incorporating this range into an uncertainty 
estimate for predictions of future climate requires 
a framework for joint consideration of each model’s 
performance in simulating past and present climate as 
well as its future response. 

The remaining model uncertainties are due to so- 
called “systematic” or structural errors arising from the 
model design, that is, the choices of which processes to 
model, the resolution of the model, the numerical 
schemes, and the specific form of the parameterization 
scheme. The structural differences between different 
GCMs provide a lower bound on the extent of the 
structural difference between any one GCM and the 
true climate system, but in reality the models in an 
ensemble such as those used for the IPCC reports 
share many common properties in terms of resolution, 
numerical methods, missing components, and param¬ 
eterization schemes which might make all the models 
subject to similar errors. Nevertheless, considering 
a range of GCMs which make different modeling 
assumptions is an essential step when evaluating the 
robustness of any prediction of future climate change 
because it places a lower bound on the uncertainty 
arising from the choices made by model developers. 

Multi-model and Perturbed Physics Ensembles 

When making predictions of a future climate state, 
there is a wealth of evidence to suggest that considering 
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a combined prediction using multiple, somewhat inde¬ 
pendent models yields more accurate results than any 
single model [14-16]. Additionally, the spread of sim¬ 
ulations provides a measure of robustness in the pre¬ 
diction. The following section describes some reasons 
for the increased performance of multi-model and 
perturbed physics ensemble forecasts, together with 
some of the complexities arising from their analysis. 

Range of Ensemble Responses 

The spread of results from an ensemble of climate 
simulations is dependent upon the experimental 
design, or lack of it. A perturbed physics ensemble 
(PPE) has the luxury of allowing some control of the 
distribution of models in the parameter space of the 
model, though the structure of the underlying model 
places a fundamental limit on the range of observable 
behavior in the ensemble. For example, if a PPE is 
created by perturbing cloud parameters in a GCM 
which has no parameterization for cirrus clouds 
formed by gravity waves, then there is no way that 
such an ensemble can include uncertainty about that 
process. Designers of such experiments must also be 
aware that the decisions of how to sample the param¬ 
eter space of a model will directly influence the distri¬ 
bution of future climate simulations [17]. In contrast, 
multi-model ensembles (MMEs) such as the Coupled 
Model Intercomparison Project (CMIP-3), explore 
“systematic” model differences, which sample models 
with different representations of the physical system, 
rather than simply varying parameters in a single 
model. These are “ensembles of opportunity” where 
multiple modeling groups run coordinated experi¬ 
ments but the ensemble itself is not sampled in any 
systematic fashion. Nor is the ensemble randomly sam¬ 
pled because each modeling group will tune their 
model to minimize model differences from observa¬ 
tions, thus creating an ensemble of “best-guesses.” This 
is quite different from the PPE case where the model is 
intentionally detuned to produce a wide range of 
behavior. Evidence for this can be seen by examining 
the spread of climate sensitivity in both a multi-model 
and a perturbed physics ensemble (Fig. 2). When con¬ 
sidering a range of observational constraints on climate 
sensitivity, it is apparent that the multi-model values 
tend to cluster about the most likely value, whereas the 


Combining multiple lines of evidence 
Proxy data from millions of years ago 
Last glacial maximum (models) 

Last glacial maximum (data) 

Volcanic eruptions 
Last millenium 
Current climate mean state 
Instrumental period 






L — | 



-Climateprediction.net 

- CMIP-3 




— i _ 

_ i -- 1 - 


Climate sensitivity (K) 

Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 2 

Distribution functions of climate sensitivity (an estimate of 
the equilibrium response of a model to a doubling of C0 2 ) 
for models in the CMIP-3 ensemble (hinting at the range of 
responses from an MMF analysis), compared with 
a selection of models from the climateprediction.net 
project (hinting at the range of uncertainty from a PPE 
analysis). Box and whisker plots show estimates of the most 
likely values, together with 66th and 90th percentiles of 
likelihood for climate sensitivity taken from various lines of 
observational evidence (Adapted from [19]). Histograms 
represent the fraction of models in each 0.5 K bin of climate 
sensitivity for the atmosphere-only components of 19 
models in the CMIP-3 archive and for a 2,000 member 
subset of the climateprediction.net ensemble [13] 

perturbed physics ensemble contains models which 
span the full range of uncertainty in climate sensitivity. 
Although impossible to verify, it also is possible that 
there is also a component of social anchoring [18] 
which draws multi-model sensitivities toward the 
mean value as any group which finds their model to 
be an outlier may have to defend why this is the case, 
whereas a model with the consensus value of sensitivity 
is less likely to be questioned. 

The Ensemble Mean State 

In various fields, it has been shown that the combined 
performance of multiple models can exceed that of an 
individual ensemble member. Examples of this can be 
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seen in models of crop yield [20], disease modeling 
[21], and even in the optimization routines used for 
movie recommendations based upon past viewing 
choices [22]. Similarly in seasonal climate predictions, 
it has been shown that the multi-model ensemble 
means yield better forecasts, in general, than using 
only initial condition ensembles from a single model 
[16]. A multi-model study incorporating a set of 
initial conditions for each model is often referred to 
as a “super-ensemble.” The accuracy of the model mean 
often performs best in multivariate applications, that 
is, a single model may show increased skill in predicting 
one particular diagnostic, but when many variables are 
considered in the same metric the ensemble mean 
prediction tends to show greater skill than any individ¬ 
ual model [23]. 

This effect can also be seen in GCM simulations of 
recent past climate. Figure 3 shows successive genera¬ 
tions of the CMIP ensemble evaluated using a multi¬ 
variate error metric comparing twentieth century 
observations to model simulations of that period for 
a variety of model diagnostics. The figure shows that 
model errors in simulating the current climate have 
decreased over time but also that for each generation 
of the ensemble, the multi-model mean results in 
a model-data discrepancy almost as good, or better 


than the best performing ensemble member. Various 
studies have found that both in detection and attribu¬ 
tion studies [24] and in simulations of recent climate 
[25] that a multi-model mean provides a better multi¬ 
variate simulation than any individual model. 

This approach is common in the reports of the 
IPCC, where an unweighted mean of future model 
simulations is used to show a “best-guess” simulation 
of future climate, while the degree of model spread is 
used to estimate some measure of the significance of 
the result. There are more sophisticated methodologies 
that one may use for model combination, involving 
Bayesian methodologies [27] or model weighting 
[28], but the correct implementation and interpreta¬ 
tion of such studies is subject to some debate. It has 
been shown that the ranking of model performance 
within a multi-model ensemble such as CMIP-3 is 
often highly dependent on the choice of metric used 
to evaluate the model. A metric based on the model’s 
ability to reproduce observed variability will produce 
a different ranking than a metric which evaluates the 
model simulation of the mean state [30], and the per¬ 
formance of different models on these two metrics are 
very weakly correlated (Fig. 4). In addition, violation of 
the model “democracy” (one model, one vote) in the 
IPCC process is potentially controversial, as choices of 
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Climate Change Projections: Characterizing Uncertainty Using Climate Models. Figure 3 

A comparison of model errors reproduced from Reichler and Kim ([26], corrigendum) multivariate errors are evaluated for 
successive generations of the coupled model intercomparison project (CMIP). Colored circles represent individual models 
in the ensemble, whereas black circles show the performance of the multi-model mean. REA represents the NCEP 
reanalysis and the CMIP-3 PICNTRL is the performance of the preindustrial control simulations when evaluated against 
present-day observations 
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Mean state 

Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 4 

The relationship between model skill in reproducing the 
mean climate state and skill in reproducing patterns of 
natural variability for models in the CMIP-3 ensemble. Each 
point represents a single model in the CMIP-3 archive, and 
errors are averaged over a large number of diagnostics. The 
black line shows the fitted least-squares regression 
(Reproduced from [29]) 


how to weight models could be interpreted as a polit¬ 
ical statement [31]. 

The question of why multi-model means perform 
better than individual models is a complex one. Cer¬ 
tainly, the mean is not in itself a self-consistent repre¬ 
sentation of a physical system and is therefore not 
subject to many of the restrictions that apply when 
tuning one model to reproduce an observed climate. 
As an example, a single model maybe tuned in different 
ways to reproduce two different observed values “A” 
and £C B,” but it might be impossible to tune the model 
to reproduce “A” and “B” simultaneously. However, if 
different models in the ensemble make different 
choices about the relative importance of “A” and “B,” 
it is likely that the ensemble mean will be close to the 
observed values in the case of a large ensemble. Clearly, 
real GCMs have a large number of observable diagnos¬ 
tics to reproduce and a large number of tuning param¬ 
eters, but it remains true that the multi-model mean is 
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Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 5 

This plot, from Knutti [31] shows the fraction of land area 
between 60°N and 60°S experiencing a given change in 
precipitation in the dry season. Precipitation change is 
measured in percent per unit global temperature rise in 
Kelvin measured over the period 1900-2100 relative to the 
1900-1950 average. Each light blue line represents a single 
CMIP3 ensemble member, the dark blue line marks the 
average of all distributions. The black line shows the 
precipitation change in the multi-model mean. The 
expected absolute precipitation change in the multi-model 
mean is about 30% smaller than in any single model 

less restricted by model structure than any individual 
model in the ensemble. Another interpretation is that 
some of the model biases are random perturbations 
about the truth (i.e., each model reproduces the obser¬ 
vations with some pattern of bias that is characteristic 
to that model and but different in each model), such 
that averaging many models reduces the magnitude of 
the biases. In the limit of completely random indepen¬ 
dent biases, the average would be perfect for an infinite 
number of models. 

In some cases, the multi-model mean can indicate 
behavior unrepresentative of any of the models within 
the ensemble. Figure 5 shows the distribution of 
expected percentage precipitation change per unit 
global temperature increase in the current dry season 
for various models within the CMIP3 archive. Each 
individual model shows a wide distribution of change 
with some regions showing up to 30% decrease in 
precipitation for every degree rise in global mean tem¬ 
perature. If the models are averaged together in 
advance, however, the resulting multi-model mean 
has no region which displays this extreme decrease in 
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precipitation in the dry season. The multi-model mean 
is thus not representative of the findings of the indi¬ 
vidual ensemble members in the respect that it fails to 
recover the extremes of the distribution of precipita¬ 
tion change. The reason for this discrepancy is, at least 
partially, a difference of the spatial representation of 
precipitation patterns in different ensemble members. 
Different models have different resolution, representa¬ 
tions of orography, and parameterizations for precipi¬ 
tation. When combined this gives each models unique 
spatial modes of variability for precipitation. This 
allows each model to display extreme future drying in 
some specific regions, but critically those regions are 
not necessarily identical in all models in the ensemble, 
effectively smearing out the small scales and the 
extremes of the distribution. Thus, although the mean 
result of a large ensemble may provide a reduction in 
model bias, the averaging process itself may create an 
unrepresentative forecast. 

Model Independence 

Given a set of truly independent models distributed 
about the truth, one would always be able to improve 
simulation quality by increasing the number of models 
in the ensemble as truly independent errors would tend 
to cancel. Any study which treats CMIP ensemble 
members as independent realizations of a possible 
future is implicitly making this assumption, but one 
can make statistical arguments to show that the models 
are not distributed in a way which would be consistent 
with this assumption [36]. To illustrate this visually, 
Fig. 6 shows maps of temperature and precipitation 
from a selection of models in the CMIP-3 archive, all 
of which could be used with equal weight in producing 
a multi-model mean. However, one can see instantly 
that the two GFDL models have very similar biases in 
surface temperature, even though they are submitted as 
separate models to the archive. The temperature biases 
in the other two models shown have very different 
spatial patterns. The precipitation plots, however, 
show that there are some common biases in all four 
of the models. There are many reasons why these com¬ 
mon biases might exist; all models in the CMIP-3 
ensemble cannot explicitly resolve features smaller 
than about half a degree, which renders them incapable 
of simulating behavior such as atmospheric blocking or 


the response to local orography. Models may also share 
parameterization schemes and be tuned to reproduce 
the same observations [32], and in some cases the same 
model can be submitted to the ensemble at multiple 
resolutions meaning that models can share consider¬ 
able parts of code, making it very likely that model 
biases will be correlated. In summary, it is both 
expected and evident that the current generation of 
climate models does not provide an independent sam¬ 
ple of estimates distributed about an underlying truth, 
and it is unlikely that increasing the number of similar 
models in the ensemble would drastically increase the 
accuracy of combined predictions. 

Model Validation and Tuning 

GCMs are frequently tuned by minimizing differences 
between simulations of the past century and observa¬ 
tions. The observations can be in the form of data from 
satellites and in situ measurements or may be expressed 
as reanalyzed products which attempt to incorporate 
information from both of these. Simulations of earlier 
periods may also be evaluated against proxy data (esti¬ 
mates of temperature or rainfall etc., produced from 
tree rings, ice cores, etc.), although the long simulations 
and necessary model reconfiguration for these periods 
often mean they do not form part of the active model 
development process. Because models are tuned to 
agree with data over the twentieth century, they tend 
to agree with each other for this time period. There is 
little spread in the model simulations over the twenti¬ 
eth century. Figure 7 taken from the IPCC AR-4 report 
shows that the models behave similarly throughout the 
twentiethcentury when compared to any one of the 
scenarios for the twenty-first century. The reader 
should not attach any significance to the absolute 
values of the global mean temperature time series, 
which are expressed as anomalies with respect to the 
1980-2000 mean for all models. 

The remarkable consistency of the global mean 
temperature evolution in the twentiethcentury in the 
current generation of GCMs is made possible through 
the various degrees of freedom the models have in 
fitting this well-observed period. The response of any 
model is governed by a combination of transient ocean 
heat uptake, climate sensitivity, and the radiative forc¬ 
ing to the system, which effectively makes the problem 



Climate Change Projections: Characterizing Uncertainty Using Climate Models 


2107 



gfdl_cm2_0 


gfdl_cm2_1 


ncar_ccsm3_0 


iap_fgoals1_0_g 


c 







Climate Change Projections: Characterizing Uncertainty Using Climate Models. Figure 6 

Temperature and precipitation maps of the North Atlantic region from four models submitted to the CMIP-3 archive. 
Each map shows the 1980-2000 averages for June, July, and August - expressed as a difference between the model 
simulation and the NCEP reanalysis for the same period. The top row shows the anomaly for surface temperatures, while 
the bottom row shows the anomaly for annual total precipitation 




poorly constrained with multiple ways to fit the twen¬ 
tieth century global mean temperature time series [33] . 
A study by Kiehl (2007) concluded that models pro¬ 
duced this agreement by compensating between differ¬ 
ences in climate sensitivity with differences in aerosol 
forcing. Figure 8 shows both the climate sensitivities 
and the later twentieth century anthropogenic forcing 
of climate in a selection of GCMs [34]. It is apparent 
that those models with a larger anthropogenic climate 
forcing in the twentieth century have a smaller climate 
sensitivity, allowing the models to successfully repro¬ 
duce the twentieth century temperature record (a weak 
correlation between aerosol forcing and climate 


sensitivity is also seen in the CMIP-3 archive used for 
the AR4 report [35]). 

In each of the twenty-first century scenarios illus¬ 
trated in Fig. 7, the aerosol concentrations are predicted 
to decrease as increasingly stringent clean air legislation 
comes into effect. Meanwhile, all the scenarios show 
a continuing increase in greenhouse gases throughout 
the twenty-first century, which makes the climate sen¬ 
sitivity of the models the primary factor influencing 
their future evolution as the total anthropogenic 
forcing increases. The differing climate sensitivities 
amongst the CMIP3 models thus cause a larger spread 
in the twenty-first century simulations than for the 
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Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 7 

A figure reproduced from the IPCC AR-4 report (Fig. 10.4) 
showing the mean and inter-model spread of simulations 
in the CMIP-3 model archive for simulations of the 
twentieth century, together with the simulations of three 
different scenarios for periods after the year 2000. Global 
mean temperatures are shown relative to the 1990-2000 
mean. In each case, the line represents the multi-model 
mean and the shading shows the 1 standard deviation 
ensemble spread 

twentieth century simulations. However, it should be 
noted that most AR4 models included the “direct” 
radiative effect of aerosols, but not their indirect effect 
on cloud properties. This means that eliminating the 
correlation between climate sensitivity and aerosol 
forcing would not necessarily reduce projection uncer¬ 
tainty, and the success of most models in simulating the 
twentieth century may be partly spurious [36]. 

An additional problem lies in the lack of indepen¬ 
dent data with which to tune and verify the models. In 
many cases, model quality metrics are based upon 
mean state and variability data from the latter twenti¬ 
eth century data, which is very likely to be used in the 
development of parameterizations and tuning of the 
model. For example, most models use satellite data 
products to tune the top of atmosphere energy fluxes, 
and these products are often considered to be one of the 
more robust constraints when evaluating a model qual¬ 
ity metric. In addition, models may often be evaluated 
against reanalyses, rather than the observational data 
itself. Reanalysis products are model simulations 
strongly “nudged” to reproduce an incomplete set 
of observations, effectively filling in the gaps with 
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Climate Change Projections: Characterizing Uncertainty 
Using Climate Models. Figure 8 

A figure reproduced from Kiehl [35] which shows the 
relationship between climate sensitivity and total 
anthropogenic forcing of climate in the late twentieth 
century in a set of GCMs, represented by the black dots. The 
solid line represents a theoretical relationship between the 
two quantities necessary to produce the warming 
observed over the twentieth century, the dashed lines 
show the uncertainty in this relationship due to uncertainty 
in transient ocean heat uptake 


self-consistent model data output. This process intro¬ 
duces an additional layer of complexity, because the 
reanalysis climate will contain features both of the 
constraining observations and the underlying model. 
For fields where real data is sparse or where data is not 
assimilated directly (such as precipitation metrics), the 
reanalysis output might have much more dependence 
on the underlying model than on any real-world data. 
As a result, when using reanalysis data as a constraint 
for multiple models, those models with a similar rep¬ 
resentation of the hydrological cycle to that used in the 
reanalysis will appear to perform better. 

The model development process involves a consid¬ 
erable amount of value judgment, as a model serves 
many purposes and some compromise between the 
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many different plausible performance metrics must 
be made. The relatively small number of degrees 
of freedom available to model developers makes it 
impossible to perfectly match a large number of 
observable quantities simultaneously, which means 
that there may be multiple possible parameter combi¬ 
nations which are equally valid. Each of these combi¬ 
nations, although they fit historical observations 
equally well, may have different projections of future 
climate change if they exhibit different climate sensi¬ 
tivities or aerosol responses. 

In the past, model tuning has largely been a time- 
consuming process of expert judgment and trial and 
error, which leads to some uncertainty of what errors in 
a simulation are irreducible through parameter adjust¬ 
ment. Although not yet used operationally, various 
techniques have been proposed to automate this tuning 
process. One technique uses an optimal gradient 
descent approach to minimize some multivariate 
error metric [34]. This approach can yield multiple 
solutions, as the response surface in the parameter 
space of the model may show local minima. Another 
approach involves using a preexisting perturbed phys¬ 
ics ensemble and fitting a nonlinear response surface 
[37] to interpolate between the sampled points in the 
parameter space. This effectively produces a “model 
emulator” which can predict the point in parameter 
space which minimizes model error, but the result is 
dependent on the parameter space being sufficiently 
densely sampled to capture the dominant features. 
One can also combine the predictions from a range of 
plausible perturbed models. The ensemble Kalman fil¬ 
ter [38] approach has been used [39] to create a set of 
valid perturbed versions of a single climate model, but 
is subject to uncertainty that there is an unknown 
systematic error in the climate model which cannot 
be corrected by parameter modification. 

A final problem lies in the incompleteness of the 
model representation of the climate. Many current 
models, for example, cannot simulate the indirect effect 
of aerosols on cloud amounts. Tuning an incomplete 
model to reproduce the observed radiative balance 
at the top of the atmosphere therefore involves 
overcompensating the cloud amounts by artificially 
enhancing other processes, which arguably makes the 
representation of the current and future state less 
accurate. 


Statements of Probability 

Multi-model Ensembles 

As indicated throughout this entry, the production of 
a probabilistic statement for future climate from 
a multimember or perturbed physics ensemble has no 
clearly established methodology and requires a priori 
assumptions to be made. Arguably the simplest 
assumption that can be made is one of model equality, 
using the democratic “one model, one vote” approach 
[40] . In such a method, the probability of a future event 
is estimated by the fraction of models in which the 
event happens. This hypothesis can be tested by cross- 
validation within an unused subset of the ensemble. 
However, this approach is limited by the implicit 
assumption that the ensemble is a random sample of 
plausible estimates of the true climate, where the vari¬ 
ous arguments in section “Projection Uncertainty and 
the Need for Ensembles” suggest this assumption may 
not be valid. 

The next logical step is therefore to consider some 
measure of model skill as a weighting for each model, 
producing an estimate of future climate as a median of 
model predictions, such that models with a small bias 
are given a greater weight [41]. Such approaches are 
always highly dependent on the exact choice of metric 
used to evaluate the model weighting [29]. 

Many studies have adopted Bayesian methodolo¬ 
gies, where prior beliefs about the range of future 
climate change are updated with information from 
models and/or observations. One example [42] takes 
a prior probability distribution for current and future 
regionally averaged climate signal (or the corres¬ 
ponding climate change signal) and updates this 
using information from models and observations. 
Priors can be chosen to be uninformative (flat over 
a large range of possible values) so that the final PDF 
shape is mainly influenced by the information from 
models and observations. The likelihood of each 
model simulation of the past and the future is then 
represented as the realization from PDFs centered 
around the unknown “true” present and future climate, 
as if the ensemble were a sample from a large idealized 
population of possible models. The width of the PDFs 
is in turn estimated jointly with the climate signals. Its 
magnitude depends on that model bias compared to 
the consensus estimate of the present day and future 
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state. Markov Chain Monte Carlo techniques are used 
to approximate the result of Bayes theorem applied to 
priors and likelihood, allowing a joint probability dis¬ 
tribution for the “true” climate states and the unknown 
parameters characterizing the model distributions to 
be estimated. From it, the PDF of the regional climate 
change signal is also straightforward to derive. 

A Bayesian approach has been also applied at the 
grid-point scale by representing the entire field of 
future climate anomaly for each model in terms of a 
truncated set of basis functions combined with some 
noise estimate [43], such that each model has its own 
low-dimensional set of coefficients to describe the pat¬ 
tern of climate change. The advantage of this approach 
is that a similar Bayesian methodology may be applied 
to derive estimates for the “true” values of the coeffi¬ 
cients, which when recombined with the basic func¬ 
tions results in PDFs for climate change at the grid- 
point level. 

An issue with both traditional weighting schemes 
and the Bayesian approaches is the way in which outliers 
are treated - the so-called “convergence criterion”. In the 
case of a large PPE, such as the climateprediction.net 
experiment, the logic in down-weighting outliers 
assumes that there is some significance in the consensus 
mean projection, errors are distributed randomly and 
that models which deviate strongly from the consensus 
are somewhat less trustworthy. However, in a small 
ensemble of best-guesses such as CMIP-3, this argu¬ 
ment is subject to question. It is possible that a single 
model in the ensemble is able to simulate processes 
which are not simulated in other models. This model 
is arguably more trustworthy than the rest of the 
ensemble and yet it would be down-weighted through 
the application of a simple convergence criterion. 

Another issue with all of the methods discussed 
thus far is the assumption of model independence. It 
can be shown [44] that the width of the final PDF using 
a Bayesian methodology is inversely proportional to 
the number of the models considered in the ensemble. 
Whilst this would be true if all models were indepen¬ 
dent estimates of a true climate, it has been demon¬ 
strated that this not a valid assumption [32]. Although 
some statistical methodologies have endeavored to 
artificially reduce the more obvious interdependencies 
of the CMIP-3 ensemble [45], there is at present no 
generally accepted methodology for doing so. The 


Bayesian techniques that have been developed so far 
tend to produce a PDF narrower than the spread of the 
original ensemble, as the independence assumption 
causes uncertainties to decrease with added ensemble 
members. 

A completely different approach to producing 
model projections is to statistically “calibrate” models, 
where a relationship is established between model sim¬ 
ulations and observations over an observed period. 
Once this relationship has been determined, it may 
be applied to future climate projections to produce 
a “calibrated” estimate of the true future response. 
This approach assumes, of course, that the relationship 
between the projections and the true response will 
remain constant in the future. This approach has been 
applied to large scale metrics such as past and future 
sea-ice loss [46], as well as more complex statistical 
multivariate approaches which find the best fitting 
relationship between modes of variability in model 
simulated and observed past climate, again using 
those relationships to produce a calibrated future pro¬ 
jection [47, 48]. 

Finally, some “detection and attribution” studies 
[49] determine spatial patterns of climate changes asso¬ 
ciated with different atmospheric forcings, using obser¬ 
vations to determine whether models are over- or 
under-representing those changes in past simulations. 
This allows future model projections to be rescaled in 
light of the observations. One of the major uncer¬ 
tainties in such approaches is in the derivation of the 
calibration coefficients themselves, and whether the 
calibration is valid when applied to a future planet in 
a very different state. These uncertainties tend to result 
in wider PDFs than Bayesian methodologies [33]. 

Perturbed Physics Ensembles 

While “one model, one vote” may be a questionable 
assumption in a multimodel ensemble, it is quite osten¬ 
sibly wrong in a perturbed physics ensemble where 
some models have vastly inaccurate simulations of the 
mean climate [50] . PDFs of future climate derived from 
a perturbed physics ensemble have therefore often been 
forced to take a different approach. 

Most studies thus far arising from PPEs have 
focused on producing PDFs for climate sensitivity, 
and have broadly fallen into three categories: weighting 
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of the parameter space, using the ensemble to estab¬ 
lish relationships between observable quantities and 
unknowns such as climate sensitivity, or a traditional 
Bayesian technique. An example of the first approach 
[11] takes a PPE and ascribes each model a weighting, 
based upon model skill in reproducing the observed 
climate. By interpolating between the sampled points 
in the parameter space, one can then produce a 
weighted integral of the unknown quantity (e.g., cli¬ 
mate sensitivity). It is argued, however [51], that the 
PDF obtained from such an approach is fundamentally 
dependent upon the prior assumptions made in sam¬ 
pling the original parameters. 

A second approach of finding relationships between 
observable and unknown quantities has been demon¬ 
strated using both linear [52] and nonlinear [53] trans¬ 
fer functions. In each case, the ensemble is used to 
derive some predictors which internally estimate the 
climate sensitivities of ensemble members. These 
regression coefficients can then be used together with 
observations of the true climate state to make 
a prediction of the true climate sensitivity. Clearly, 
these predictions are subject to uncertainty in the 
observational state and in the internally derived pre¬ 
diction error, both of which may be estimated relatively 
easily. The major “unknown unknown” in such an 
approach is the systematic or irreducible error of the 
underlying model, that is, how much additional uncer¬ 
tainty arises when the predictor is applied to the real 
world. A lower bound of this quantity may be obtained 
by examining the skill of the predictor when applied to 
a multimodel ensemble such as CMIP-3, but this will 
not account for common errors arising from lack of 
resolution or simulated processes. 

The final approach to be considered is the use of an 
ensemble Kalman filter [40]. The ensemble is used 
together with observations to update prior beliefs 
about several unknown model parameters. The ensem¬ 
ble Kalman filter then involves an iterative process 
forming an idealized ensemble of plausible perturbed 
models. Once again the methodology is sensitive to 
assumptions about model error, which scale the relative 
importance of the model-observation discrepancies 
forming the overall cost function. By assuming model 
errors are small, the resulting idealized ensemble will be 
more tightly clustered about the observed state. The 
distribution of climate sensitivities in this idealized 


ensemble is then deemed to approximate a PDF for the 
sensitivity. One advantage of such a technique is that the 
predictions may be validated by producing a hindcast 
for the past (the Fast Glacial Maximum, in this case). 
The FGM simulation can then be used to produce an out 
of sample weighting for the optimized ensemble. 

Future Directions 

The analysis of climate simulations from multiple 
models is still a problem in its relative infancy. Various 
techniques have been proposed in this entry, each 
making different assumptions about model indepen¬ 
dence, prior distributions, systematic model errors, 
and about what statistical framework is appropriate. 
These choices remain, at present, somewhat subjective 
and often yield different probability distributions for 
unknown climate variables. The apparent contradic¬ 
tions between the methodologies can be understood, 
however, in light of the assumptions made. In contrast 
to a numerical weather forecast where thousands of 
verification cases are available to test the forecast skill, 
the climate projections for a century into the future are 
making a statement about a situation never observed 
before and where no model evaluation is possible. 
Because there is only a single realization of the future, 
any statement of probability expresses a degree of belief 
in a Bayesian sense of how different future outcomes 
are supported by current evidence (models, data, 
methods), and is therefore inherently subjective. 

Clearly, any projection (and the uncertainty associ¬ 
ated with it) must be tailored in a fashion useful to 
decisions on policy and planning for a changing cli¬ 
mate. Policymakers tend to push for increases in pre¬ 
cision, but this can lead to decreases in real accuracy if 
predictions are overconfident [54]. There is arguably 
little point in providing PDFs of future change for 
planning purposes if the width of those PDFs are mas¬ 
sively sensitive to either subjective decisions or 
unknown errors, and the raw collection of “best- 
guesses” from the different models is as useful a way 
as any to present the ensemble of forecasts. One inher¬ 
ent danger with this approach, however, is the tendency 
to see the multi-model distribution as a discrete prob¬ 
ability distribution for future climate. As is seen, the 
lack of model independence, the fact that all models are 
neglecting certain sources of uncertainty (e.g., the 
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carbon cycle climate feedback uncertainty) and the fact 
that every modeling group will tend to submit only 
a best-guess climate together implies that the true 
uncertainty may be larger than that indicated by the 
spread of model simulations. 

Future generations of multi-model ensembles are 
also likely to introduce more complex “Earth System 
Models,” at least for some ensemble members. These 
models, in addition to atmosphere, ocean, land, and sea 
ice components are likely to introduce fully coupled 
carbon-nitrogen cycles, chemistry, urban, and ecosys¬ 
tem models into the simulation. These components of 
future uncertainty have not been thoroughly explored 
in previous generations of the CMIP experiments, and 
are likely to increase the spread of simulation response 
for the coming century. Although this could be per¬ 
ceived to indicate an increase in uncertainty, it is more 
accurately converting an “unknown unknown” into 
a parametric uncertainty. If different models include 
different components of the earth system in their 
models, it will also become more difficult to compare 
them on a like-with-like basis, as is mostly possible 
today. However, this underlines the importance with 
each generation of climate models of recognizing the 
uncertainties associated with what is omitted, as well as 
those arising from the simulations themselves. 

Although there may be some use in overall metrics 
of model skill [55], it is likely that projections of spe¬ 
cific phenomena will benefit from tailored metrics to 
rank the performance of different models (e.g., El Nino 
or future sea ice extent). This will also require a proper 
assessment of which subset of models to use for each 
particular application based upon both past model skill 
and physical plausibility [31]. In addition, the commu¬ 
nity may benefit more from a diverse range of model 
predictions, where each model may be evaluated on its 
own performance, in place of a group of models which 
are artificially clustered toward a mean response leav¬ 
ing no way of simulating the extremes or boundaries of 
future climate change. 

In this entry both multi-model ensembles and 
perturbed physics ensembles have been discussed, but 
there is little discussion on how the information from 
the two may be combined. Indeed, at present there is 
little to no literature on how one may combine the 
parametric uncertainty sampled in a PPE with the 
inter-model systematic differences in a multi-model 


ensemble. This presents a fundamental problem in 
that current PDFs from both of these techniques can¬ 
not incorporate the best estimates of systematic and 
parametric uncertainty. Future analyses must combine 
these various uncertainties in order to make statements 
about model robustness. Currently, the ability to con¬ 
duct such an analysis is limited because only a small 
subset of the models in the CMIP-3 archive have pro¬ 
duced a perturbed physics ensemble, and for those 
ensembles which do exist, the experiments have not 
been conducted in any coordinated fashion. 

Despite all of the challenges associated with com¬ 
bining and interpreting results from multiple climate 
models, the presence of coordinated ensembles of pro¬ 
jections provides an invaluable insight into the magni¬ 
tude of some of the uncertainties which are inherent in 
every simulation conducted, and the ensemble pro¬ 
vides a unique opportunity to understand why models 
differ. As time goes on, the length of good quality 
observations will increase allowing better evaluation 
of the transient behavior of the models (a better metric 
for future transient response than those based upon the 
model simulation of the base climate [56]). In addi¬ 
tion, as more components of the climate system are 
simulated, although model convergence is not expected 
(at least in the short term), one can be confident that at 
least “unknown unknowns” in future predictions can 
be represented in the form of parametric uncertainty. 

Finally, possibly the greatest single uncertainty in 
future climate remains that of human behavior. Cer¬ 
tainly in the case of the CMIP-3 ensemble, the spread in 
twenty-first century simulations due to different emis¬ 
sion scenarios generally exceeded that of the inter¬ 
model spread to any particular scenario. Simple models 
of the climate have already been coupled to socioeco¬ 
nomic models [57-59], but little progress to date has 
been made in coupling socioeconomic models to 
GCMs. As a result, potential complex feedbacks 
between climate change and human behavior have 
not been sampled in any systematic framework. Nev¬ 
ertheless, although an integrated treatment of uncer¬ 
tainty in future climate projections may seem some way 
off, the use of multi-model ensembles will continue 
to frame at least some of those uncertainties in 
a systematic framework, providing a robustness which 
would be impossible with any single model, however 
complex that model may become. 
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This section focuses on ideas to deliberately remedy 
anthropogenic climate change, either by actively 
removing greenhouse gases from the atmosphere or 
by decreasing the amount of sunlight absorbed at the 
Earth’s surface. The technologies discussed are com¬ 
monly grouped under the term “geoengineering,” 
defined in a 2009 report by the Royal Society as the 
“deliberate large-scale manipulation of the planetary 
environment to counteract anthropogenic climate 
change.” Geoengineering methods can be contrasted 
with more conventional approaches to mitigating cli¬ 
mate change that involve reducing the emissions of 
greenhouse gases, especially carbon dioxide (C0 2 ). 
However, there is some overlap as enhancing the sinks 
of greenhouse gases, for example, by afforestation, can 
be described as both geoengineering and mitigation. 
(The Intergovernmental Panel on Climate Change 
(Working Group III) states that “mitigation means 
implementing policies to reduce greenhouse gas emis¬ 
sions and enhance sinks' [our emphasis].) 

Failure by the international community to make 
substantive progress in reducing C0 2 emissions, 
coupled with recent evidence of accelerating climate 
change, have brought urgency to the search for addi¬ 
tional means of tackling climate change. This has fueled 
much recent debate about geoengineering and a 
flurry of mostly model-based research studies. There 
is widely expressed concern that undertaking, or even 
discussing, geoengineering poses the “moral hazard” of 
reducing efforts to tackle the root cause of climate 
change, namely, greenhouse gas emissions. However, 
few of those researching geoengineering advocate it as 
an alternative to reducing greenhouse gas emissions. 
Instead, current discussions are usually framed in 
terms of the possible use of geoengineering in addition 
to reducing greenhouse gas emissions in order to limit 
the magnitude of climate change, for example, to stay 
within the widely discussed policy “threshold” of lim¬ 
iting global warming to 2°C above preindustrial. 

The technologies discussed herein can be subdivided 
into those involving carbon dioxide removal (CDR) 
from the atmosphere, and those involving reflecting 
sunlight, referred to by the Royal Society as “solar 
radiation management” (SRM) (although the word 
“management” implies a high level of understanding 
of the system in question that is probably not justified 
for the climate). The distinction can also be thought 


about in terms of wavelengths of radiation; short¬ 
wave geoengineering tries to reduce incoming sunlight, 
while long-wave geoengineering tries to reduce the 
return flux of heat to the surface from the increased 
blanket of heat-absorbing gases in the atmosphere. 
Sunlight reflection can never perfectly counterbalance 
an increased greenhouse effect because the two types of 
downwelling radiation have different spatial and sea¬ 
sonal patterns. 

The section presents some of the more widely 
discussed geoengineering options, without being com¬ 
prehensive. Here, we try to fill some of the gaps as well 
as introduce the entries herein. 

Sunlight Reflection (SR) 

Sunlight reflection can act rapidly to cool the climate 
if deployed on a sufficiently large scale. It could, in 
principle, be deployed to return the Earth to its 
preindustrial temperature, to hold it at some level of 
warming that has already occurred, or to lower future 
global warming. A popular framing is that sunlight 
reflection could “buy time” for decarbonising the 
economy and allowing greenhouse gas concentra¬ 
tions to stabilize and then come down. Alternatively, 
potent methods of sunlight reflection might be 
reserved for use “in emergency” should dangerous 
climate change become apparent, but it is as yet 
unproven that such a deployment after a threshold 
had been passed would prevent the change that was 
already underway. 

Methods of sunlight reflection can be distinguished 
in terms of the altitude at which they are applied. They 
start outside the Earth’s atmosphere with the idea of 
placing mirrors or sunshades in space to reduce the 
amount of sunlight reaching the Earth (D. J. Lunt 
“► Sunshades for Solar Radiation Management”). To 
counteract a doubling of atmospheric carbon dioxide 
would require a roughly 2% reduction in the amount of 
sunlight reaching the top of the Earth’s atmosphere. To 
achieve this, one proposal is to put tiny “flyers” (each 
about 0.3 m 2 ) between the Earth and the Sun at the first 
Lagrangian (LI) point. The roughly 5 million km 2 of 
sunshade required at this distance (around 1.5 million 
km) means the method will probably remain in the 
realm of science fiction for the foreseeable future. How¬ 
ever, the model studies that have been conducted to 
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examine the effects on the climate provide useful infor¬ 
mation on the effects of a uniform reduction in incom¬ 
ing sunlight, which might be achieved by other means. 

Currently, the leading candidate for geoengineering 
a reduction in incoming sunlight is to inject tiny par¬ 
ticles into the Earth’s stratosphere which will scatter (or 
in some cases absorb) sunlight (B. Kravitz “► Strato¬ 
spheric Aerosols for Solar Radiation Management”). 
(The stratosphere is the thermally stratified layer of 
the Earth’s atmosphere between about 10 and 50 km 
altitude, separated from the well-mixed troposphere 
below by the tropopause.) 

A natural analogue here is a volcanic eruption, such 
as Mt. Pinatubo in 1991, which injected sulfate aerosol 
into the stratosphere and measurably cooled the cli¬ 
mate (by around 0.5°C in the following year, followed 
by a tailing off). The stable thermal structure of the 
stratosphere means that aerosols stay aloft for much 
longer than in the troposphere below, and therefore, 
a much smaller loading of particles is required to have 
a given cooling effect. An estimated 1.5-5 MtS year -1 
would need to be deliberately injected into the strato¬ 
sphere to offset a doubling of C0 2 , which is much less 
than the 50-100 MtS year -1 that human activity cur¬ 
rently adds as pollution to the troposphere. The main 
unanswered questions surround how to inject sulfate 
aerosols into the stratosphere in a way that stops 
them from coagulating, because if the tiny particles 
combine to become larger particles, this can pro¬ 
foundly alter their radiative properties, at the extreme, 
turning a cooling effect into a warming effect. Other 
types of aerosol, notably soot, and engineered metal 
nanoparticles are also being discussed as possible can¬ 
didates for stratospheric injection. 

Moving down into the lower atmosphere, clouds 
are a major contributor to the reflectivity of the planet, 
and low-level clouds in particular have a net cooling 
effect at the surface. Hence, it has been proposed to 
cool the planet by making marine stratocumulus 
clouds more reflective (S. H. Salter “► Solar Radiation 
Management, Cloud Albedo Enhancement”). This can 
be achieved by distributing the same amount of cloud 
water over more but smaller droplets, which requires 
a source of the tiny particles known as cloud conden¬ 
sation nuclei on which water condenses. Sea salt is 
the most obvious and ubiquitous candidate aerosol, 
and a means of spraying it from the ocean using 


wind-powered boats with Flettner rotors is described 
by Salter. An alternative is to enhance the source of 
cloud condensation nuclei from the biological produc¬ 
tion and air-sea exchange of the gas dimethyl sulfide. 
The methods may also suppress rainfall, increasing the 
lifetime of clouds and giving a further cooling effect. 
They should be most effective far from sources of 
human pollution (which also provide condensation 
nuclei), such as in the Southern Ocean. However, this 
means the cooling effects on the climate would inevi¬ 
tably be patchy, which in turn can cause unexpected 
climate changes far away, for example, in some model 
simulations, the Amazon gets drier. 

Finally, a number of proposals have been made to 
enhance the reflectivity of the Earth’s surface, focusing 
on deserts, grasslands, croplands, and human settle¬ 
ments. Here, the total area altered and the change in 
surface reflectivity (albedo) are the key determinants of 
the global cooling effect. As the land comprises only 
29% of the Earth’s surface, to achieve a significant 
global cooling effect would require major overcooling 
of parts of the land. Instead, these approaches are best 
thought of as means of achieving significant localized 
cooling. In the case of more reflective croplands, 
cooling effects would be largest just before the crop is 
harvested, so this could provide a means of, for exam¬ 
ple, cooling future European summers. However, there 
remain unanswered questions regarding the effects on 
crop yield. Reflective roof surfaces are now legislation 
in California and more widespread adoption of this 
approach could become an important adaptation strat¬ 
egy in tackling urban heat islands, even though its 
global cooling effect will be negligible. 

Carbon Dioxide Removal 

Current total C0 2 emissions from fossil fuel burning, 
land-use change, and cement production are rapidly 
approaching 10 billion tons of carbon per year 
(~10 PgC year -1 ). Just to stop C0 2 concentration 
from rising in the atmosphere, the net anthropogenic 
source of C0 2 has to be reduced first by about 50% to 
match natural sinks and, then on, down to zero as 
the natural sinks decay. To help achieve this, carbon 
dioxide removal (CDR), and subsequent storage, is 
a clear complement to reducing C0 2 emissions. 
Doing both together could stabilize atmospheric C0 2 
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concentration sooner and at a lower level. Ultimately, 
CDR could be used to bring the concentration of 
atmospheric C0 2 down faster than natural sinks, to 
whatever is deemed a safe level. However, like reducing 
emissions, CDR will act relatively slowly to alter the 
rate and magnitude of climate change when compared 
to potent methods of sunlight reflection. To achieve 
significant global C0 2 removal, even with the most 
effective CDR methods, will require global deployment 
for decades. Furthermore, the Earth system actually 
works against deliberate CDR by always trying to main¬ 
tain a balanced apportioning of C0 2 between the 
ocean, atmosphere, and land surfaces. Hence, if C0 2 
is removed from the atmosphere, some leaks out from 
the ocean and/or land to partly compensate, meaning 
that the effect on atmospheric C0 2 concentration 
decays over time. (This is simply the opposite of the 
well-known land and ocean carbon sinks, which are 
generated by the addition of C0 2 to the atmosphere.) 

Carbon dioxide removal covers a wide range of 
methods and pathways to storage. C0 2 can be removed 
from the air by photosynthesis (by plants, algae, or 
cyanobacteria) or by physical and chemical means, 
which are related to natural weathering reactions. The 
ease of removal varies with the pathway. The carbon 
may ultimately be stored as liquid C0 2 (in geological 
reservoirs or the deep ocean), in charge-balanced solu¬ 
tion in seawater, as carbonate rocks, as charcoal, or as 
buried or standing biomass. The different forms of 
storage have differing stability in terms of thermody¬ 
namics, kinetics of reactions, and ease with which C0 2 
might be returned to the atmosphere. 

Photosynthesis is effectively solar-powered carbon 
capture for free (to us), although it is remarkably inef¬ 
ficient (~0.5% efficiency, compared to solar photovol¬ 
taic cells capable of ~20% efficiency), and biomass is 
the least stable form of carbon storage because it is 
a source of energy to other organisms. However, in 
the ocean, some of the carbon fixed in photosynthesis 
can sink to depths where it has a lifetime of up to 
1,000 years. The amount exported to depth depends 
crucially on the supply of limiting nutrients to 
the surface ocean. Hence, several geoengineering pro¬ 
posals consider adding limiting nutrients, especially 
iron, to the surface ocean to stimulate biological pro¬ 
ductivity (P. W. Boyd “► Ocean Fertilization for Seques¬ 
tration of Carbon Dioxide from the Atmosphere”). 


Iron fertilization is unusual in being a form of 
geoengineering that can draw on a series of 12 experi¬ 
ments to investigate its biogeochemical consequences 
(though not geoengineering specifically). However, 
what those experiments have shown is that it is remark¬ 
ably difficult to increase the sinking flux of carbon to 
the deep ocean. Model studies add that even with global 
iron fertilization maintained for a century, the poten¬ 
tial impact on atmospheric C0 2 concentration is mod¬ 
est, lowering it by, at most, around 30 parts per million. 

Land-based photosynthesis has the potential to 
fuel larger carbon dioxide removal fluxes, despite the 
smaller surface area of the land compared to the ocean 
and the need not to take land from natural ecosystems 
or to interfere with food production. The potential is 
greater because of the high productivity that can be 
achieved on land. The simplest method is afforestation, 
but it only works as a means of carbon dioxide removal 
if the conversion to forest is permanent and the carbon 
that is lost as trees decay is replaced by new trees. Still, 
afforestation is already underway at a global scale, with 
around 250 million hectares having been planted in 
recent decades, and this is creating a sink of circa 
0.3 PgC year -1 , canceling roughly 3% of current total 
C0 2 emissions. 

Carbon dioxide removal can also be achieved by 
converting waste biomass from farming and forestry 
into longer-lived forms for storage, although the long¬ 
term potential will depend more on the supply of 
deliberately cultivated biomass energy crops. Several 
conversion pathways are available. Biomass energy 
combustion coupled with capture and storage of the 
C0 2 given off (often referred to as BECS or BECCS) is 
more cost-effective than chemical methods of C0 2 air 
capture, although there are still energy penalties in 
capturing and compressing C0 2 . Fermentation of bio¬ 
mass, for example, to produce liquid biofuels, yields 
a near-pure stream of C0 2 reducing the capture cost. 
Alternatively, pyrolysis of biomass (in the presence of 
little or no oxygen) produces charcoal, which can be 
returned to the soil as biochar (S. Shackley et al. 
“► Biochar, tool for climate change mitigation and 
soil management”). Although the energy yield from 
pyrolysis is somewhat less than from combustion, 
biochar has a range of cobenefits, including improving 
soil water retention and fertility, which make it an 
attractive option. Energy remains in the biochar, but 
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it is hard for organisms to break the material down, 
making much of it long-lived in soil. 

Carbon dioxide can also be captured from the air by 
chemical means, using, for example, a strong alkali 
solution. When C0 2 has been captured into concen¬ 
trated form in this way, or from combusting or 
fermenting biomass, it can be stored in liquid form. 
However, it is safer and more permanent to neutralize 
carbonic acid to form carbonate rocks or aqueous 
bicarbonates by mirroring natural rock weathering 
processes. Carbonate weathering brings C0 2 into 
solution as bicarbonate (although, ultimately, on 
a ~ 10,000-year timescale when carbonates are 
redeposited in the ocean, the C0 2 will be returned to 
the atmosphere). Silicate weathering followed by car¬ 
bonate deposition is a permanent removal process for 
atmospheric C0 2 . However, these reactions are gener¬ 
ally rather slow, even when applied to a C0 2 -rich gas 
stream. Heating to speed up the reactions is too costly; 
hence, better methods for accelerating carbonation are 
needed, and some work on using electrochemical 
energy to accelerate the reactions is underway. 

Finally, C0 2 can be removed from the air by 
mining, crushing, and spreading on the land silicate 
minerals that weather rapidly such as serpentine or 
olivine (R. D. Schuiling “► Carbon Dioxide Sequestra¬ 
tion, Weathering Approaches to”). This represents 
a direct attempt to accelerate silicate weathering, 
which should be most effective in wet regions of the 
tropics under vegetation, because plants and their asso¬ 
ciated mycorrhizal fungi produce organic acids that 
accelerate dissolution. Olivine has relatively large 
deposits in the tropics, making it a promising candi¬ 
date, although estimates are needed of the energy and 
C0 2 costs of mining, grinding, and distributing the 
rock on appropriate land. Weathering rates can be 
limited by factors other than substrate supply, and 
current estimates suggest this CDR method may be 
limited to at most around 1 PgC year -1 . 

Broader Issues 

The concept of geoengineering and the particular pro¬ 
posals, both CDR and SRM, provoke a plethora of 
social concerns ranging from specific questions of eco¬ 
nomical or political feasibility for each proposed 
method to broader ethical and philosophical debates 
about our relationship with nature. 


Efforts are underway to establish a framework for 
governing geoengineering, particularly SRM. This 
poses a host of questions, relating both to research 
and possible implementation (S. Low et al. 
“► Geoengineering Policy and Governance Issues”). 
Historical efforts at weather modification provide 
some past precedent (albeit at a smaller scale), but 
they offer little in the way of existing governance 
frameworks to draw on. Given the rapidly evolving 
nature of the field, flexibility and adaptability will 
be key requirements for whatever governance frame¬ 
work emerges. 

The use of any geoengineering method as 
a response to climate change will ultimately be made 
by societies. Therefore, the public perception of, and 
engagement with, this group of emerging technologies 
is of critical importance in determining their future 
usefulness. Concepts of upstream engagement and 
responsible innovation can help incorporate a range 
of societal values into research at an early stage 
(and they are both, in different ways, built into two 
current UK Research Council-funded projects on 
geoengineering). 

Early public engagement (e.g., “Experiment Earth” 
conducted by the UK Natural Environment Research 
Council) has already yielded unexpected responses to 
geoengineering. In particular, the “moral hazard” argu¬ 
ment that geoengineering will suppress efforts to 
reduce greenhouse gas emissions has not been clearly 
borne out. Some members of the public responded to 
information on geoengineering with the opposite 
response; that if things are so bad that scientists are 
considering geoengineering, then efforts to reduce the 
root cause of climate change, namely, greenhouse gas 
emissions, must be strengthened. 

Moving forward, the ethical and philosophical 
debate about geoengineering needs to distinguish 
SRM and CDR techniques. SRM only deals with the 
symptoms of climate change, notably rising tem¬ 
peratures. CDR, on the other hand, like conventional 
emissions reduction, tackles the root cause of cli¬ 
mate change: rising greenhouse gas concentrations. 
Specific methods still raise specific concerns. But 
as research on geoengineering continues to escalate, 
we hope this section provides a stimulating 
introduction to the methods and the debates 
surrounding them. 
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Glossary 

Climatology Reference period used to describe the 
characteristics of the climate, such as the mean 
annual cycle, or the expected statistics of weather 
or of year-to-year climate variability. The World 
Meteorological Organization recommends the 
most recent three full decades; e.g., in 2009, the 
WMO climatology period would cover 1971- 2000. 

External forcing Factors that influence the climate 
system but are not explicitly driven by the climate 
system, such as human emissions of greenhouse 
gases, changes in the sun s radiation, and volcanic 
emissions. 

Forecast The guidance offered by a forecaster or fore¬ 
cast center on the future climate conditions. 
A forecast could be based on a single prediction, 
but typically is a distilled product that involves 
recalibrated model predictions and often multiple 
prediction inputs. 

Internal variability The chaotic evolution of a fluid, 
such as the ocean or atmosphere, due to nonlinear 
dynamics that are sensitive to small uncertainties or 
variations in initial conditions. Depending on time- 
scale, internal variability may refer to that generated 
internally to the atmosphere, to the ocean, or due to 
ocean-atmosphere interaction. It is the part of the 
seasonal-to-decadal climate that is not determinis¬ 
tically predictable. 


Prediction The future climate conditions indicated by 
a single prediction model, which could be statistical 
or dynamical. These differ from climate change 
projections in that information of the climate state 
at or near the initial time of the forecast is highly 
relevant to its future evolution. 

Teleconnections Climate variability in one region that 
is driven remotely by climate variability in another 
region. This typically refers to regional patterns of 
climate anomalies over land and/or oceans that 
result from specific ocean phenomena, such as dur¬ 
ing El Nino events. 

Definition of the Subject and Its Importance 

Seasonal-to-decadal climate prediction seeks to quan¬ 
tify the likely evolution or change of the climate system 
over a specific time horizon of months to years. 
Climate predictions based on dynamical models incor¬ 
porate all relevant processes to the extent possible, 
including anthropogenic climate change, but most 
importantly those processes that govern the likely evo¬ 
lution of natural climate variability. The predictions, if 
well calibrated, describe the probability of a given mag¬ 
nitude of change in the mean climate or changes in the 
characteristics of the weather over the forecast period. 
For example, a seasonal forecast for next winter might 
indicate a greater likelihood for the seasonal mean 
temperatures to be colder than usual, or might indicate 
the likelihood for more frequent storms over the 
3-month period. 

Seasonal prediction is a fairly well-established 
enterprise with a number of forecast centers around 
the world issuing real-time seasonal predictions-based 
dynamical models [1]. Increasingly, national meteoro¬ 
logical and hydrological services create seasonal fore¬ 
cast products based on their own statistical or 
dynamical prediction tools and/or incorporate predic¬ 
tions from the international centers. Decadal predic¬ 
tion is a much newer endeavor and is still considered 
experimental [2]. Only a few groups have attempted 
decadal-scale climate predictions intended to capture 
the evolution of natural decadal variability for the 
coming decade from a global circulation model [3-5], 
and although the results indicate there may be 
added information from these predictions relative to 
the more familiar climate change projections of the 
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Intergovernmental Panel on Climate Change (IPCC), it 
is not clear that the added information results from 
better prediction of the decadal-scale climate variabil¬ 
ity that would exist even in the absence of increasing 
greenhouse gases. 

Climate forecasts are potentially valuable to society 
on seasonal-to-interannual timescales to inform 
resource management, planning decisions, and on 
decadal timescales to inform longer-term plans and 
infrastructure investment. Even in the climate change 
context, decadal prediction could prove important, 
as the climate experienced regionally for the coming 
decade(s) will likely be some combination of anthro¬ 
pogenic climate change and natural decadal variability. 
Decisions and investments related to climate change 
adaptation typically apply to the next 10-20 years into 
the future, rather than 80 years into the future. Thus 
better information on evolution of the climate and 
changes in risks of climate extremes can lead to more 
appropriate planning. However, climate predictions 
are necessarily probabilistic, and in the case of decadal 
predictions are yet to be established as skillful. Thus it is 
important that decision systems be designed and opti¬ 
mized to account for the inherent uncertainty in future 
climate, that can still allow benefits to be realized in 
times of favorable climate and losses to be mitigated in 
times of adverse climate. 

Introduction 

Climate varies on all timescales, from seasonal varia¬ 
tions to millennial ice ages. Prediction of the climate at 
timescales that are relevant to societal decisions, but 
extending beyond weather forecasts, has been roughly 
broken into three classes: seasonal-to-interannual pre¬ 
diction that addresses the changes in seasonal climate 
and its weather characteristics a couple months to 
a year in the future, decadal prediction, sometimes 
referred to as near-term climate change prediction 
that addresses changes in the mean climate and its 
characteristics for a couple years to a couple decades 
into the future, and climate change projections that 
consider changes in the mean climate and its charac¬ 
teristics 50-100 years in the future. 

The seasonal-to-interannual timescale dominates 
the climate that is experienced locally. On a local- 
to-regional scale, year-to-year variability almost always 


explains the majority of the variance in the observed 
climate (e.g., Fig. 1). Year-to-year variability is where 
most impacts are experienced. However, it is the super¬ 
position of the three climate timescales that can lead to 
changes or trends in the frequency of adverse years. 
Extreme examples are potentially the protracted 
drought conditions in the western United States from 
the mid-1990s to the early twenty-first century [6], the 
2003 European heat wave [7], or the extremely active 
hurricane season 2005 [8], which was accompanied by 
many land-falling hurricanes in the United States such 
as Katrina. 

The primary difference between prediction of cli¬ 
mate variability on different timescales is the drivers, or 
phenomena, associated with those impacts. This leads 
to differences in the way prediction systems are 
designed to predict the climate fluctuations and asso¬ 
ciated impacts on different timescales. Seasonal-to- 
interannual prediction is an initial value problem; by 
initializing the climate system close to the observed 
state at the beginning of the prediction, a dynamical 
model will aim to capture the likely evolution of the 
climate system. At the other end of the time spectrum, 
climate change projection is a boundary value problem, 
which means that the driver of the climate change is 
external to the climate system and imposed upon it. 
Anthropogenic increases in greenhouse gases are due to 
man s activities and are not part of the natural climate 
system. Climate change projections depend on cor¬ 
rectly projecting the changes in the Earth’s atmospheric 
composition and the subsequent changes in the Earth’s 
energy balance. Decadal prediction lies at the intersec¬ 
tion between seasonal-to-interannual prediction and 
climate change projection; it is an initial value as well 
as a boundary value problem. Decadal prediction 
depends both on initializing the climate system close 
to the observed state, especially the slowly evolving 
components, and on correctly representing the changes 
in Earth’s energy budget. 

This is not to say that predictions on longer time- 
scales do not contain the higher frequency phenome¬ 
non. However, there is a predictability limit for natural 
climate variability, which refers to how far into the 
future some aspect of climate variability can be 
predicted before the uncertainty, or range of possibili¬ 
ties, approaches the climatological uncertainty. At that 
point little to no predictive information remains. 
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Climate Predictions, Seasonal-to-Decadal. Figure 1 

Example of simple decomposition of (a) temperature and (b) precipitation averaged over the state of Colorado in the 
United States. The top panels (black) show the observed annual mean time series. The second panels (red) represent 
"climate change" time series, in which the climate changes are consistent with the globally averaged temperature, 
obtained by decadally filtering the time series and regressing it against a similar low-frequency filtered time series of 
globally averaged temperature. The third panels (green) represent the "natural decadal variability," which is low-frequency 
time series that is not coincident with globally averaged temperature changes, obtained as the difference between the 
low-frequency filtered time series and the "climate change" time series. The bottom panels (blue) represent the year-to- 
year variability on top of the low-frequency changes, which is the difference between the full time series and the 
low-frequency filtered time series. Note that there is no attribution to anthropogenic changes or physical phenomena in 
any of these time series. Details are likely to change with different filtering parameters and with different approaches to 
estimate global warming 


The limit of predictability is not necessarily a fixed 
quantity. It changes with the phenomenon, but also 
changes with time, meaning that at some times 
a phenomenon will be more predictable than others 
and thus the evolution can be predicted farther into the 
future. It is not possible to determine what the true 
limit of predictability is or should be [9] . The model(s) 
that can predict the phenomenon with the greatest 
fidelity when compared to observations over some 


long history containing many realizations of the phe¬ 
nomenon determine the current limit of predictability. 

In order to make a prediction one must first deter¬ 
mine what is to be predicted. If the aim is to predict 
local-to-regional scale climate over land, one must 
know the driver of that climate variability. Numerous 
research and prediction studies have demonstrated 
that it is the large-scale variability in the pattern of 
surface temperature, and in particular the sea surface 
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temperatures that drive the predictable aspect of 
changes in the atmospheric circulation and thus 
regional temperature and precipitation. But what 
drives that sea surface temperature variability? The 
sea surface temperatures must be predicted if it is 
hoped to predict the associated terrestrial climate 
impacts. Once the ocean phenomena or processes rel¬ 
evant to sea surface temperature variations are identi¬ 
fied, the climate models must be capable of simulating 
those. Furthermore, if the prediction of some phenom¬ 
enon from a particular model is to provide actionable 
information, then the phenomenon must be predict¬ 
able above the other ongoing processes in the climate 
system; in other words, the signal of the phenomenon 
must be predictable above the background noise of 
the climate system. In the next section, an example of 
this process of identification, model validation, and 
prediction based on the El Nino-Southern Oscillation 
(ENSO) phenomenon and seasonal climate prediction 
is presented. 

Brief History of ENSO Prediction : Impacts of the 
ENSO phenomenon have been experienced for centu¬ 
ries, long before the phenomenon itself was identified. 
The peoples of Peru used the term El Nino to refer to 
the expected changes in the local climate and fish stocks 
associated with a seasonal reversal of the current system 
off the coast of western South America, because these 


changes occur near the end of the year at a time near 
Christmas (El Nino is Spanish for the Christ child). 
However, they also noted that warm seasonal waters 
associated with the change of currents, would occa¬ 
sionally be very warm and would also bring abundant 
rainfall. It is these extreme years, which recur about 
every 3-7 years that are now called El Nino events. 
Farmers in drought-prone regions of the Andes even 
developed a method to predict the coming of the 
increased rainfall during these events by monitoring 
the visibility of a star in the Pleiades constellation 
[10]. What they were observing was the shift of con¬ 
vection from the western tropical Pacific into the cen¬ 
tral Pacific in concert with the development of an El 
Nino event (Fig. 2). 

Sir Gilbert Walker could be said to be the pioneer of 
seasonal forecasting as he sought to quantify the atmo¬ 
spheric component of ENSO, the Southern Oscillation, 
and its relationship to regional climate variability, such 
as the devastating droughts in India [11]. To accom¬ 
plish this he examined correlations between 32 stations 
across the world for fields of sea level pressure, temper¬ 
ature, rainfall, and riverflow. He discovered that nega¬ 
tive excursion of the Southern Oscillation Index was 
associated with increased likelihood for drought over 
India; his empirical model has not been much 
improved upon over the last century for that region. 
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Climate Predictions, Seasonal-to-Decadal. Figure 2 

Schematic drawing of the tropical Pacific ocean-atmosphere state during (a) average or neutral conditions in which trade 
winds blow east to west, pushing warm surface waters to the western Pacific, which pushes down the thermodine 
(separation between warm upper ocean and cold deep ocean) and concentrates the deep convection in the western 
Pacific; (b) El Nino conditions in which the thermodine becomes deeper in the eastern Pacific and warm water moves 
westward, which weakens the east-west Trade Winds and allows the convection to move into the central Pacific (Source: 
http://www.tao.noaa.gov/proj_overview/tao_tour_ndbc.shtml) 
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Researchers have continued to improve upon the 
foundation that Walker laid for ENSO teleconnections 
(Fig. 3). Maps that show significant correlation 
between regional temperature and precipitation 
changes to ENSO events for specific 3-month average 
seasons are widely used to illustrate ENSO’s global 
reach [12, 13]. However, these teleconnection patterns 
represent expectations based on statistics and are not 
guaranteed to occur in any specific event; the probabi¬ 
listic likelihood of a regional impact [14] is a further 
refinement of the climate anomalies due to ENSO, and 
something that climate prediction models should be 
expected to replicate in their ensemble distributions 
over time. 

It was not until the second half of the twentieth 
century that researchers discovered that the Southern 
Oscillation was associated with changes in the large- 
scale sea surface temperature pattern over the tropical 
Pacific; it was the coupled interaction between the 
east-west sea surface temperature gradient and the 
low-level winds between the high and low pressure 
centers of the Southern Oscillation that led to the 
growth of Nino events [15]. It was soon after recog¬ 
nized that the change in the winds due to the changes in 
sea surface temperatures, associated with the Southern 
Oscillation, modified the distribution of the upper- 
ocean mass field below the surface [16], and that the 
adjustment of these perturbations to the mass field 
could lead to the eventual decay of the El Nino event 
and possible initiation of the opposite phase, La Nina. 

In the 1980s simple dynamical models [17-19] were 
developed that simulated the coupled air-sea processes 
central to ENSO and reinforced the theory that had 
been informed primarily by observations. The first 
experimental El Nino forecast was published in 1986 
[20], using one of these simplified dynamical models. 
Since then more complex models have been built that 
capture not only the dominant processes behind ENSO 
but also provide a more complete representation of 
the climate system to better capture uncertainties in 
ENSO. These models also simulate the atmospheric 
teleconnections that lead to changes in sea surface 
temperatures of other ocean basins and to changes 
in the terrestrial climate. These are the impacts that 
served as the initial motivation for the study of 
ENSO. Finally, through the process of identification 
of a primary driver of seasonal climate variability and 


the dominant physical processes behind it, and the 
development of models that could simulate and predict 
this driver and its teleconnections, seasonal prediction 
was born. 

The prediction of decadal-scale climate variability 
is a much more recent endeavor. Although research on 
decadal climate variability through the use of observa¬ 
tions and models is not new [21-26], a community¬ 
wide effort in this area is new. The motivation to 
predict decadal climate variability has arisen in part 
from a desire to use the climate change projections 
that appear in the Working Group 1 report of the 
Intergovernmental Panel on Climate Change [27] to 
inform sectoral decision making, [28] as well as plans 
and investments toward climate change adaptation. 
For these societal needs, climate information for the 
next 5-20 years becomes more relevant than that for 
the next 100 years. The other side to the motivation 
behind experimental decadal predictions is the realiza¬ 
tion that there are processes inherent in the natural 
climate system evolving at decadal-to-multidecadal 
timescales, and the mounting evidence that dynamical 
models have some ability to simulate some aspects of 
the observed variability [29, 30]. 

As the successes and failures in climate prediction 
are considered, it must be borne in mind that climate 
predictions are necessarily probabilistic. They indicate 
the likelihood of a range of possible outcomes. The 
magnitude of this range of outcomes, often referred 
to as the uncertainty or probability distribution, is 
sensitive to uncertainties in the initial conditions 
from which the predictions evolve, to uncertainties in 
external forcings, and to errors in prediction models. 
The value about which the uncertainty is centered is 
sensitive to the external forcings and to information in 
the initial conditions that may lead to specific, robust 
evolution of the climate system. Particularly in the case 
of decadal prediction, which is still in the experimental 
phase, success refers to relative performance, or agree¬ 
ment between prediction and observations, compared 
to the state of predictions beyond the seasonal time- 
scale, namely, climate change projections. In other 
words, much of the judgment of decadal prediction in 
these early experiments focuses on the added forecast 
quality from the initial conditions relative to that from 
the boundary conditions, or external forcing. There¬ 
fore, success in the eyes of the climate community may 
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not constitute information that is accurate enough or 
specific enough to be actionable. 

Although both seasonal prediction and decadal 
prediction experiments, and climate change projec¬ 
tions for that matter, use the same type of dynamical 
models, substantial differences exist between their 
application for these different timescales of prediction. 
The following sections contain discussions of Drivers 
of Variability, Model Fidelity, Prediction Systems, and 
Internationally Coordinated Efforts, first for the sea¬ 
sonal-to-interannual timescale, followed by a similar 
analysis for the decadal prediction problem. The only 
difference in the structures is that the section on 
seasonal-to-interannual prediction also contains 
a discussion of Forecast Skill. The echoed structure is 
intentional in that many of the issues and approaches 
will be similar for both timescales. However, there are 
important differences in what is known about the 
drivers of climate variability at these different time- 
scales as well as differences in the maturity of the 
prediction systems. 

Seasonal-to-lnterannual Prediction 

Seasonal-to-Interannual Prediction: Drivers of 
Interannual Variability 

Seasonal-to-interannual prediction derives from initial 
conditions of the climate system. Unlike weather fore¬ 
casts, where the relevant initial condition is the atmo¬ 
spheric state and the sea surface temperatures are 
approximately constant, seasonal forecasts depend 
more on the initial condition of the ocean. The evolu¬ 
tion of the ocean state, particularly the density struc¬ 
ture and the currents, leads to changes in the pattern of 
sea surface temperatures that can then influence the 
atmospheric circulation. 

The dominant pattern of surface temperature vari¬ 
ability, after accounting for global warming, is that of 
the El Nino-Southern Oscillation (ENSO) (e.g., [31]). 
For this reason ENSO has received a great deal of 
attention in studies of climate prediction on seasonal- 
to-interannual timescales. Changes in winds and pre¬ 
cipitation are associated with these global temperature 
pattern changes. 

El Nino events recur about every 3-7 years on 
average, and are somewhat locked to the annual cycle 
in that they tend to develop and grow through the 


middle of the year and tend to peak near the end of 
the year. During an El Nino event when much of the 
warm water in the western equatorial Pacific moves 
eastward, the region of deep convection also moves 
eastward into the central Pacific (Fig. 2), and in some 
cases reaches as far as the coast of South America. Since 
the equatorial Pacific spans nearly half the circumfer¬ 
ence of the Earth, a shift of the largest region of deep 
convection from the far western Pacific to the central 
equatorial Pacific represents a huge spatial shift in 
where the tropical atmosphere is heated. 

In the tropics, where the effect of Earth’s rotation is 
weaker, the atmospheric response to the pattern of sea 
surface temperatures is thermally direct. The low-level 
winds converge toward the warmest water, or equiva¬ 
lently, to the region of lower pressure. This is true of the 
mean conditions as well as the anomalous conditions. 
Since the lower atmosphere is very humid in the tro¬ 
pics, the regions of converging low-level winds produce 
an upward flow of very moist air and heavy precipita¬ 
tion with a very large latent heat release to the atmo¬ 
sphere associated with water vapor condensation. Near 
the top of the troposphere, relatively dry air is expelled 
from these regions of strong deep convection, and that 
air then sinks. The sinking dry air suppresses convec¬ 
tion. The regions of warmest sea surface temperatures 
and associated strong deep convection are located typ¬ 
ically over the western Pacific warm pool and the west¬ 
ern hemisphere warm pool, which encompasses the 
northeastern tropical Pacific extending to the north¬ 
western tropical Atlantic. Variations in these warm 
pool regions have direct impacts on the climate in the 
neighboring regions, but changes in the strength and 
location of those convective centers can also impact 
regional climate remotely though changes in atmo¬ 
spheric circulation. 

The resulting changes in the atmospheric circula¬ 
tion can lead to warmer conditions in the other tropical 
oceans [32, 33] , which carry additional regional climate 
impacts. For prediction of the regional climate due to 
tropical sea surface temperature changes outside the 
Pacific, it is important to be able to predict those sea 
surface temperatures. For example, the tendency for 
northeastern Brazil to be drier than normal during an 
El Nino event (Fig. 3) is due in part to the anomalous 
subsidence from the shift in deep convection over the 
central Pacific, but it is also due to associated warming 
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of the sea surface temperatures over the north tropical 
Atlantic [34]. Similarly, wetter conditions in eastern 
Africa associated statistically with El Nino events are 
now known to result from the warming of SSTs in the 
western Indian Ocean that are also associated with El 
Nino events [35]; an El Nino event that is not accom¬ 
panied by warm SST anomalies in the western Indian 
Ocean leads to drier conditions over East Africa due to 
anomalous subsidence resulting from El Nino’s 
enhanced convection in the central Pacific. 

El Nino can affect weather and seasonal climate 
outside the tropics through changes in the position 
and strength of the storm tracks. When the warm 
water that normally resides in the western Pacific 
extends across the Pacific, it changes the large-scale 
temperature differences between the tropical and the 
midlatitudes. This allows the storm track associated 
with the subtropical jet stream to strengthen over the 
central and eastern subtropical Pacific where it is usu¬ 
ally weaker and more variable. Additionally, the 
warming of the equatorial Pacific region as a whole 
allows the amount water vapor in the lower atmo¬ 
sphere to increase. The combination brings more fre¬ 
quent and stronger storms into the southern tier of the 
United States during El Nino events. This impact on 
extratropical climate is seen in the winter hemisphere 
because this is when the jet stream is strongest. So 
although a similar influence can be discerned for 
storm track headed toward South America, the impact 
is less robust, since El Nino events are typically growing 
during southern hemisphere winter in the middle of 
the year. During the northern hemisphere winter is 
closer to the time when El Nino events are mature. 

It is the large-scale changes in the patterns of 
low-level heat and moisture that drive changes in 
the atmospheric circulation. El Nino happens to be 
the dominant phenomenon influencing that and the 
focus of those changes is primarily over the tropical 
oceans. However, changes in land-surface conditions, 
such as soil moisture or ice, can also influence regional 
climate. Soil moisture influences the overlying atmo¬ 
sphere primarily through evaporation, which can then 
influence precipitation as well as near-surface air tem¬ 
perature during certain times of the year [36]. Dry soil 
conditions, and thus a reduced ability of the surface to 
cool itself through evaporation, are likely to have con¬ 
tributed to the 2003 European heat wave [37]. Changes 


in patterns, extent, and timing of snow cover can also 
impact the atmospheric circulation through changes 
in land atmosphere energy exchange and may impart 
predictability to northern hemisphere wintertime 
temperatures [38] and also the strength of the East 
Asian monsoon [39]. 

Seasonal-to-Interannual Prediction: Model Fidelity 

Once the main drivers of seasonal-to-interannual 
climate variability are identified, it is then necessary 
to ascertain whether the model to be used for 
seasonal-to-interannual prediction can replicate the 
drivers with sufficient realism. Change in patterns of 
SSTs is the dominant driver of seasonal-to-interannual 
climate variability worldwide. However, the regional 
terrestrial climate will only be predictable if the relevant 
SSTs are predictable [40]. Given that the El Nino 
phenomenon represents the majority of year-to-year 
variance in SSTs, including influencing the global 
ocean outside the tropical Pacific [41], most 
studies of the suitability of a model to predict 
seasonal-to-interannual climate will focus on the 
model’s ability to predict El Nino. Of course, such 
studies of model fidelity help further elucidate the 
processes behind such phenomena. 

The first attempt to predict El Nino employed 
a very simple model of the tropical Pacific Ocean that 
consisted of a warm, lighter, upper ocean overlying 
a cold, heavier deep ocean [17]. The depth of the 
upper layer determined the temperature at the surface 
in the eastern and central equatorial Pacific where 
upward currents are known to bring cold water from 
the deeper ocean into the upper layer and cool the 
surface; the more shallow the upper layer, the easier 
for the upwelling currents to bring cold water to the 
surface. The surface temperature anomalies in the east 
influence the east-west temperature gradient, which 
affects the strength of the trade winds, which affect 
the slope of the interface between the upper and 
lower ocean layers, and thus affects the eastern equato¬ 
rial surface temperature. This describes the classic 
Bjerknes feedback mechanism [15] that maintains the 
mean state as well as the coupled air-sea feedbacks that 
can evolve an El Nino or La Nina event. Off the equator 
in the western Pacific the anomalous winds create 
depth anomalies of the opposite sign to those in the 
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eastern equatorial Pacific, which can then adjust via 
equatorial wave dynamics, eventually causing the decline 
of the current event (e.g., El Nino) and potentially 
initiating an event of the opposite sign (e.g., La Nina). 
The positive feedback growth together with the delayed 
negative feedback that can cause the turnabout from one 
phase to the next was named the Delayed Oscillator 
mechanism [19]. Variants on this central idea, such as 
the Recharge Oscillator [42] have since been formu¬ 
lated as observations of the tropical Pacific became 
available [43] and as the tropical Pacific air-sea vari¬ 
ability was studied in more models. 

Although the first models to successfully predict El 
Nino in the late 1980s were very simplified compared to 
the complexity of the real ocean-atmosphere system, 
they still remain viable prediction tools. It is very diffi¬ 
cult to represent all the physical processes in the trop¬ 
ical ocean-atmosphere system precisely, and because 
of the strong interconnectedness of these processes, 
small errors in the representation of one process 
leads to associate errors in others. Thus it was not 


until the early twenty-first century that coupled 
ocean-atmosphere models of full complexity clearly 
demonstrated parity with simpler prediction models 
(Fig. 4) [44]. The metric most commonly presented 
to represent a model’s ability to predict El Nino is the 
NIN03.4 index of sea surface temperature, which is the 
average of the temperature anomaly over the central 
equatorial Pacific from 5S-5N and 170W to 120W, as 
this is the region that exhibits the highest correlation 
with terrestrial climate anomalies worldwide [45]. 

However, this simple index does not capture all 
of the characteristics of El Nino. The timing and 
spatial structure of El Nino-related sea surface temper¬ 
ature anomalies can also influence the resulting 
teleconnections [46]. Additionally, single metrics such 
as correlation or mean error can mask the conditions 
under which El Nino is predictable. Most dynamical 
systems, particularly those with a chaotic component, 
exhibit conditional predictability meaning that there 
are times when the system is more predictable than 
others [47]. Thus it is also common to present the 
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Climate Predictions, Seasonal-to-Decadal. Figure 4 

Anomaly correlation (%) by various methods of the seasonal mean Nino-3.4 SST as a function of lead (horizontal; in 
months). The results are accumulated for all seasons in the (target) period DJF 1997/1998 to DJF 2003/2004. Except for CFS 
(the Climate Forecast System) coupled ocean-atmosphere model of the National Weather Service's Climate Prediction 
Center (CPC), all forecasts were archived in real time at CPC from 1996 onward. CMPM is the previous coupled model, CCA 
is canonical correlation analysis, CA is constructed analog, CONS is a consolidation (a weighted mean), and MARKOV is an 
autoregressive method (From [44]) 
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prediction history of models, to show how observed sea 
surface temperatures along the equator vary compared 
to the predictions as a function of lead time [48]. 
A common finding from such qualitative examination 
is that although models may do well in predicting the 
occurrence of an El Nino event, they have difficulty 
predicting the magnitude of large events or locating 
the variability far enough east during strong events 
[49, 50]. Such biases have repercussions for predicting 
the associated climate anomalies. 

Predicting the driver of the climate anomalies is the 
first step. Next is to predict the associated climate 
anomalies. Biases in prediction of the drivers, such as 
El Nino events that do not exhibit the strength or 
structure of observed events, lead to biases in regional 
climate prediction. One way to circumvent some of 
the error in predicted SSTs is to statistically correct 
them before providing this information as boundary 
conditions to the atmospheric model. This approach 
is known as two-tier forecasting because the SSTs are 
predicted first and the climate is predicted second using 
an atmospheric GCM. Changes in the atmospheric 
circulation do not feed back onto the SST anomalies. 
Because El Nino is the largest driver of climate anom¬ 
alies, and El Nino teleconnections are driven by the 
ocean variability, this is a viable approach. However, 
outside the tropical Pacific a notable fraction of the 
ocean variability is driven by the atmosphere, and 
thus in those regions heat and momentum fluxes will 
not be properly represented by two-tier forecasts. 

One-tier forecasts, where the ocean and atmo¬ 
sphere evolve together, allow for a more physically 
consistent evolution of the ocean-atmosphere system. 
Coupled ocean-atmosphere models are increasingly 
the prediction tools of choice at operational forecast 
centers around the world [ 1 ]. However, due to model 
biases over some parts of the tropical ocean, regional 
climate prediction remains problematic with coupled 
models. In particular coupled models have great diffi¬ 
culty in representing the mean state of the tropical 
Atlantic, with the warmer water occurring in the west¬ 
ern instead of the eastern equatorial Atlantic [51]. As 
a result, the tropical Atlantic SST variability is not 
predicted with any skill for most seasons by the current 
generation of coupled models, and the potential 
predictability of climate variability over western Africa 
and northeastern Brazil is substantially degraded 


compared to what it would be with skillful SST pre¬ 
dictions [40]. 

Other biases that have been known for decades still 
persist in coupled ocean-atmosphere models and limit 
the quality of climate predictions. Such systematic 
biases include a double intertropical convergence 
zone over the Pacific, poor representation of regions 
of stratus clouds over the eastern subtropical and 
extratropical oceans, and vertical temperature gradi¬ 
ents that are too diffuse in the equatorial Pacific 
where the warm upper ocean transitions to the cold 
deep ocean. The processes responsible for these features 
in Nature and how they are represented in models are 
active areas of research. Recent modeling experiments 
using models with a spatial resolution of tens of kilo¬ 
meters rather than hundreds of kilometers does reduce 
some of these biases by better resolving certain climate 
processes. 

Seasonal-to-Interannual Prediction: Prediction 
Systems 

Prediction systems are based on observations, models, 
and their connection through data assimilation sys¬ 
tems. The three together form the three-legged chair 
of prediction systems [9]. Any weak leg compromises 
the system, and improvements in one leg often lead to 
improvements in the other legs. 

Predictability of seasonal-to-interannual climate 
variability arises from the initial conditions of the 
ocean, particularly those conditions in the tropical 
Pacific Ocean that carry some signal of future El Nino 
conditions. Observations of upper ocean heat content 
anomalies in the other tropical oceans are also impor¬ 
tant for prediction as they can influence the persis¬ 
tence of local sea surface temperature anomalies as 
well as moderate the impacts of El Nino-related 
teleconnections in the region. Therefore, it is important 
to adequately observe the tropical ocean state. How¬ 
ever, since models have errors in their representation of 
the real world, using the observations too faithfully to 
describe the initial conditions for model forecasts can 
cause problems. This is where data assimilation is 
essential to prediction systems. 

Data assimilation is the process used to produce 
initial conditions for a dynamical model by combining 
observations with other information from a previous 
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simulation of the model. If this is not done carefully, 
the introduction of the observations into the models 
can lead to initialization shock when the prediction is 
started. Initialization shock is a term used to identify 
the rapid development of model errors when a simula¬ 
tion is started. One approach to minimizing this prob¬ 
lem is called anomaly initialization in which observed 
anomalies of the ocean state rather than the full state 
are added to the model’s mean state to arrive at initial 
conditions. Other data assimilation methods address 
the mismatch between the spatial and temporal char¬ 
acteristics of the variability between Nature and the 
model. Currently, the atmosphere, ocean, and land 
components of prediction models are initialized sepa¬ 
rately. The data assimilation efforts are separate, and 
thus consistency in the initial states and tendencies of 
these components is not ensured. Methods to assimi¬ 
late observational data into the coupled model as 
a whole are being investigated starting with the coupled 
ocean-atmosphere system [52]. 

Recent advances in El Nino prediction skill at the 
European Centre for Medium-Range Weather Fore¬ 
casts in the United Kingdom were accomplished by 
both improvements to their model and improvements 
to the ocean data assimilation system [53]. Addition¬ 
ally, retrospective forecasts, also called “hindcasts,” of 
the NIN03.4 El Nino index from 1960 to present 
from that forecast system have demonstrated the 
value of the observations provided by the Tropical 
Atmosphere-Ocean array of data buoys that measure 
temperatures of the upper 500 m of the tropical Pacific 
Ocean, some at the equator also measuring ocean cur¬ 
rents, as well as temperature, winds, and humidity at 
the surface. At the time when the array of buoys was 
completed in the early 1990s, the forecast error of the 
NIN03.4 index dropped dramatically [54]. This result 
is most clearly demonstrated in forecasts that are initi¬ 
ated in February, when the biases in the model are at 
a minimum. This echoes the connected nature of these 
three elements of forecast systems; observations and 
their assimilation into models are crucial for predic¬ 
tion, but better models better elucidate the value of the 
observing network. 

Even at that point when models, the observing 
network, and the use of those observations for forecast 
initial conditions becomes essentially perfect, climate 
forecasts will still contain uncertainties. Small, almost 


imperceptible, uncertainties in the initial state or the 
detailed evolution of some small-scale processes will 
lead to some divergence in the future state. This is the 
chaotic element of the climate system, sometimes 
referred to as the “butterfly effect.” Where uncertainty 
is due to errors, there is the potential to reduce it. 
However, it is not necessarily the goal of forecasters to 
eliminate uncertainty, as this would be unrealistic, 
but to quantify it to the extent possible. Better models 
that can capture the random nature of processes, 
such as turbulence or convection, would improve 
process-related contributions to uncertainty. Better 
representation of such processes may actually increase 
the uncertainty in forecasts, relative to what models 
now indicate. Better observations, more complete 
observational networks, and improved data assimila¬ 
tion techniques can better indicate the uncertainties 
that arise from initial conditions [55, 56]. 

The uncertainty in climate forecasts should thus be 
considered as a range of possible outcomes. Typically 
the range of possible outcomes, or probabilities, are 
presented relative to the past climate history of the 
last several decades. A common format used by many 
operational forecast centers is tercile classes. For exam¬ 
ple, the precipitation for a given location over the last 
30 years is used to quantify the above-normal category 
as the wettest 10 years, the below-normal category as 
the driest 10 years, and the near-normal category as 
those in between. In this case, the climatological prob¬ 
abilities are 33.3% for any category without any further 
knowledge. This should be the forecast probability for 
each category if there is no signal in the current pre¬ 
diction or if the prediction tools have no skill in that 
region and/or season. If skill and signal exists, then the 
forecast probabilities will differ from the climatological 
probabilities (Fig. 5). If the signal in the forecast indi¬ 
cates likelihood for wetter conditions, then the proba¬ 
bility for above-normal precipitation will be higher 
than 33.3% and the probability for below-normal pre¬ 
cipitation will be less. Alternatively, the forecast can be 
represented as the probability for exceeding or not 
exceeding some quantitative value. 

One of the most important qualities of probabilistic 
forecasts is that the probabilities are reliable, or repre¬ 
sentative of the frequency of occurrence. The other 
important quality is that they are sharp, or differ 
substantially from the climatological probabilities. 
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Climate Predictions, Seasonal-to-Decadal. Figure 5 

Schematic of a probabilistic forecast distribution relative to 
a distribution of the historical climate observations. The 
values {horizontal axis) have been normalized with a mean 
of 0. The shaded regions represent the above-normal 
tercile {green) and the below-normal tercile {brown) of the 
historical distribution 


Diagnostics of these forecast characteristics can be 
visualized through reliability and attributes diagrams 
(Fig. 6) and quantified through reliability and reso¬ 
lution skill scores, respectively [57]. A reliability 
diagram shows the complete joint distribution of fore¬ 
casts and observations for a probabilistic forecast of an 
event or forecast category (such as the above-normal 
tercile). In a reliable forecast system, the probability 
assigned to a particular outcome should be the fre¬ 
quency with which - given the same forecast - that 
outcome should be observed. The information sup¬ 
plied by reliability diagrams includes calibration, or 
what is observed given a specific forecast (e.g., under 
and overforecasting), as well as resolution and refine¬ 
ment which is the frequency distribution of each of the 
possible forecasts giving information on the degree 
of aggregate forecaster confidence (small inset graph 
in Fig. 6). Reliability diagrams can further indicate 
whether there are systematic biases in the forecasts, 
such as not predicting enough occurrences of 
above-normal temperatures. Such probabilistic 


Reliability Diagram 
DJF Precip: Above-Normal Tercile 



Climate Predictions, Seasonal-to-Decadal. Figure 6 

An example of a reliability diagram, which indicates the 
skill of probabilistic forecasts. The diagram compares 
the forecasted probability of an event (in this case, 
above-normal winter rainfall in North America) to its 
observed frequency. A perfect forecast is represented 
by the dashed line, a horizontal line represents a forecast 
identical to climatology, and sloped lines are potentially 
skillful. The blue and red lines correspond to individual 
CGCMs and AGCMs, respectively, and are more 
horizontal than the black line, which represents the mean 
of these models. While the mean of the models is more 
reliable than any of the individual models, it tends to be 
underconfident for rare events (the black line lies above 
the perfect forecast line for low-probability events). 
Typically, a histogram accompanies a reliability diagram 
{inset), indicating the number of times that forecasts of 
various confidence levels were issued (Source: Adapted 
from [58]) 

verification, such as reliability diagrams also can be 
useful for estimating event-specific prediction skill, 
for example if El Nino events were better predicted 
than La Nina events or drought conditions were better 
predicted than very wet seasons. A distinction in pre¬ 
diction skill between the cases of high and low variabil¬ 
ity calls for further examination of the physical causes 
of the discrepancy and whether it is inherent to the 
climate system dynamics or a shortcoming of the 
model(s). 
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It is a common feature of dynamical model pre¬ 
dictions to be overconfident, indicated by a reliability 
curve that is more horizontal than the 45° angle that 
would indicate a reliable prediction system. For exam¬ 
ple, an overconfident forecast would be one in which 
a forecast that indicates above-normal rainfall is 
80% likely in a given season, but overtime that forecast 
is followed by observations of above-normal rainfall 
40% of the time. Such overconfidence can arise from 
errors in both the forecast signal and the forecast 
uncertainty. 

Recalibration of predictions and multi-model 
ensembling are two approaches used to improve fore¬ 
cast reliability. Multi-model ensembling, which com¬ 
bines the prediction of several dynamical models, 
can improve the reliability and overall skill of predic¬ 
tions in two ways. First, although all models have 
errors, they do not necessarily have the same errors, 
thus combining the models reduces the systematic 
errors that would exist in the prediction from a single 
model. This can lead to reduced error and thus 
increased correlation skill in El Nino predictions, for 
example (Fig. 7) [59]. Similarly, it can increase the 
spatial coverage for where there is skill in capturing 
the predictable signals in the climate. The second 
advantage is the improvement in uncertainty estima¬ 
tion by considering the random errors and different 
parameterizations of random processes that give rise to 
the range of possible outcomes. 

Multi-model ensembling can lead to overall better 
information on the climate signal and its uncertainty 
[60], and thus on forecast reliability (Fig. 8). Different 
approaches exist to combine models. The most 
straightforward is to treat all models equally. Particu¬ 
larly for prediction systems with short retrospective 
forecast histories of about 25 years or less, it will be 
difficult to discern differences in forecast quality 
between comparable models. This is the typically situ¬ 
ation with one-tier prediction systems that use coupled 
ocean-atmosphere models, because the ocean observa¬ 
tions used in the forecast initialization is only available 
since the late 1980s. For two-tiered forecast systems 
that use atmosphere-only models the ocean tempera¬ 
tures can be predicted statistically, which allows for 
longer histories of retrospective forecasts. In these 
systems, it becomes possible to discern differences 
in regional forecast performance and to use that 


information to give more weight to better performing 
models, which can lead to further improvements in 
forecast reliability [61, 62]. 

An alternative approach to performance-weighting 
models is to recalibrate the models prior to combina¬ 
tion [63]. Recalibration has the advantage of improv¬ 
ing forecast quality of individual models. It is also 
more viable for prediction systems with limited retro¬ 
spective forecast histories, although a minimum of 
about 25 years is still required to identify systematic 
biases in seasonal-to-interannual variability. The 
recalibration of predictions is an attempt to account 
for systematic biases in both the signal and uncertainty 
in the predictions at a given location. Recalibration can 
also be used to account for spatial biases in the forecasts 
by comparing observed and predicted seasonal climate 
over several decades [9]. 

Reliable forecast information may still not provide 
enough specificity for those who wish to include 
seasonal-to-interannual climate forecasts in their deci¬ 
sion models, such as those in the agricultural or water 
sectors. The spatial mismatch of the information, the 
fact that decision makers in sectors such as agriculture 
and water require information at much higher resolu¬ 
tion, even if it means greater uncertainty, is a com¬ 
monly cited reason for not using the operational 
forecasts [64]. There may also exist the desire for 
greater temporal resolution, such as the characteriza¬ 
tion of the weather within the climate that might pre¬ 
dict the likely number of dry spells of a given duration. 
In some cases, certain weather characteristics of 
the seasonal climate may be more predictable than 
the seasonal totals (e.g., [65]). One way to address the 
information mismatch between the coarse spatial res¬ 
olution, or the quality of the higher temporal variabil¬ 
ity, from global seasonal climate forecasts and the more 
detailed needs of the end user is through downscaling 
techniques. In statistical downscaling, the global cli¬ 
mate forecast provides the input parameters for an 
empirical model with high spatial resolution. Statistical 
techniques can also be used to infer the signal in the 
weather characteristics relative to the seasonal mean, 
based on changes in the large-scale background cli¬ 
mate, such as those empirically related to ENSO 
[66, 67], or to changes in atmospheric circulation 
[65]. In the dynamical downscaling, the global forecast 
is used to provide lateral boundary conditions to 
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Climate Predictions, Seasonal-to-Decadal. Figure 7 

(a) Anomaly correlation coefficients and (b) RMSE of NIN03.4 index during 1980-2001 with respect to lead time after 
removing the mean bias. The mean skill for all four cases including February, May, August, and November initial conditions 
is shown. Black for observation, red for 10 CGCM multi-model ensemble, blue for the Stat-Dyn forecast, and colored dots for 
individual coupled models as shown in the legend, respectively (From [59]) 


a high-resolution nested regional atmospheric model. 
While it may provide greater detail of the mean cli¬ 
mate by better resolving terrain and coastlines, it 
has not been robustly demonstrated that dynamical 
downscaling improves prediction of the climate var¬ 
iability relative to the global model. Dynamical 
downscaling cannot overcome large-scale errors in 
the global model driving the nested model, and 
in many cases will exacerbate those errors. With 
increases in computing power, global climate models 
are starting to close the gap by providing fine 
spatial resolution, and attempting to provide better 


representation of weather transients that may be of 
interest to the end user. However, for the next decade 
or so downscaling techniques, particularly statistical 
downscaling will continue to add value to seasonal- 
to-interannual forecasts. 

Seasonal-to-Interannual Prediction: Forecast Skill 

Forecast skill is a measure of how accurately the 
prediction system can predict the observed climate 
variability or how well the probabilities describe the 
frequency of occurrence of particular outcomes. 
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monthly values for the PDO index: 1900-September 2009 



Climate Predictions, Seasonal-to-Decadal. Figure 8 

(a) The positive {left) and negative {right) phases of the "Pacific Decadal Oscillation" (PDO), a long-lived El Nino-like pattern 
of Pacific climate variability, shown in terms of (a, left) the positive phase in sea surface temperatures {colors), sea level 
pressure anomalies {contours), and anomalous low-level winds {arrows), and (b) the time series of the PDO (Taken from 
http://jisao.washington.edu/pdo/) 


Measures of accuracy between the best guess, or most 
likely outcome, of the forecasts and what was observed 
are often referred to as deterministic measures, mean¬ 
ing they are concerned with verifying the prediction for 
a single specific outcome, such as a prediction for an 
above-normal temperature or 2°C warmer than aver¬ 
age in the coming season. The metrics for deterministic 
quantitative forecasts include the Brier skill score and 
its decomposition, which includes anomaly correla¬ 
tions or root-mean squared errors. The quality, or 
skill, of deterministic categorical forecasts can be 
assessed using a variety of measures. There is no single 
measure of forecast performance that can indicate all 
aspects of forecast quality [68]. Additionally, forecast 
producers may be interested in different aspects of 
forecast performance than users of the forecast infor¬ 
mation. The World Meteorological Organization 
has compiled a list of recommended deterministic 
and probabilistic verification measures for seasonal 


predictions entitled, The Standard Verification System 
for Long-Range Forecasts [69]. 

Keeping in mind that there are different, comple¬ 
mentary measures of forecast skill, the accuracy of pre¬ 
dictions is typically used to estimate the limit of 
predictability. The limit of predictability is a function 
of the predictable signal and the unpredictable chaotic, 
or noise, component in the climate system. With an 
ensemble of predictions from a single model or a set of 
models, the signal and noise can be estimated from that 
set of information. The signal would be the predicted 
information that the ensemble has in common and the 
noise is the range of discrepancy about the signal 
(Fig. 5). When the forecast is initialized, the ensemble 
contains very little noise, but as the prediction pro¬ 
ceeds, the chaotic processes in the climate system lead 
to divergence of the ensemble members. The limit of 
temporal predictability is reached once the magnitude 
of the noise becomes comparable to the signal. This in 
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part, determines how far into the future the certain 
aspects of the climate can be predicted. Similarly, the 
average signal-to-noise ratio for a given region, season, 
variable, etc. describes the expected climate predictabil¬ 
ity in that case. Since Nature has only one realization, it 
is not possible to estimate the inherent limit of predict¬ 
ability of the climate system [9]. Estimates of the limit 
of predictability can be determined in a given predic¬ 
tion system as described above, but that will be only an 
estimate, and will be different for different forecast 
systems. At best, the most accurate prediction system 
for a given region, season, variable, etc. represents the 
limit of predictability for that case, and should be 
considered the lower limit of predictability, as the pre¬ 
diction accuracy is found to be at least that good and 
may improve further with improved models and data 
assimilation systems. 

Given that real-time predictions have been in 
production for more than a decade now [1], several 
properties of forecast skill have emerged for seasonal- 
to-inter annual predictions. First, predictions of sea¬ 
sonal mean temperature are more predictable than 
those for seasonal precipitation totals. This is related 
in part to the larger-scale nature of temperature anom¬ 
alies and the processes behind them. Even the coarse 
resolution global climate models can represent fairly 
accurately the changes in seasonal temperatures. Pre¬ 
cipitation processes and patterns have much smaller 
spatial scales and are more affected by local scale fea¬ 
tures. While the global models may be able to capture 
large-scale shifts in regions of convection and storm 
tracks, they may have difficulty with the characteristics 
of storms or local convective activity. The potential 
importance of local scale processes on precipitation 
variability also means that the noise component of 
seasonal precipitation variability is larger than that 
for temperature. As a result, more ensemble members 
are required to estimate the seasonal signal for precip¬ 
itation than for temperature. The second robust prop¬ 
erty of seasonal predictions is that the tropics are much 
more predictable than are the extratropics. In the tro¬ 
pics, the atmospheric circulation is more explicitly tied 
to the changes in patterns of surface temperatures, and 
the noise in the resulting atmospheric circulation is 
relatively small. A third and notable property of the 
predictions, which actually applies to predictions at all 
timescales, is that there is conditional skill in the 


expected accuracy. There are times when the initial 
and evolving state of the climate system carries 
a much larger predictable signal than other times. For 
seasonal-to-interannual forecasts this coincides with El 
Nino events. For seasonal predictions over the United 
States most predictability derives from El Nino or La 
Nina conditions [70]. Similar results hold on a global 
scale too; the fraction of land area over which skillful 
forecasts can be made is up to twice as large during 
El Nino or La Nina conditions than in their 
absence [71]. Moreover, since these events have an 
inherent timescale of 6-12 months, or longer, the 
time horizon into the future that skillful forecasts can 
be issued is also expanded. 

Seasonal-to-Interannual Prediction: Internationally 
Coordinated Efforts 

Several internationally coordinated efforts have led to 
the understanding of seasonal-to-interannual climate 
variability and its prediction using dynamical models. 
One of the earliest was the Atmospheric Model 
Intercomparison Project (AMIP) [72]. This project 
was organized by the Working Group on Numerical 
Experimentation as a contribution to the World Cli¬ 
mate Research Programme. Different atmospheric 
models were run with the same observed sea surface 
temperatures as boundary conditions for the period 
1979-1988. The goal was to identify systematic errors 
as well as systematic responses to the boundary condi¬ 
tions across models. Without such a coordinated effort 
there had been questions whether the results from 
a single model were particular to that model or 
a more robust response expected of the climate system. 
Other coordinated activities followed. 

In the late 1990s, experiments were carried out 
using different atmospheric models to test the predict¬ 
ability of seasonal climate relative to the variability 
of sea surface temperatures. Two important issues 
addressed in that collection of research were the relative 
impact of initial atmospheric conditions predictability 
of the seasonal climate and a suggestion that predic¬ 
tion skill could be improved through a multi-model 
approach. In the United States five modeling centers 
participated in this research under the Dynamical 
Seasonal Prediction (DSP) project. On the other side 
of the Atlantic, 11 different partners throughout 
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Europe contributed to the Prediction of Climate Vari¬ 
ations on Seasonal to Interannual Timescales (PRO¬ 
VOST) project [73]. 

Further research on seasonal predictability and the 
value of multi-model ensembles was conducted with 
coupled models from seven European modeling centers 
under the Development of a European Multimodel 
Ensemble system for seasonal to inTERannual predic¬ 
tion (DEMETER) project [74]. This project also 
encouraged research to determine the value of seasonal 
predictions through their use in models that use the 
climate data to make prediction over a wide range 
of interests, from agriculture to health. The next 
generation of DEMETER was ENSEMBLES, which 
continued to advance methods and application of sea¬ 
sonal predictions from European Earth system models, 
thus adding complexity to the dynamical prediction 
models [75]. The ENSEMBLES project also began to 
extend those predictions to decadal timescales. The 
Working Group on Seasonal to Interannual Prediction 
under the World Climate Research Programme is cur¬ 
rently coordinating the Climate-system Historical 
Forecast Project (CHFP), which will provide access to 
a wide range of hindcasts to evaluate subseasonal-to- 
decadal predictions of the climate system, which also 
aims to quantify the predictability added by elements 
other than sea surface temperatures, for example 
through initialization and prediction of the land sur¬ 
face, the cryosphere, and the stratosphere [76]. 

Decadal Prediction (Experiments) 

Decadal Prediction: Drivers of Decadal-Scale 
Climate 

Decadal climate predictions sit between the seasonal- 
to- inter annual forecasts of the next months to a year 
in the future and the climate change projections of 
50-100 years in the future. There are many features of 
the climate system with timescales that vary over 
decades (decadal variability). The dominant drivers of 
climate features over decadal timescales are believed to 
be changing atmospheric composition, mainly increas¬ 
ing greenhouse gases, and slow changes in ocean circu¬ 
lation that lead to slow changes in the pattern of sea 
surface temperatures. The changing atmospheric com¬ 
position changes the energy balance of Earth, which 
leads to warmer temperatures and other associated 


climate changes that manifest primarily as trends. The 
temperature trends are not spatially uniform. Ice- 
albedo feedback in higher latitudes leads to greater 
rates of warming there than at low latitudes. Land has 
a lower heat capacity than water, so the continents 
warm faster than the oceans. Ocean dynamics also 
play a role in the patterns of climate change warming, 
particularly in upwelling regions, where the radiative 
warming is offset by the upward advection of colder 
ocean water from depth. 

What decadal predictions aim to capture that 
climate change projections do not is the predicted 
evolution of naturally occurring decadal-scale features. 
Climate change projections contain these processes and 
the associated variability, but since the climate system is 
not initialized with observations, the decadal evolution 
will not be temporally consistent with the observations. 
So one first test of a model is to see whether it is capable 
of simulating the dominant decadal-scale features 
observed in Nature. 

Decadal-scale variability has been identified in 
Nature in both the Pacific and Atlantic Oceans. In the 
Pacific Ocean the variability is referred to as the Pacific 
Decadal Oscillation (PDO), or more correctly Pacific 
Decadal Variability (PDV). The pattern of PDV 
(Fig. 8a) has its signature in sea surface temperatures 
with cooler than normal temperatures in the midlati¬ 
tudes of the North Pacific Ocean and warmer than 
normal temperatures in the eastern and central equa¬ 
torial Pacific Ocean during the positive PDV condi¬ 
tions [22]. The time series associated with the 
projection of sea surface temperature anomalies on 
this pattern represents the PDO index (Fig. 8b). This 
sea surface temperature pattern is reminiscent of El 
Nino conditions, except that the magnitude of sea 
surface temperature anomalies is larger in the mid¬ 
latitudes than in the tropics, and the tropical sea 
surface temperatures have a broader meridional extent. 
This pattern of sea surface temperatures is accompa¬ 
nied by sea level pressure anomalies in the North 
Pacific. A measure of the time series of changes in 
North Pacific sea level pressures is known as the 
North Pacific Pressure Index (NPPI). It was later real¬ 
ized that there is symmetry in the Pacific decadal var¬ 
iability such that a similar pattern of cooler than 
normal sea surface temperatures and anomalous low 
sea level pressure is also found in the midlatitudes of 
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the South Pacific Ocean. The full Pacific view of decadal 
variability has been named the Interdecadal Pacific 
Oscillation (IPO, [77]). However, the PDO is the 
more commonly used index outside Australia. 

The symmetry of ocean-atmosphere anomalies 
outside the tropics, and the resemblance to El Nino, 
suggests a role for El Nino in driving PDV. It is 
also notable that there is considerable year-to-year 
fluctuation in the PDO index. It is very difficult to 
identify in any particular year what phase, positive 
or negative, the PDV is in because within the 
protracted periods in which the PDV is preferentially 
of one sign or the other, there exist excursions of 
the index of opposite sign that may only last a year 
or two. 

Simple model experiments have shown that El Nino 
events can affect the positive phase of PDV [41]. Model 
analysis suggests an atmospheric Rossby wave train 
emanating from anomalous convective heating in the 
central Pacific leads to anomalous low sea level pressure 
in the region of the Aleutian low, thus strengthening 
the westerly trade winds. The strengthened winds lead 
to cooling through enhanced evaporation and also 
drive southward Ekman flow that brings colder water 
from the north southward. Those changes in the ocean 
mixed layer can be sequestered from the atmosphere 
from one winter to the next due to changes in the ocean 
mixed layer depth and its connection to the surface 
from winter, when El Nino peaks, the storm track is 
strongest and the atmosphere can directly affect the 
upper ocean, to summer when the previous El Nino 
would have decayed, the storm track is relatively weak, 
and increased solar radiation stabilizes the upper 
ocean. The following winter when the westerly winds 
of the storm track again increase, the sequestered 
mixed layer temperature anomalies reemerge [78]. 
This reemergence mechanism is hypothesized to the 
main way that the year-to-year variability associated 
with El Nino and La Nina can be rectified into longer 
timescale variability. However, other processes may 
also contribute to PDV. Some mechanisms that have 
been proposed included ocean-atmosphere coupling 
of a basin gyre mode [25], excitation of midlatitude 
oceanic Rossby waves [79], and a complementary, 
possibly independent oscillation driven by the tropics 
particularly when El Nino events are focused toward 
the central equatorial Pacific [80]. 


Associated with the decadal changes in Pacific 
Ocean conditions, decadal-scale terrestrial climate 
anomalies have also been identified over the United 
States [22] and throughout the Pacific sector [81]. 
Many of these climate anomalies are consistent with 
El Nino-related teleconnection patterns, such as wetter 
conditions in the southern tier of the United States and 
drier conditions over the Pacific Northwest [82]. 
Although only a few realizations of each phase of 
PDV exist in the instrumental records, the broad pat¬ 
tern seems to be consistent across these cases. However, 
because it is likely that El Nino is a dominant driver 
of PDV, and is associated with similar terrestrial 
teleconnections, it is difficult to say with confidence 
that the PDV is somehow independent of the mere 
existence of extended periods when El Nino events are 
stronger or more frequent versus when El Nino events 
are weaker or less frequent. 

Decadal-scale variability in the Atlantic is referred 
to as the Atlantic Multi-decadal Oscillation (AMO), or 
more correctly Atlantic Multi-decadal Variability 
(AMV), because there does not seem to be a spectral 
peak signaling a true oscillation. Because positive AMV 
conditions are associated with warming throughout 
the North Atlantic (Fig. 9), the index of AMV is simply 
the sea surface temperature anomaly averaged over the 
North Atlantic, and it is often detrended [83]. Other 
more elaborate means of isolating decadal-scale vari¬ 
ability over the Atlantic have been used (e.g., [24] ), but 
result in very similar time series, so the simple index is 
now the one most widely used. 

The hypothesized mechanism driving the AMV is 
associated with changes in the Atlantic Meridional 
Overturning Circulation (AMOC). The AMOC brings 
warm and salty water from the tropical Atlantic pole- 
ward. At high latitudes, cold salty water becomes 
denser than the water below it due to heat fluxes from 
the westerly storm tracks and brine injection from sea 
ice formation. The heavy surface water then sinks and 
flows back equatorward as North Atlantic Deep Water. 
The sinking water is replaced by the surface flow from 
tropics to high latitudes. If the rate of sinking increases, 
the poleward flow of surface water increases, bringing 
more warm tropical water into the midlatitudes. This 
represents an increase in the strength of the AMOC, 
and the AMV index becomes positive. If the North 
Atlantic water gets too warm or if it freshens the rate 
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Climate Predictions, Seasonal-to-Decadal. Figure 9 

(a) AMO index derived from detrended area-weighted 
mean North Atlantic SST anomalies by using a Chebyshev 
filter with a half-power period of 13.3 years, (b) Surface 
temperature anomaly associated with one positive 
standard deviation of the AMO index, calculated by 
regression of surface temperatures with the index and 
scaled by its standard deviation. The solid contour bounds 
regions significant at the 90% limit of a two-sided f-test 
accounting for autocorrelation (Source: From [83]) 


of sinking water slows down, and the rate of transport 
of warm tropical water poleward slows down. This 
represents a decrease in the strength of the AMOC, 
and the AMV will become negative. 

The AMOC is forced on all timescales. Because 
the Gulf Stream is the western boundary current 
of the wind-driven ocean gyre as well as contributing 
to the AMOC, changes in the winds will affect the 
AMOC as well as ocean temperatures. However, 
the multi-decadal-scale variability described above is 
a much slower process related to the inertia of the 
overturning circulation and the associated impact on 
the density properties of the Atlantic Ocean. Since 
observations of the AMOC have become available 
only since the end of the twentieth century, there is 
not enough observational evidence to quantitatively 
link the sea surface temperatures of the AMV with 


multi-decadal variability of the AMOC. However, the 
low-frequency variability of AMOC in some models is 
associated with a pattern of sea surface temperature 
anomalies that closely resembles the observed AMV 
pattern [83]. What has not been resolved is what pro¬ 
cess or collection of processes can influence the AMOC 
on long timescales. Some studies point to modification 
of the strength and local of the intertropical conver¬ 
gence zone over the Atlantic as a way to modify the 
salinity of the water transported from the tropics [84]. 
Others suggest that the North Atlantic Oscillation 
(NAO - also called the Arctic Oscillation, AO) plays 
a dominant role by influencing the strength of the 
winds, which then influence the rate of convection, or 
sinking of heavy water, in the Labrador Sea region with 
an estimated 10-year lag time [85]. Although this 
would be a fairly white noise process, the suggestion 
is that the ocean integrates the noise into a longer 
timescale red noise process, but one that might still 
carry some predictability due to persistence. 

The teleconnections associated with the positive 
phase of AMV include wetter conditions over the 
Sahel and India and drier conditions over northeast 
Brazil, due to the northward shift of the intertropical 
convergence zone toward the relatively warmer condi¬ 
tions north of the equator [29]. Also the warm tropical 
North Atlantic provides more fuel for the growth of 
tropical storms, and empirically it is seen that more 
tropical storms grow to hurricane intensity during the 
positive, warm phase of AMV than during the negative 
or cool phase. 

Decadal Prediction: Model Fidelity 

The task of judging whether a model captures decadal 
variability in the Pacific or Atlantic oceans for the right 
reason is greatly complicated by the limited history of 
observations compared to the timescale of the variabil¬ 
ity. Since the measurements of surface temperature and 
sea level pressure go back to the nineteenth century, 
most comparisons are made to these fields. In many 
cases, the question of model fidelity is closely tied to 
examination of the processes involved in producing 
the variability in a particular model and to what extent 
those are observed in Nature. The difficulty with that 
approach is as described in the previous section: differ¬ 
ent models may not agree on which process(es) dom¬ 
inates, or is even involved. 
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Different models also may yield somewhat different 
spatial patterns or spectra than suggested by the limited 
observations [86]. A similar situation exists relative to 
El Nino in coupled dynamical models [87] , and this has 
not prevented the use of those models for El Nino 
prediction. Thus, the most important factor may be 
simply whether or not a model is capable of capturing 
a reasonable representation of decadal variability in sea 
surface temperatures, as this is how whatever changes 
are occurring in the ocean will be communicated to the 
atmosphere. 

Exactly how best to validate decadal variability in 
the models is an area of active research. To date, more 
work has been focused around Atlantic variability per¬ 
haps because of the recognition of the role of the 
AMOC in the AMV. The idea is that if the AMOC is 
responsible for the AMV, then it is the AMOC that 
a model must be able to predict from a given set of 
initial conditions. The AMOC must then produce 
a surface temperature of reasonable resemblance to 
Nature, and the overlying atmosphere must be able 
to respond to the changes in surface temperature in 
a way that captures the observed teleconnections. 
Again, the difficulty is that very few realizations of the 
variability exist in the observational record, although 
paleoclimate reconstructions of past temperature or 
precipitation suggest that, for example, multi-decadal 
variability consistent with observed AMV has impacted 
regional climate 400-500 years back [88]. But the few 
realizations of the spatial pattern of sea surface temper¬ 
atures makes it difficult to know which parts of the 
pattern of anomalies are robust across events and 
which are variable from one positive phase to another. 

For PDV, most models capture the response of the 
midlatitude Pacific Ocean to El Nino variability. 
However, they often do not demonstrate the same 
level of multiyear persistence through a reemergence 
mechanism. Different models are also influenced to 
differing degrees by other processes hypothesized 
to contribute to PDV, including the white noise 
imposed by variability in the storm track. As a result 
the patterns of PDV, such as where the sea surface 
temperature anomalies are focused and the magnitude 
of that temperature variance, also differ among models. 

Some modeling studies have shown that at least 
the atmospheric models can translate the changes in 
patterns of sea surface temperature into realistic 


teleconnections. For example, using observed heat 
fluxes from the positive, warm phase of AMV in the 
Atlantic Ocean to drive an atmospheric model leads 
to decadal-scale changes consistent with observed 
changes in precipitation over the Sahel and India and 
also in the wind shear over the tropical North Atlantic 
relevant to hurricane formation (Fig. 10) [30]. Thus as 
with El Nino, if the sea surface temperatures can be 
predicted then there may be at least some predictability 
of the associated terrestrial climate impacts. 

Decadal Prediction: Prediction Experiments 

Experimental decadal prediction has only recently 
begun. Decadal prediction differs from climate change 
projections in the initialization of the climate system, 
with particular emphasis on initialization of the 
oceans. The first paper demonstrating actual retrospec¬ 
tive, decade-long, initialized forecasts was published in 
2007 [3]. They showed improvements in prediction of 
globally averaged temperatures relative to the un¬ 
initialized climate change projections from the same 
model. However, it was not clear how much predictive 
information is available at the regional scale from these 
predictions, and it is not obvious what the main drivers 
are behind any predictive information they may yield. 

The prediction systems for decadal prediction are 
essentially the same as for seasonal-to-interannual pre¬ 
diction. They require observations, models, and their 
connection through data assimilation systems. One of 
the main differences is the requirements on the obser¬ 
vations: seasonal-to-interannual predictions mainly 
need information about the upper several hundred 
meters of the tropical oceans; decadal predictions 
require information about the global oceans, including 
the middle and high latitudes and also to much greater 
depths to capture information on the lower branch of 
the AMOC. 

Observations needed to produce initial ocean con¬ 
ditions are incomplete. The creation of retrospective 
forecasts of decadal variability at least several decades 
into the past requires information on salinity fields that 
just do not exist. This has tested the limits of ocean state 
estimation with limited data, and the estimates even for 
large-scale averages, such as the average salinity anom¬ 
aly in the upper 700 m of the midlatitudes of the 
Atlantic Ocean, can vary greatly. The uncertainty 
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among datasets for upper ocean salinity anomalies on 
basin scales is larger than the variability within a single 
dataset [89]. Since the beginning of the twenty-first 
century, however, the Argo program of drifting buoys 
has provided unprecedented measurements of the 
upper 2 km of the global ocean. The floats measure 
temperature and salinity profiles as they descend and 
ascend the water column about every 10 days. There are 
currently over 3,000 floats reporting data through sat¬ 
ellites (Fig. 11). Even with good observational data 
coverage of the global oceans there will still be chal¬ 
lenges in merging those data efficiently with models 
through data assimilation systems to account for both 
mean biases and biases in space-time variability. 

The first step in exploring decadal prediction has 
been through perfect model studies. In perfect model 
studies, a free-running integration of the model is 
taken as truth; this integration assumes the role of the 
“observations.” Ensemble members are set up to start 
from a particular point in the free-running integra¬ 
tion with small perturbations to the initial state, 
representing the uncertainty in initial conditions. The 
ensemble members are then integrated forward to see 
how well they can track the “truth” of the free-running 
integration. In this experimental setup, the “observa¬ 
tions” are perfect because since they are taken from the 
model they are known everywhere, and the model 
is perfect, because it is dynamically consistent with 
the “observations,” which are just a snapshot from the 
model. Similar experiments toward the design of 
assimilation systems test the insertion of these perfect 
“observations” but sampled only at locations that 
the actual observing network could provide 
data. The idea is to see if in this most idealized 


of circumstances - perfect observations and perfect 
model - the model is able to predict the evolution of 
the “observed” variability taken from the free-running 
integration. If not, it implies that in the particular 
model, too much noise exists to extract a predictable 
signal. The situation will only be worse in a real forecast 
setting with imperfect observations in the model that is 
also not perfect. 

A number of these perfect model prediction 
experiments have been carried out since the early part 
of the twenty-first century. In a coordinated experi¬ 
ment of five European modeling centers, called PRED¬ 
ICATE, two to three experiments were carried out by 
each group starting their ensemble predictions at dif¬ 
ferent points in time to explore the prediction depen¬ 
dence on the variability of the AMOC. In all cases there 
was some skill in predicting the evolution of the AMOC 
(Fig. 12). The experiments also demonstrated condi¬ 
tional predictability much like is seen with El Nino 
predictions. The perfect model predictions started 
when the AMOC was stronger than average yielded 
predictability of the AMOC to about 10-15 years into 
the future; predictions started with a weak AMOC 
predicted the future evolution of the AMOC only 2-5 
years into the future [90]. The PREDICATE experi¬ 
ments were based on model control runs, meaning 
that atmospheric composition was held fixed. More 
recent perfect model studies explore the relative pre¬ 
dictive signal due to the initial conditions versus due to 
radiative forcing from increasing greenhouse gases 
[91]. Several common lessons are beginning to emerge 
from these studies. One is that the predictable time 
horizon, when the signal in the ensemble of predic¬ 
tions is larger than the uncertainty across ensemble 
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Observed and modeled variability. The color shading is the low-pass filtered [LF) data and the green dash line is the 
unfiltered data, (a) Observed AMO Index (K). (b) Observed JJAS Sahel rainfall anomalies (averaged over 20°W-40°E, 
10-20°N). (c) Observed JJAS west central India rainfall anomalies (averaged over 65-80°E, 15-25°N). (d) Observed time 
series of the dominant pattern (PC 1) of low-frequency JJAS rainfall anomalies, (e) Observed anomalous Atlantic major 
Hurricane number (axis on the left, original data from the Atlantic basin hurricane database-HURDAT, with no bias-type 
corrections from 1944 to 1969, there is no reliable data before 1944) and observed Hurricane Shear Index (1958-2000) 
(m/s, brown solid line for LF data, brown dash line for unfiltered data, axis on the right), (f) Modeled AMO Index (K). 

(g) Modeled JJAS Sahel rainfall anomalies, (h) Modeled JJAS west central India rainfall anomalies, (i) Modeled PC 1 of LF 
JJAS rainfall anomalies, (j) Modeled Hurricane Shear Index (m/s) (Source: From [30]) 
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Climate Predictions, Seasonal-to-Decadal. Figure 12 

Measures of the potential predictability of variations in the strength of the Atlantic Meridional Overturning Circulation 
from four of the five coupled models (see legend), [left) The anomaly correlation coefficient (ACC: unity for perfect 
potential predictability; zero for no potential predictability) for (fop) strong and [bottom) weak MOC initial conditions 
[right). The normalized root mean squared error (rmse: zero for perfect potential predictability; unity for no potential 
predictability) in the same order. Also shown in the figures are the multi-model average ACC and rmse [thick black line) and 
the multi-model average ACC for a simple damped persistence [thick gray line) (Source: From [90]) 


members, is longer for midlatitudes than for the tropics 
due to the dominance of year-to-year variability in the 
tropical oceans. Another lesson is that upper ocean heat 
content is more predictable than sea surface tempera¬ 
ture due to the impact of weather noise on surface 
temperatures, while the upper ocean temperatures are 
more reflective of the slow changes in the atmospheric 
circulation. Thus even if the AMOC is predictable, the 


surface temperatures connected with that feature will 
be less so, but it is this surface expression that is neces¬ 
sary for predicting the terrestrial climate impacts. 
Finally, it appears that the external forcing due to 
increasing greenhouse gases becomes comparable to 
the information from ocean initial conditions by 
10 years out for the midlatitudes and less in the tropics. 
Again, these are perfect model results. However, such 
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results only indicate the upper limit of predictability 
for a particular model, and even though the similar 
results have been found across several of the current 
models, it is not to say that different results might be 
possible from better models. 

To date, only a few pioneering attempts have been 
documented of “retrospective forecasts,” which are 
decadal predictions initialized with real observed 
initial conditions from some time ago. These predic¬ 
tion experiments not only used different models, 
they also used very different methods to obtain the 
initial conditions: one initialized with only sea surface 
temperatures [4], one initialized the observed data 
for ocean temperature and salinity anomalies as well 
as atmospheric anomalies [3], the third nudges their 
model toward the observational analysis using a differ¬ 
ent procedure [5]. 

The results are mixed. All claim to gain benefit from 
initialization of the climate system compared to the 
climate change projections that consider only changes 
in the atmospheric composition. Two of the studies 
[3, 5] show improvement in global mean temperatures 
compared to the un-initialized climate change projec¬ 
tions; the other study [4] slightly degrades their pre¬ 
diction of global mean temperatures with initialization. 
All claim, or at least imply, that much of the decadal 
variability that is captured is due to initialization of 
the AMOC. Two of the studies [4, 5] demonstrate 
improved temperature predictions over the eastern 
North Atlantic region, but for the study that also quan¬ 
tifies the impact of initialization on prediction 
errors [4] shows larger errors for North Atlantic sea 
surface temperature in the initialized predictions. The 
same two studies that show improved correlations for 
North Atlantic temperature predictions claim that it is 
due to improved prediction of the AMOC. Since there 
are no observations of this circulation feature in the 
twentieth century, the conclusions are based on com¬ 
parison with the analysis responsible for the initial 
conditions, which constitutes something like a semi¬ 
perfect model result rather than a verified prediction. 

Two of the early prediction experiments [3, 4] do 
show some improvement in regional temperature pre¬ 
dictions over land, but how much improvement is not 
easily discerned, and is difficult to compare across the 
experiments. There are also some regions where the 
temperature predictions are less skillful. Maps of skill, 


or differences in skill, are not provided for precipitation 
in these studies. These papers are only the beginning of 
assessment decadal prediction skill. 

What these results do or do not show must be 
viewed with caution though. Several difficulties stand 
in the way of more conclusive estimates of predictabil¬ 
ity and prediction skill for decadal climate variability. 
One difficulty is that the current sets of experiments, 
and even those that will soon be available (see Decadal 
Prediction: Internationally Coordinated Efforts) have 
very few ensemble members. Small ensemble size leads 
to uncertainty in the predicted signal, and provides 
very little information about the uncertainty due to 
uncertain initial conditions in a particular model. 
Multi-model ensembles will likely be more problematic 
for decadal predictions given the wide range of 
approaches to initial conditions; the prediction skill 
can also be compromised by the data assimilation 
component, even if the models are of equally high 
quality. Data assimilation and the development of ini¬ 
tialization techniques for current and retrospective 
decadal predictions require considerable research 
investment. What experiments do exist and are likely 
to exist in the next several years will have limited 
realizations of decadal-scale variability, complicated 
with the evolution of that variability against a changing 
background climate due to increasing greenhouse 
gases. That combined with limited observations, not 
only of the subsurface ocean but also of terrestrial 
climate for much of the world, makes verification of 
retrospective forecasts extremely challenging. 

Decadal Prediction: Internationally Coordinated 
Efforts 

Several international efforts have been organized since 
the beginning of the twenty-first century to ascertain 
the predictability and prediction skill of decadal cli¬ 
mate variability by systematizing the investigation 
across many models. The PREDICATE project, which 
was referred to above, provided a systematic compari¬ 
son of the “perfect model” predictability in five 
European coupled models. They found potential pre¬ 
dictability in the AMOC (Fig. 12), and also to some 
extent in surface air temperatures, that exceeded 
damped persistence [90]. The PREDICATE project 
examined the potential predictability of the response 
of atmospheric models to prescribed sea surface 
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temperatures, such as those associated with the AMV, 
and found good consistency across the models 
suggesting potential predictability if the pattern of 
SST was itself predictable [92]. 

A more recent activity, also drawing on the 
European modeling and prediction community is 
ENSEMBLES ([75]; http://ensembles-eu.metoffice. 
com/). This was a 5-year climate change research pro¬ 
ject begun in 2004, and involving 66 research partners 
across Europe. The project generated retrospective cli¬ 
mate forecasts from seasonal to multi-decadal scales, 
provided local interpolation and/or downscaling, and 
sought to apply that information to sectoral outlooks, 
such as agriculture, health, and energy, across Europe. 

The most extensive collaboration on decadal pre¬ 
diction experiments is the coordinated experiments 
designed and being run for the IPCC Fifth Assessment 
Report [93]. Together with the climate change projec¬ 
tions for the next IPCC report, these decadal prediction 
experiments will be part of the Coupled Model 
Intercomparison Project-5 (CMIP5). There is a mini¬ 
mal set of runs at the core of the experimental design 
that requires hindcasts initialized for near the end of 
1960 and every 5 years after that to 2005, in each case 
predicting 10 years past the initialization. Of those 10 
sets of experimental start dates, a subset - those initial¬ 
ized at 1960, 1980, and 2005 - will be run out for 
30 years. These experiments are to be run with a nominal 
ensemble size of 3. Of the dozen or more international 
modeling and prediction centers that will participate in 
the decadal prediction experiments of CMIP5, several 
will run with larger ensemble sizes and more start dates. 

What is not being coordinated for the CMIP5 
decadal prediction experiments is the data assimilation 
or initialization strategy. The guidelines only require 
that the predictions begin with a state of the climate 
system representative of the observations at that time. 
Thus, although there will likely be prediction systems 
that perform better than others, considerable analyses 
and further research will be required to assess which 
part of various prediction systems are responsible for 
their relative success or failure. 

Future Directions 

Although seasonal prediction is a relatively mature 
activity, considerable room for further improvement 


exists in the production, provision, and application of 
seasonal climate forecasts [9]. Dynamical models have 
many recognized biases in their tropical climate, such 
as tropical upper ocean structure and mixing, a 
tendency to produce a double intertropical conver¬ 
gence zone in the Pacific, and poor simulation of the 
stratus clouds that sit near the coasts along the eastern 
subtropical oceans. These problems are probably not 
unrelated, but they have proved difficult to solve. These 
tropical biases impact the realism of predicted El Nino 
events, which introduces biases into the associated 
teleconnections. Although much of the discussion in 
this chapter has focused on the climate predictability 
that arises from tropical SSTs, and especially El Nino, 
other factors in the climate system that are not well 
represented of initialized in models may carry addi¬ 
tional prediction skill. Such processes include land 
characteristics [36] such as soil moisture, snow, and 
vegetation, as well as sea ice, variability in the strato¬ 
sphere [94], and intra-seasonal variability such as the 
Madden-Julian Oscillation [95]. The provision of sea¬ 
sonal forecasts has improved since the 1990s; it has 
become common practice for operational centers to 
provide probabilistic information. However, that infor¬ 
mation is not sufficient for many decision makers if it is 
not accompanied by information on how the forecast is 
constructed, the past skill of the system, and more 
flexible or varied information that would allow sophis¬ 
ticated users to incorporate the data into quantitative 
decision systems. These types of best practices are 
much easier to address than model biases. Also, if 
addressed they would allow for broader use of past 
forecasts for research to underpin the use of current 
forecasts for decisions. 

Decadal prediction is still in the phase of research 
and experimentation. Thus, decadal prediction itself 
should be considered a future direction of climate 
prediction. Although there have been some pioneering 
studies that present results from decadal prediction 
systems, there is no community-wide agreement on 
how decadal prediction systems should be constructed, 
what information can be provided, with what accuracy, 
and even how best to verify the information that is 
predicted [2]. The internationally coordinated set of 
experiments under CMIP5 should contribute to 
a better understanding of these prediction systems 
and their potential. These experimental predictions 
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will build on the current limited understanding to 
illuminate the relative information provided by initial 
conditions, in a real forecast setting, compared to the 
radiative forcing from current and future atmospheric 
greenhouse gas increases. The prediction experiments 
taken together should also help identify model biases 
that are of particular concern to decadal variability and 
set priorities on the future development and mainte¬ 
nance of the ocean observing system. Also, the added 
complexities of data assimilation for decadal prediction 
that will encompass longer timescales, greater depths in 
the ocean, and more need for salinity information, 
which was not available for most of the ocean prior to 
the twenty-first century, will lead to innovations in data 
assimilation systems (e.g., [96]). 

The seasonal and decadal prediction systems share 
many common elements. In particular, they use the 
same type of dynamical models, and they both rely 
heavily on ocean initial conditions interpreted through 
data assimilation systems. They are both potentially 
impacted by external forcings such as solar variability 
and volcanoes. The same model biases that affect sea¬ 
sonal prediction skill will impact decadal predictions 
also. As the research community develops improved 
dynamical models, that better represent the Earth sys¬ 
tem in all its complexity, it will benefit climate pre¬ 
dictions at all timescales. The additional observational 
data and more sophisticated data assimilation systems 
that are required for initialization of decadal predic¬ 
tions will provide more information of the ocean state 
that could be relevant for seasonal predictions as well. 
Already, some efforts to create retrospective seasonal 
predictions are being run farther into the future to 
investigate the ability of those systems to predict 
interannual-to-decadal climate variability. On the 
other side, retrospective decadal predictions that will 
contribute to the CMIP5 database already predict 
through the seasonal timescale. 

The larger vision for the future direction of seasonal 
and decadal prediction is the union of the two efforts. 
This has been called “seamless prediction” [97, 98], 
which seeks the seasonal predictions to both the lon¬ 
ger-term decadal predictions as well as the shorter- 
term weather forecasts. Initial steps in bridging the 
weather and seasonal timescales have been made (e.g., 
[97] ), and since the observational and data assimilation 
systems are in place and have been well tested for these 


timescales, it is a sensible starting point. The joining 
of seasonal and decadal prediction scales would 
appear to be developing naturally as part of the 
evolving research into climate variability, predictabil¬ 
ity, and prediction on these timescales. 
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Post-combustion capture Concentration of C0 2 from 
mixed, dilute, and flue gas streams that are the 
products of combustion, commonly through chem¬ 
ical solvents, sorbents, or selective membranes. 
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Carbon capture and sequestration (CCS) is the long¬ 
term isolation of carbon dioxide from the atmosphere 
through physical, chemical, biological, or engineered 
processes. The primary purpose is to reduce the 
impacts of human-induced climate change associated 
with increased greenhouse gas concentrations from 
emissions and land use. CCS can be a major partial 
solution to climate change, accounting for between 
15% and 25% of the needed abatement, and should 
be considered as part of a portfolio of solutions which 
include efficiency and conservation, renewable energy, 
nuclear power, and other options. 


Glossary 

Carbon capture and sequestration (CCS) The long¬ 
term isolation of carbon dioxide from the atmo¬ 
sphere through physical, chemical, biological, or 
engineered processes. 

C0 2 capture The separation and concentration of 
C0 2 from mixed, dilute, or flue gas streams, com¬ 
monly for the purpose of geological sequestration 
or EOR (which requires 95% concentrations or 
higher). 

C0 2 enhanced oil recovery (EOR) Injection of C0 2 
into depleted oil fields for the purpose of increasing 
production. This typically also results in long-term 
storage of C0 2 . 

Geological sequestration The long-term physical or 
chemical storage of C0 2 in deep geological forma¬ 
tions for the purpose of indefinite separation from 
the atmosphere. 

Oxyfired combustion The burning of an hydrocarbon 
fuel (coal, oil, gas, biofuel) in an oxygen-rich or 
pure oxygen environment for the purpose of C0 2 
concentration or capture. 

Precombustion capture Concentration of C0 2 from 
mixed gas streams that are the products of fuel 
conversion (e.g., gasification, methanation, or fer¬ 
mentation) before combustion, commonly through 
physical solvents. 


Introduction 

Carbon capture and sequestration (CCS) is the long¬ 
term isolation of carbon dioxide from the atmosphere 
through physical, chemical, biological, or engineered 
processes. This includes a range of approaches includ¬ 
ing soil carbon sequestration (e.g., through no-till 
farming), terrestrial biomass sequestration (e.g., 
through planting forests), direct ocean injection of 
C0 2 either onto the deep seafloor or into the interme¬ 
diate depths, injection into deep geological formations, 
or even direct conversion of C0 2 to carbonate min¬ 
erals. Some of these approaches are considered 
geoengineering (see the appropriate chapter herein). 
All are considered in the 2005 special report by the 
Intergovernmental Panel on Climate Change [1]. 

Of the range of options available, CCS most com¬ 
monly entails the capture of C0 2 from power and 
industrial plants followed by injection into deep geo¬ 
logical formation (geological carbon sequestration, 
or GCS). While the economics are hotly debated, in 
many ways this form of CCS appears to be a critical 
option for major greenhouse gas reduction in the 
next 10-50 years. The basis for this interest includes 
several factors: 

• There is no obvious immediate technical barrier to 
deployment. Systems to capture and concentrate 
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C0 2 have been in commercial operation for over 
80 years (REF), and the oil and gas industry has 
injected C0 2 for enhanced oil recovery for over 
35 years. 

• The initial estimates of potential geological capaci¬ 
ties are enormous (Fig. 1). Formal estimates for 
global storage potential vary substantially, but 
are likely to be between 800 and 3,300 Gt of C 
(3,000 and 10,000 Gt of C0 2 ), with significant 
capacity located reasonably near large point sources 
of the C0 2 . (REF) 

• Testing of large-scale GCS is feasible and has begun 
(Fig. 1). In many industrialized countries, large 
C0 2 sources like power plants and refineries lie 
near prospective storage sites. These plants could 
be retrofit today and injection begun (while bearing 
in mind scientific uncertainties and unknowns). 
Indeed, some have, and five projects described 
here provide a great deal of information on the 
operational needs and field implementation of 
CCS, with more to follow. 

Part of this interest comes from several key docu¬ 
ments written in the last few years that provide infor¬ 
mation on the status, economics, technology, and 


impact of CCS. These are cited throughout this text 
and identified as key references at the end of this 
manuscript. 

When coupled with improvements in energy effi¬ 
ciency, renewable energy supplies, fuel switching, and 
nuclear power, CCS helps dramatically reduce current 
and future emissions [2-4]. If CCS is not available as 
a carbon management option, it will be much more 
difficult and much more expensive to stabilize atmo¬ 
spheric C0 2 emissions. Estimates of the cost of carbon 
abatement without CCS appear 30-80% higher than if 
CCS were to be available [5]. 

Carbon Capture 

CCS has two separate but coupled steps. The first is the 
separation and concentration of C0 2 from industrial 
flue streams, chiefly power plants. This first step is 
commonly called carbon capture and usually includes 
compression and transportation via pipeline. The sec¬ 
ond involves the injection of C0 2 as a dense, supercrit¬ 
ical (liquid-like) phase into deep geological formations. 
This step is commonly called geological carbon seques¬ 
tration ., or GCS. In short, the cost for CCS lies 
mostly in the capture stage. The risk lies mostly in the 
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C0 2 Capture and Sequestration. Figure 1 

Map of large CCS projects worldwide. Base map shows highly prospective {dark shaded) and prospective (medium shaded) 
areas; squares show size of pure C0 2 streams available for sequestration (After Bradford and Dance (2004)) 
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sequestration stage. Both steps are needed for a suc¬ 
cessful project. 

In the first step, CCS requires the separation of C0 2 
from industrial flue streams and concentration to 
C0 2 purities of 95% or greater [2, 6]. This limits 
compression costs and makes effective and efficient 
use of available sequestration resource (subsurface 
pore volume). Currently, four technology pathways 
exist for commercial C0 2 capture and separation 
(Fig. 2): 

• Post-combustion capture: This involves separation of 
C0 2 from nitrogen, commonly with chemical sor¬ 
bents (e.g., monoethanolamine (MEA)). 

• Precombustion capture: This involves conversion of 
fuel feedstocks (e.g., coal) into syngas via gasifica¬ 
tion, steam reformation, or partial oxidation and 
then shifting the syngas chemically to hydrogen and 
C0 2 , and then separating the H 2 from C0 2 . Cur¬ 
rently, this last step is commonly done with physical 
or chemical sorbents. 


• Oxyfiring combustion: This involves combustion of 
fuels in a pure oxygen or 0 2 -C0 2 rich environment 
such that effectively no nitrogen is present in the 
flue gas. Separation of 0 2 from air (N 2 ) is required 
and is the main cost element. Chemical looping [7] 
is considered a variation of this technology. 

• Direct capture of high-purity streams: In these cases, 
C0 2 is “pre-captured” and already at or above 95% 
purity. This requires only compression and trans¬ 
portation before sequestration. 

• Direct air capture: This involves separation of C0 2 
directly from the atmosphere. The very low concen¬ 
trations of C0 2 in air (~385 ppm) make this 
approach both expensive and controversial. 

Each of these approaches requires substantial power 
to run the adsorption and air separation units, raising 
operating expenses and increasing the amount of C0 2 
emissions produced simply to drive the sequestration 
process. They also require more capital in plant con¬ 
struction and have differing operational costs and 
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C0 2 Capture and Sequestration. Figure 2 

Schematic diagram of the four main C0 2 capture pathways and the main processes involved (After Thambimuthu et al. [6]) 
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energy penalties. At present, each technology pathway 
appears equally viable from an economic and thermo¬ 
dynamic standpoint [2, 6, 8]. 

Industry has substantial experience with each of 
these technology pathways, chiefly from operation of 
hydrogen plants, fertilizer plants, refineries, and natu¬ 
ral gas processing facilities. C0 2 has been separated 
from industrial flue streams at scales much greater 
than 1 MMt C0 2 /year (270,000 t C/year). Similarly, 
C0 2 has been separated from small-scale power plants, 
and the technology to scale these operations to plants 
of 200 MW or greater exists. Large pipelines transport 
millions of tons of C0 2 hundreds of kilometers, and 
millions of tons of C0 2 and other acid gases are com¬ 
pressed and injected into geological formations every 
year. Thus, a great deal is known about carbon capture, 
separation, and transportation, and many countries 
have regulatory frameworks in place to accommodate 
the permitting of separation facilities and pipelines 
(in the USA, the Department of Transportation carries 
this authority). 

Cost remains an important barrier to wider com¬ 
mercial deployment [2, 9], especially retrofit costs. 
However, as the concepts for geological carbon seques¬ 
tration are proven to be reliable for current power plant 
technology, improved power plant designs are expected 
to be able to bring down sequestration costs dramati¬ 
cally. Several large pilot projects are testing pre-, post, 
and oxyfired combustion tests at the 2-30 MW scale, 
a necessary precursor to broad commercial deploy¬ 
ment. A number of technologies claim to be able to 
capture and separate C0 2 at $9-11/t C0 2 , less than half 
the best available technology. As the results from these 
and future tests are made public, decision makers and 
investors will be able to plan better for plant economics 
and design. 

Post-combustion Capture 

As the name suggests, post-combustion capture (PCC) 
involves capture and separation of C0 2 from flue gas 
streams after combustion [2, 6]. It can also apply to 
capture from high- or low-concentration bag-house 
streams from industrial applications (e.g., cement 
manufacturing). Most PCC study has focused on 
separation from coal plants, both because of their 
central role in global power production and their 


high concentration of flue gas C0 2 (14-20%). How¬ 
ever, PCC can also be applied to natural gas plants 
(4-7% concentration) or other post-combustion 
streams (e.g., biomass fired power generation). 

Typically, C0 2 capture involves one of these sepa¬ 
ration processes: 

• Chemical solvents: This process operates when flue 
gas contacts liquid solvents and dissolves into the 
liquid. The C0 2 -rich solvent is typically heated to 
release the C0 2 and reconstitute the solvent, which 
is then recirculated. 

• Membranes: This process separates C0 2 from mixed 
gases as C0 2 preferentially flows through the mem¬ 
brane, which rejects other gases. Membranes can be 
made of polymers [10], ceramics [11], or more 
exotic materials (e.g., [12]). Costs and viability are 
a function of gas selectivity and permeance. 

• Solid sorbents: In this process, selective solid mate¬ 
rials (e.g., zeolites) adsorb C0 2 from mixed gas 
streams. These are released under different physical 
or chemical conditions, typically through heating. 

• Exotic materials: Other classes of materials have 
been proposed and remain promising, including 
ionic liquids [13, 14] and metal-organic frame¬ 
works (MOFs; REF). 

The cost and performance of different systems 
varies as a function of loading, energy costs, capital 
costs, vapor pressure, process efficiency, and other 
issues (e.g., Abanades et al. 2009). While some pro¬ 
cesses appear to be highly efficient, they may have 
cost or volumetric issues which limit deployment. 
Many prospective processes remain at the bench-scale 
and have not been tested in large-scale pilots or 
commercial plants. 

Today, most post-combustion capture is performed 
by liquid solvents, chiefly amines. This is in large part 
due to the familiarity of the technology (invented in 
1930) and the conventional equipment set used, such as 
gas stripper towers and thermal regeneration. Rochelle 
[15] argues that the familiarity and performance of 
amine-based solvents makes them the class of C0 2 
capture processes and materials most likely to be widely 
deployed. Many commercial groups are working to 
improve the cost and performance of their amine sys¬ 
tems today, including Mistubishi, Babcock and Wilcox, 
Alstom, Aker Clean Carbon, HTC, and Huaneng. 
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While other capture systems show promise based 
on bench-scale performance [16], few have been tried. 
One exception is Alstons chilled ammonia process 
[17], piloted at the Mountaineer power plant in West 
Virginia. Work on the pilot project ceased in 2011, and 
results remain inconclusive. 

Precombustion Capture 

Precombustion capture involves the removal of C0 2 
after the coal is converted into syngas, but before com¬ 
bustion in a turbine or boiler. Typically, the first step 
involves gasifying the coal. This syngas contains C0 2 , 
which may be separated at this stage for partial capture 
(typically around 20% of carbon content). To achieve 
higher fractional capture, a water-gas shift reactor 
converts carbon monoxide and steam to C0 2 and 
hydrogen. This increases the concentration of C0 2 , 
improving capture efficiency and increasing the 
amount of carbon (in the form of C0 2 ) that can be 
removed using this process. The C0 2 is then removed 
using either a chemical or a physical solvent, such as 
Selexol™ or Rectisol™, and is compressed. This 
approach is used widely where coal is used for chemical 
feedstock. 

For power production applications, the remnant 
hydrogen burns in a turbine to generate electricity 
[2]. The likeliest configuration for these systems is an 
integrated gasification combined-cycle power plant, or 
IGCC. While both IGCC and precombustion C0 2 cap¬ 
ture technologies are available today, the costs limit 
commercial deployment. Four gigawatts of IGCC 
power plants have been built worldwide as of the end 
of 2007 [1], and no existing IGCC plants have C0 2 
capture. In 2003, the US Department of Energy created 
the FutureGen project [18], aimed at construction of 
a 275 MW IGCC + CCS plant capturing 1 million tons 
C0 2 /year. FutureGen halted in 2008 [19], and then 
restarted as an oxyfiring retrofit plant (see below). 
A similarly scaled project, GreenGen in China [20], 
is slated for completion in 2012 with C0 2 capture 
in 2014. 

Many companies have proposed specific IGCC 
designs with C0 2 capture, including GE, Siemens, 
and Mitsubishi. Overall, IGCC plans with CCS appear 
to have higher net efficiencies, lower water consump¬ 
tion, and lower energy penalties than other kinds of 


new plants with carbon capture [21]. The commercial 
availability of gasifiers, water-gas shift reactors, solvent 
towers, and gas turbines also provides some commer¬ 
cial advantages. However, the total capital costs for 
IGCC systems with CCS remain higher than conven¬ 
tional plants. 

Other precombustion plant designs show promise. 
For example, hydrogen and syngas from gasification 
can also be used to make chemicals such as ammonia, 
urea, or olefins. Coal-to-chemical plants can use some 
hydrogen like an IGCC plant and generate power as 
well. These poly generation plants show some advan¬ 
tages in terms of economic return (e.g., [22]). Several 
US projects, including the Texas Clean Energy Project 
[23] and HECA project [20] are attempts to manage 
costs and create economic return for precombustion 
plants through polygeneration. 

Given the high efficiency of physical solvents for 
precombustion, little research has focused on solvent- 
based precombustion technology improvement. Much 
research has focused on improving efficiency or reduc¬ 
ing costs of gasification or on better plant integration to 
improve overall efficiency and cost. Additional research 
has focused on alternative separation technologies, 
including hydrate formation, ceramic membranes, or 
exotic materials. Alternative gasification technologies 
such as molten metal gasifiers [24] or underground 
coal gasification (Friedmann 2009) remain potential 
avenues to large cost reductions for precombustion 
capture-based plant designs. 

Oxyfired Combustion 

The burning of a hydrocarbon fuel (coal, oil, gas, 
biofuel) in an oxygen-rich or pure oxygen environment 
for the purpose of C0 2 concentration or capture is 
called oxyfired combustion (REF). Oxyfiring is pre¬ 
mised on this basic chemical reaction: 

CH X + 0 2 -► C0 2 + XH 2 0 

where the coefficients x and X vary as a function of fuel 
type (for methane, x = 4 and X = 2). When oxygen is 
blended with the fuel stoichiometrically, this reaction 
balances and water can be separated with compression 
leaving concentrated, dry C0 2 . 

One of the potential challenges with oxyfiring com¬ 
bustion is that many hydrocarbon fuels will burn 
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extremely hot in a pure oxygen environment, making 
retrofits difficult and new reactors expensive. One 
common approach to this problem is to mix oxygen 
with C0 2 at near atmospheric oxygen concentrations 
(e.g., 25% oxygen and 75% C0 2 ) creating a “synthetic 
air” The C0 2 is recirculated after separation, and acts 
to moderate the temperature and kinetics of the fuel 
combustion in the reactor (REF). This approach is 
considered promising for retrofits, both in the power 
sector (subcritical and supercritical pulverized coal 
boilers) and in the industrial sector (e.g., in catalytic 
crackers common in hydrocarbon refining). 

A number of commercial projects have made 
progress on oxyfiring. In Europe, Vattenfall’s 
Schwartzepumpe pilot project successfully ran a 
30 MW thermal boiler with synthetic air. Similarly, 
Total successfully operates a 50 MW oxyfired boiler in 
association with injection into a depleted natural gas 
field in Lacq, France. The largest current program 
in oxyfiring is the revised FutureGen program 
(FutureGen 2.0; REF). This US project is attempting 
to retrofit a 250 MW boiler in Meridosia, Illinois 
for oxyfired combustion. This project plans to capture 
1.3 million tons C0 2 /year, over 90% of the plant 
emissions. 

The primary cost of oxyfired combustion is the cost 
of oxygen separation from air (REF). The air separation 
unit (ASU) requires substantial capital and operating 
costs. There are also some specific technical consider¬ 
ations associated with the operating pressure of the 
combustion unit. If the reactor burns its fuel at lower 
than atmospheric pressure (a common configuration 
for boilers), then there is a risk of air leaking into the 
reactor and contaminating the stream with nitrogen. 

Chemical looping [7] is considered an oxyfiring 
variant, although in most ways the technology is radi¬ 
cally different from synthetic air combustion. Instead, 
a metal carrier is oxidized, creating heat which can be 
used to run a steam cycle. Most commonly, the process 
is configured with two interconnected fluidized bed 
reactors: an air reactor and a fuel reactor. The solid 
oxygen carrier is circulated between the air and fuel 
reactors, and the metal oxide is reduced in the second 
stage. This process remains promising, in large part due 
to the very high theoretical conversion efficiencies (as 
high as 80%). However, no reactors larger than bench- 
top have proven viable yet, and questions remain about 


the operation, longevity, and stability of the metal 
oxide carriers. 

Direct Air Capture (DAC) 

Recent carbon cycle research by Solomon et al. [25] and 
Hansen et al. (2007) has resulted in increased interest in 
direct capture of carbon dioxide (C0 2 ) from the air 
(e.g., [26]). Pielke [27] has described the macro-scale 
economic playing field for air capture, while others 
([28], Fackner 2009, [29]) have begun research into 
specific technologies capable of removing C0 2 from 
the air. Several companies have recently formed to 
develop and commercialize direct air-capture systems, 
including Kilimanjaro, Carbon Engineering, and 
Global Thermostat. 

The key technical concern is the low partial pressure 
of C0 2 in air due to its low concentration (<390 ppm). 
This requires the movement of very large volumes of 
air through a capture unit, and high expense of energy 
to execute the separation. The American Physical Soci¬ 
ety Panel on Public Affairs [30] recently completed 
a fairly comprehensive assessment of the 2009 state of 
the art. The key messages of the report include three 
conclusions: 

• DAC is not currently an economically viable 
approach. 

• Generally, low-carbon power and energy is best 
used directly to minimize losses rather than used 
for DAC. 

• Eventually, DAC could play a role in capture and 
storage from decentralized sources of C0 2 , such as 
vehicles, ships, or planes. 

While the situation may change with C0 2 demand 
and price, the consensus today is that the economics 
and engineering of DAC remain formidable and diffi¬ 
cult to overcome. 

Geological Sequestration 

A number of geological reservoirs appear to have the 
potential to store many hundreds to thousands of Gt of 
C0 2 [31]. The most promising reservoirs are porous 
and permeable rock bodies at depth (Figs. 3 and 4). 

• Saline formations contain brine in their pore vol¬ 
umes, commonly with salinities greater than 
10,000 ppm. Because C0 2 is buoyant in most 
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Geological Storage Options for C0 2 

1 Depleted oil and gas reservoirs 

2 Use of C0 2 in enhanced oil recovery 

3 Deep unused saline water-saturated reservoir rocks 

4 Deep unmineable coal seams 

5 Use of C0 2 in enhanced coal bed methane recovery 

6 Other suggested options (basalts, oil shales, cavities) 


Produced oil or gas 
injected C0 2 


* Stored C0 2 



C0 2 Capture and Sequestration. Figure 3 

Options for storing C0 2 in underground geological formations (After Benson and Cook [31]) 


geological settings, target saline formations re¬ 
quire a cap rock or sealing unit above the main 
injection zone. 

• Depleted oil and gas fields have some combination of 
water and hydrocarbons in their pore volumes. In 
some cases, economic gains can be achieved 
through enhanced oil recovery or enhanced gas 
recovery [32-34]. Substantial C0 2 -enhanced oil 
recovery already occurs in the USA with both nat¬ 
ural and anthropogenic C0 2 . These fields provide 
much of the knowledge base available about the 
potential issues related to C0 2 sequestration. 

• Deep coal seams , often called unmineable coal 
seams, are composed of organic minerals with 
brines and gases in their pore and fracture volumes 
that can preferentially adsorb and bind C0 2 as well 
as store it in pores and minor fractures. These 
targets present some challenges in that coals are 
relatively low permeability units, presenting chal¬ 
lenges to injection. 


Because of their large storage potential and broad 
distribution, it is likely that most geological sequestra¬ 
tion will occur in saline formations. However, initial 
projects have been proposed for depleted oil and gas 
fields, accompanying enhanced oil recovery, due to 
the high density and quality of subsurface data 
and the potential for economic return. Although 
there remains some economic potential for enhanced 
coal bed methane recovery much less is known about 
this style of sequestration [2, 31, 35, 36]. Even less is 
known about sequestration in basalts. As such, many 
workers are not convinced of the economic viability of 
sequestration projects in coal, basalts, or oil shales 
given today’s technology and understanding [37]. 

Storage of large C0 2 volumes in geological forma¬ 
tions requires that the C0 2 be relatively dense, so that 
storage capacity is efficiently used. Given typical geo¬ 
thermal gradients and hydrostatic loads, C0 2 is likely 
to be in a supercritical state at most target sites greater 
than 800 m depth (e.g., [38]). At the likely range of 
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C0 2 Capture and Sequestration. Figure 4 

Schematic diagram of large injection at 10 years time illustrating the main storage mechanisms. All C0 2 plumes ( yellow ) are 
trapped beneath impermeable shales (not shown). The upper unit is heterogeneous with a low net percent usable 
porosity, whereas the lower unit is homogeneous. Central insets show C0 2 as a mobile phase [lower) and as a trapped 
residual phase [upper). Right insets show C0 2 dissolution [upper) and C0 2 mineralization [lower) (After MIT [2]) 


injection pressures and temperatures for most projects, 
C0 2 would be buoyant and gravitational forces would 
push C0 2 upward from the injection point. 

Consequently, trapping mechanisms are needed to 
store C0 2 effectively. For depleted oil and gas fields or 
for saline formations, C0 2 storage mechanisms are 
reasonably well defined and understood (Fig. 4). C0 2 
sequestration targets will require physical barriers to 
C0 2 migration out of the crust to the surface. These 
barriers will commonly take the form of impermeable 
layers (e.g., shales, evaporites) overlying the sequestra¬ 
tion target. This storage mechanism is highly or directly 
analogous to that of hydrocarbon trapping, natural gas 
storage, and natural C0 2 accumulations. At the pore 
scale, capillary forces can immobilize a substantial frac¬ 
tion of a dispersed C0 2 bubble, commonly measured 
to be between 5% and 25% of the C0 2 -bearing pore 
volume. The volume of C0 2 trapped as a residual phase 


is highly sensitive to pore geometry, and consequently 
is difficult to predict; however, standard techniques can 
measure residual phase trapping directly in the labora¬ 
tory with rock samples. 

Once in the pore volume, the C0 2 will dissolve into 
other pore fluids, including hydrocarbon species 
(oil and gas) or brines. Depending on the fluid com¬ 
position and reservoir condition, this may occur rap¬ 
idly (seconds to minutes) or over a period of tens to 
hundreds of years. Once dissolved, the C0 2 -bearing 
brines are denser than the original brines, and so the 
strong buoyant forces of free-phase gas are replaced 
by small downward forces. Over longer time scales 
(hundreds to thousands of years) the dissolved C0 2 
may react with minerals in the rock volume to dissolve 
or precipitate new carbonate minerals. For the majority 
of the rock volume and major minerals, this process is 
slow, and may take hundreds to thousands of years to 
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achieve substantial storage volumes. Precipitation of 
carbonate minerals permanently binds C0 2 in the sub¬ 
surface; dissolution of minerals generally traps C0 2 as 
an ionic species (usually bicarbonate) in the pore fluid. 

Although work remains to characterize and quan¬ 
tify these mechanisms, the current level of under¬ 
standing can be used today to develop estimates of 
the percentage of C0 2 that can be stored over some 
period of time. Confidence in these estimates is bol¬ 
stered by studies of hydrocarbon systems, natural gas 
storage operations, hazardous waste injection, and 
C0 2 -enhanced oil recovery (C02-E0R). Current eval¬ 
uations of CCS effectiveness based on the current 
understanding of trapping mechanisms estimate that 
more than 99.9% of injected C0 2 can be reliably stored 
over 100 years , and it is likely that 99% of C0 2 can be 
reliably stored for 1,000 years [31]. These estimates 
assume careful siting, due diligence before injection, 
and appropriate management of injection, and reflect 
the view that the crust contains many sites that are 
generally well configured to store C0 2 effectively. 

Large-Scale Commercial Deployment 

In order to achieve substantial GHG reductions, geo¬ 
logical storage deployment has several requirements: 

• Projects must be large in scale, roughly in the order 
of 1 Mt/year C0 2 or more (Friedmann 2006). 

• There must be minimal leakage from the under¬ 
ground storage reservoirs back to the atmosphere. 

• There must be minimal impact on other uses of 
the subsurface environment and the resources 
it contains. 

The issue of scale dominates deployment of 
GCS ([5, 39]; McFarland 2004; [4]). These volumes 
would have geological carbon sequestration, providing 
25-75 Gt C over 50 years, or 15-43% of emissions 
reduction needed to stabilize atmospheric C0 2 levels 
at 550 ppm [39]. 

Today there are five well-established large-scale 
injection projects with an ambitious scientific program 
that includes monitoring and verification (Table 1): 
Sleipner in Norway [40], Weyburn in Canada [41], In 
Salah in Algeria [42]; Snohvit in northern Norway 
(REF); and Cranfield, Mississippi (REF). Each project 


has injected C0 2 at the rate of ~1 MM t/year 
(~280,000 t C/year). Each project has had a substantial 
supporting science program or anticipates one. 
Recently, the Global CCS Institute in Australia com¬ 
piled a list of active and near active projects, as well as 
a roster of announced integrated and pilot projects 
(REFS). A few are listed here. 

These projects have sampled a wide array of geology 
with varying trapping mechanisms, injection depths, 
reservoir types, and injectivity. Each of these projects 
appears to have ample injectivity and capacity for suc¬ 
cess, and none has detected C0 2 leakage of any signif¬ 
icance. In addition to the three sequestration projects, 
many industrial applications have injected large 
volumes of C0 2 into the subsurface. EOR operations 
in W. Texas, New Mexico, Colorado, Wyoming, 
Oklahoma, Mississippi, Trinidad, Canada, and Turkey 
have individual injection programs as large as 3 MM t 
C0 2 /year (~820,000 t C/year) and cumulative anthro¬ 
pogenic emission injections of ^10 MM t C0 2 /year 
(2.7 MM t C/t) [43]. It should be said that the moni¬ 
toring and verification program at each site varies 
substantially [2] . In many EOR projects, there is almost 
no monitoring beyond that required for C0 2 flood 
operations [32]. 

It is worth noting that many of these projects have 
come online recently in China, which has the potential 
to ramp up projects very rapidly ([44, 45]; Liu and 
Gallagher 2009). It should also be noted that all of the 
first projects do not involve capture from a power 
plant, but rather from industrial facilities where C0 2 
is available at a relatively low cost. While this does 
improve the knowledge of GCS greatly, some questions 
will remain about plant integration and economics 
after these projects which will only be resolved through 
successive scale-up and demonstration. 

Science and Technology Status 

As discussed above, the knowledge of trapping mecha¬ 
nisms and the successes of the three large projects 
provide substantial information. These are augmented 
by studies of naturally occurring C0 2 systems, [46], 
natural gas storage facilities, hazardous waste disposal, 
acid gas injection, and C0 2 -EOR [31]. This knowledge 
provides a firm foundation for commercial action and 
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C0 2 Capture and Sequestration. Table 1 Some current and pending large C0 2 injection projects 


Site 

Location 

Reservoir class 

Plant/capture type 

Permeability 

Seal type 

Start 

date 3 

Sleipner 

Norway 

Offshore Saline Fm. 

Gas refining 

Very high 

Thick 

shale 

1996 

Weyburn 

Canada 

Onshore EOR 

SNG plant/ 
precombustion 

Moderate 

Evaporite 

2000 

In Salah 

Algeria 

Onshore Sandstone 

Gas refining 

Low 

Thick 

shale 

2004 

Snohvit 

Norway 

Offshore Saline Fm. 

Gas refining 

Moderate 

Shales 

2008 

Cranfield 

USA 

(Mississippi) 

Onshore Saline Fm. 
And EOR 

Natural C0 2 dome 

Very high 

Mudstone 

2009 

FutureGen 2.0 

USA (Illinois) 

Onshore Saline Fm. 

IGCC/precombustion 

Moderate to 
very high 

Thick 

shales 

2016 

ZeroGen 

Australia 

Onshore EOR/ 

Saline 

IGCC/precombustion 

Low to 
moderate 

Shale 

2018 

GreenGen 

China 

Offshore EOR +/- 
Saline Fm. 

IGCC/precombustion 

Low to 

moderate 

Shales 

2015 

HECA 

USA (California) 

Onshore EOR 

IGCC/precombustion 

Moderate to 
high 

Thick 

shale 

2016 

Gorgon 

Australia 

Offshore Saline Fm 

Gas refining 

Moderate 

Thick 

shales 

2013 

SaskPower 
Boundary Dam 

Canada 

(Saskatchewan) 

Onshore EOR +/— 
Saline Fm. 

SC boiler/Post- 
combustion 

Moderate 

Evaporite 

2016 

Hauneng 

Shidongkou 

China 

Onshore saline Fm. 

USC boiler/post¬ 
combustion 

Unclear 

Shales 

2016 

Shenhua DCL/ 
Majiata 

China 

Onshore Saline Fm. 
+/- EOR 

Coal liquefaction/ 
precombustion 

Low to 
moderate 

Shales 

2013 

Archer-Daniels 

Midland 

USA (Illinois) 

Onshore Saline Fm. 

Ethanol by-product 

Moderate to 
high 

Shales 

2013 

TCEP 

USA (Texas) 

Onshore EOR 

Polygeneration 

precombustion 

Moderate to 
high 

Evaporite 

2017 


a = date of first injection or planned first injection of C0 2 


a nascent foundation for the development of regula¬ 
tion, standards, and legal frameworks for sequestration 
(REFS, including Wilson et al. 2009; CCSREGS project; 
EPA; EU ruling). GCS itself, however, drives study into 
specific technical and scientific challenges associated 
with the central elements of site characterization, selec¬ 
tion, operation, and monitoring ([36]; WRI 2009). 
Forward investigation around these topics will enhance 
the technical and operational understanding of com¬ 
mercial GCS. 


Monitoring and Verification (M&V) 

Monitoring and verification must detect and track C0 2 
in the deep subsurface near injection targets, in the 
shallow subsurface, and above ground. Monitoring 
and verification studies are a chief focus of many 
applied research efforts. The US Department of Energy 
has defined M&V technology development, testing, 
and deployment as a key element to their technology 
roadmap (US DOE 2010). The European Union 
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C0 2 ReMoVe effort is dedicated to monitoring and 
verification, and the industry-led C0 2 Capture Project 
continues to study monitoring in commercial settings 
[47]. Some form of M&V will be required at commer¬ 
cial sites, but the extent of monitoring required by 
regulators, operators, or financiers remains uncertain. 
Many geophysical and geochemical methods are suffi¬ 
ciently well understood for them to be used to make 
reasonable performance predictions at candidate stor¬ 
age sites ([48]; WRI 2009). Testing which of these 
approaches will be the most valuable for a given geo¬ 
logical environment remains to be determined. 

Key Science and Technology Gaps 

Despite the tremendous amount of applied and basic 
knowledge, there remain both cross-cutting-specific 
and site-specific topics for investigation (e.g., [49]). 
From an applied perspective, the National Energy 
Technology Laboratory has written several plans to 
identify and address key technology gaps (e.g., [36]). 
Recent new R&D projects aimed at specific compo¬ 
nents of R8d3 include the National Risk Assessment 
Program (NRAP), the Carbon Capture Simulation Ini¬ 
tiative (CCSI; REF), Weyburn’s Final phase project 
(REF), and portions of Chinas project 836 and 927. 
While these efforts and reports are not meant to be 
comprehensive, they reflect the current state of knowl¬ 
edge and potential to continue scientific investigations 
in GCS. 

Deployment Challenges 

Despite the current gaps in sequestration science and 
technology, commercial projects have begun and are 
ready to proceed with confidence in their success. 
Today, enough is known to safely and effectively exe¬ 
cute key tasks around single large-scale injection 
projects: 

• Characterize a site. 

• Design and operate the project. 

• Monitor the C0 2 injection. 

• Mitigate problems that might arise. 

• Close and abandon the project. 

Although this knowledge is currently being brought 
to bear on specific injection projects around the world, 
greater work is needed to codify and develop tools, 


regulations, and standards for deployment of multiple 
million ton injections in thousands of wells nationwide 
and worldwide across a range of geological settings. 
This affects both sides of the deployment rubric over 
the project life cycle ([50]; WRI 2009). Potential oper¬ 
ators must execute a set of tasks to prepare for and 
execute injection permitting and operation. Similarly, 
potential regulators, investors, insurers, and public 
stakeholders require information to make decisions. 
Part of the challenge is to provide a technical basis for 
each set of actors to make decisions concerning the 
minimal amount of information needed to serve all 
stakeholders. 

While many possible goals and terms may be pur¬ 
sued in site characterization, it is difficult to imagine 
the success of a large-scale injection project without 
knowledge of three parameters. These are injectivity , 
capacity ; and effectiveness (Friedmann 2006). In general 
terms, injectivity and capacity may be estimated by 
conventional means, such as special core analysis, 
regional and local structural and stratigraphic map¬ 
ping, and simple multi-phase fluid flow simulations. 
However, there are explicit standard measures of effec¬ 
tiveness. Ultimately, characterizations must rely on 
estimates of geomechanical integrity, hydrodynamic 
stability, and seal continuity for the rock system, fault 
system, and well system [51-55]. 

Given this complexity, it is not broadly accepted 
today what terms constitute effective storage. The 
FutureGen project’s request for proposals [18] laid 
out a set of minimum criteria for acceptance. These 
criteria are based on expert opinion, laboratory exper¬ 
imentation, analog studies, and simulations. Similarly, 
the World Resources Institute (2009) proposed a set 
of guidelines to potential operators based on a working 
group of over 40 experts ranging from geoscientists 
to lawyers and regulators. However, the few active pro¬ 
jects today do not constitute a large empirical data set 
from multiple geological carbon sequestration deploy¬ 
ments to provide a definitive standard for characteri¬ 
zation of site effectiveness. 

Hazards Assessment and Risk Management 

Supercritical C0 2 is buoyant and will seek the Earth’s 
surface; therefore, C0 2 injection carries the possibility 
of leakage. Importantly, C0 2 leakage risk will not be 
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uniform across all sites, thus C0 2 storage sites will have 
to demonstrate minimal risk potential in their site 
characterization plans [56, 57]. Based on analogous 
experience in C0 2 injection such as acid gas disposal 
and enhanced oil recovery, these risks appear to be less 
than those of current oil and gas operations [31]. 

The direct hazards associated with geologic seques¬ 
tration fall into three distinct categories: 

• Hazards associated with the release of the carbon 
dioxide to the Earth surface 

• Hazards associated with release into groundwater 
and subsequent degradation 

• Hazards associated with Earth movement caused by 
the injection process itself 

The hazards themselves in turn are associated with 
failure mechanisms and triggers (Friedmann 2007; 
[56]). Potential triggering of events associated with 
these hazards could lead to undesired consequences. 
As such, it is an important goal to identify and under¬ 
stand these hazards in order to avoid triggering hazard 
events. Identification and characterization of these haz¬ 
ards is the critical first step to managing the risks at 
a site. They also serve as the basis for a quantitative 
probabilistic risk assessment (PRA). A robust PRA 
cannot be made today due to substantial scientific 
and technical gaps. However, the hazards associated 
with a site can be identified, mapped, characterized, 
and parameterized sufficiently to avoid failure or 
(alternatively) avoid selecting a bad site. 

Water Issues 

A number of issues arise from CCS regarding water use 
and quality. These include additional water use due to 
process steam and additional hear requirements in the 
plants; displacement of in situ brines; and potential 
risks to groundwater quality from unplanned C0 2 
leakage [58]. Additional water needs for surface facili¬ 
ties is highly sensitive to capture process and plant type, 
ranging from 20% to 110% additional water consump¬ 
tion ([59]; b). C0 2 injected into saline formations 
necessarily interacts with water and brines there. This 
interaction includes displacement of the brines there, 
local drying of the reservoir, and dissolution of C0 2 
into brine forming carbon acid. Early research focused 
on the potential risks and impacts of acid brine forma¬ 
tion, ultimately concluding that these risks are small. 


Injection also creates a pressure transient in the 
reservoir which increases through time. In the right 
context, this pressure can drive water co-production 
(enhanced water recovery) and even drive surface desa¬ 
lination processes [60, 61]. Based on preliminary cal¬ 
culations, it appears that water co-production can cut 
the water demands for an integrated CCS plant by 50%, 
or produce freshwater for agricultural or civil use. More 
work is needed to understand the viability of this tech¬ 
nical approach. 
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PM Particulate matter 

psi Pounds per square inch 

R&D Research and development 

RD&D Research, development, and demonstration 

S0 2 Sulfur dioxide 

TCR Total capital requirement 

Introduction 

The provision of electric power is one of the prerequi¬ 
sites of prosperity; there is strong correlation between 
electric power-generating capacity and per capita gross 
domestic product (GDP) (Fig. 1) [8]. Across the world, 
economic indicators signal that there will be continued 
growth and increased electricity demand; also, at least 

3.6 billion people lack adequate access to electricity and 

1.6 billion have no electricity at all. 

During the next 15 years, 1,200 GW of new capacity 
is projected to be added to the world’s present electric 
generation capacity of about 4,000 GW. The EIA 
projects that by 2030 global demand for electricity 
will grow by more than 75%. 

The U.S. Energy Information Administration (EIA) 
data for 2008 show that 54% of the electricity supply in 
the USA was generated by coal, 12% by natural gas 


(increasing to an estimated 32% by 2020), and about 
2% by oil, with the rest produced by nuclear power 
(21%), hydropower (9%), and by renewable solar or 
wind (2%). 

Electricity generation (billion kilowatt-hours) by 
fuel type from 1970 to 2020 in the USA is illustrated 
in Fig. 2 [20]. The increase in the use of coal and, 
especially, natural gas for power generation projected 
for the period from 2000 to 2020 is to meet growing 
demand for electricity and to offset the projected retire¬ 
ment of nuclear power plants. 

Because of the large coal reserves in major develop¬ 
ing countries such as China, India, and Indonesia, 
where most of the new power-generating plants are 
being installed, it can be expected that coal will remain 
the dominant source of power generation worldwide 
during the first half of this century. There is also 
a strongly growing demand for natural gas, a clean 
fuel capable of being used in power generation with 
high efficiency. 

Coal as an energy source has the attraction of broad 
availability (there are large reserves in several countries 
around the world), low cost, and economic utilization 
by mature technologies. 
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Prosperity and electricity (Source: EPRI [8]) 
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C0 2 Reduction and Coal-Based Electricity Generation. Figure 2 

US Electricity generation by fuel, 1970-2020 (billion kilowatt-hours) (Source: US EIA [5]) 


Emissions from coal have decreased 



1970-2007 


C0 2 Reduction and Coal-Based Electricity Generation. Figure 3 

US emissions of criteria pollutants and electricity from coal during 1970-2007 (Source: NCC Report [14]) 


The disadvantages are due mainly to environmental 
and health impacts at the mining phase and in the 
course of coal utilization. Both the mining and 
the utilization of coal are subject to increasingly 
tightening standards by governmental regulations. 

Because of no acceptable practical alternative 
to coal and natural gas, industrial research and 
development programs since the 1970s have been 
concentrated on methods of reducing the emissions 
of so-called criteria pollutants (S0 2 , NO x , Hg, and 


fine particulates). New environmental technologies 
developed and demonstrated under the Clean Coal 
Technology Program of the U.S. Department of Energy 
(DOE) during the past 40 years have become commer¬ 
cial and are being applied worldwide. 

US emissions reductions have been significant in 
the face of increasing output of electricity generation 
coal (Fig. 3) [14]. 

The reduction of C0 2 emission remains a challenge. 
Carbon capture and geological sequestration (CCS) is 
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the key enabling technology for the long-term signifi¬ 
cant reduction of C0 2 emissions from coal-based 
power generation. It is expected that CCS will become 
commercial (deployed without significant government 
subsidy) for base load power generation around 
2020-2025 following the construction and operation 
of several demonstration plants during the present and 
next decade. 

Before the advent of commercial CCS, however, 
higher efficiency technology options are available for 
coal-based power generation. Improved efficiency will 
reduce the amount of fuel used for the generation of 
a given electric energy output and hence reduce the 
emissions of all pollutants. Also, when the time comes 
for CCS deployment, higher efficiency will reduce 
the energy penalty and make new plant or retrofit of 
existing plant less expensive [1, 2, 6]. 

Main higher efficiency technology options 
discussed below are: 

• Pulverized Coal Combustion in Ultra-supercritical 
Steam Cycle 

• Integrated Coal Gasification Combined Gas Turbine- 
Steam Cycle (IGCC) 

• PC or CFB/Oxy/Combustion in Supercritical Steam 
Cycle 


Pulverized Coal in Ultra-Supercritical Steam Cycle 
(PC/USC) 

The efficiency of PC/SC power plant can be increased 
in small steps to 43% (HHV) and beyond, as illustrated 
in Fig. 4 [18]. 

The first two steps in the diagram concern the waste 
gas heat loss, the largest of a boiler’s heat losses, about 
6-8%. The air ratio , usually called excess air factor, 
represents the mass flow rate of the combustion air as 
a multiple of the theoretically required air for complete 
combustion. Excess air increases the boiler exit gas 
mass flow and, hence, the waste gas heat loss. Improved 
combustion technology, e.g., finer coal grinding and 
improved burner design, permit lowering the excess air 
without sacrificing completeness of combustion. Some 
of these remedies require additional expenditure in 
energy, e.g., for finer coal grinding, and for increasing 
the momentum flux of the combustion air through 
the burners, but this increase in parasitic energy is 
usually small compared to the efficiency gain due to 
the reduced excess air. 

The boiler exit gas temperature can be reduced by 
appropriate boiler design limited only by the dew point 
of the flue gas. There is a close relationship between the 
excess air of combustion and the low limit of exit gas 
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Steps to improve the efficiency in PC/Supercritical Coal Plant (Schilling [18]) 
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temperature from a boiler fired by a sulfur-bearing fuel. 
Higher excess air leads to an increase in the oxidation of 
S0 2 to S0 3 , with S0 3 promoting sulfuric acid forma¬ 
tion in the combustion products. Sulfuric acid vapor 
increases the dew point of the flue gas and hence raises 
the permissible minimum exit gas temperature. At an 
exit gas temperature of 130°C (266°F) a reduction of 
every 10°C (18°F) in boiler exit temperature increases 
the plant efficiency by about 0.3%. 

The Rankine cycle efficiency is proportional to the 
pressure and temperature of heat addition to the 
cycle, and is inversely proportional to the condenser 
pressure, and therefore to the temperature of the 
cooling medium. The usual design basis for condenser 
pressure in the USA is 2.0” Hg abs. (67 mbar). 
Power plants in Northern Europe with access to 
lower-temperature cooling water use condenser pres¬ 
sure of 1.0” Hg abs. (30 mbar) pressure. This difference 
can produce an efficiency gain of more than 2 
percentage points [19]. 

As steam pressure and superheat temperature are 
increased above 221 bar (3,208 psi) and 374°C (706°F) 
the steam becomes supercritical (SC); it does not 
produce a two-phase mixture of water and steam as 
in subcritical steam, but instead undergoes a gradual 
transition from water to vapor with corresponding 
changes in physical properties. 

Ultra-supercritical (USC) steam generally refers to 
supercritical steam at more than 1,100°F temperature. 
EPRTs terminology for 1,300°F and 1,400°F plants is 
Advanced Ultra-supercritical (AUSC). 

The thermodynamic efficiency of the Rankine 
steam cycle increases with increasing temperature 
and pressure of the superheated steam entering the 
turbine. It is possible to increase further the mean 
temperature of heat addition by taking back partially 
expanded and reduced temperature steam from the 
turbine to the boiler, reheating it, and reintroducing it 
to the turbine. In the usual designation of steam 
parameters the second and third temperature refers 
to single and double reheat, e.g., 309 bar/594°/594°/ 
594° C, respectively. 

The average efficiency of the existing US coal-fueled 
electricity generating fleet is 33%, based on the higher 
heating value (HHV) of the coal. 

Pulverized coal plants with USC parameters of 
300 bar and 600/600°C (4,350 psi, 1,112/1,112°F) can 


be realized today, resulting in efficiencies of 43% 
(HHV). These plants are 21% more efficient than the 
mean of today’s US fleet of coal-fired plants; i.e., they 
would use 21% less coal for the same power generation 
and emit 21% less C0 2 . There are several years of 
experience with these 600° C (1,112°F) plants in service, 
with excellent availability [4, 11]. 

Further improvement in efficiency achievable 
by higher ultra-supercritical steam parameters is 
dependent on the availability of new, nickel alloys for 
boilers and steam turbines (Fig. 5) [10, 17]. 

Two major development programs in progress, the 
Thermie Project of the European Commission and 
a US program managed by EPRI for the USDOE’s 
NETF and the Ohio Coal Development Office 
(OCDO), aim at steam parameters of 375 bar, 
700°C/720°C (5,439 psi, 1,292°F/1,328°F), and 
379 bar, 760°C/760°C (5,500 psi, 1,346°F/1,400°F), 
respectively. The plant efficiency increases by about 1 
percentage point for every 20°C rise in superheat and 
reheat temperature. An advanced 700°C (1,293°F) USC 
plant will likely be constructed during the next 7-10 
years constituting a benchmark for 46% efficiency 
(HHV) coal-fired power plant. 

It is estimated that before 2020, the time when CCS 
technologies may begin to become commercially avail¬ 
able, about 13 Gigawatts (GW) new coal-based capac¬ 
ity will be constructed in the USA (EIA 2010). If more 
efficient presently available (1,112°F) USC technology 
is utilized instead of subcritical steam plants, C0 2 
emissions would be about 570 MMt less during the 
30 years’ lifetime of those plants, even without 
installing a C0 2 capture system. 

High efficiency coal-based power generation is also 
important to long-term solutions of reducing C0 2 
emissions by using CCS, as it mitigates the significant 
energy cost of CCS application. Because of the reduced 
coal use for a given electricity output, the plant has 
a smaller footprint with respect to size of coal handling 
and emission control systems. These savings and the 
use of modern analytical techniques that enable opti¬ 
mal use of Ni alloys can minimize the cost of USPC 
technology. 

Coal consumption and C0 2 emission comparisons 
for presently available 500 MW PC/SubC, PC/SC 
and PC/USC plants without CCS are presented below 
in Table 1 [13]. 
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C0 2 Reduction and Coal-Based Electricity Generation. Figure 5 

Stages in materials development and related advanced steam parameters (Henry et al. [10]) 


C0 2 Reduction and Coal-Based Electricity Generation. Table 1 Comparative coal consumptions and emissions of air- 
blown pulverized coal combustion technologies without CCS (MIT Coal Study [13]) 


Performance 

Subcritical 

PC/SC 

PC/USC 

Heat rate (Btu/kWh) 

9,950 

8,870 

7,880 

Gen. efficiency (HHV) 

34.3% 

38.5% 

43.3% 

Coal use (MMt/year) 

1.54 

1.37 

1.22 

C0 2 emitted (MMt/year) 

3.47 

3.09 

2.74 

C0 2 emitted (g/kWh) 

931 

830 

738 


Assumptions: 500 MW net plant output; Illinois #6 coal; 85% capacity factor 


Coal Gasification Combined Gas Turbine-Steam 
Cycle (IGCC) 

Gasification-based technologies use partial oxidation 
of coal with oxygen as the oxidant to produce a 
synthesis gas (syngas) consisting mainly of CO 
and H 2 . The gas is cleaned to remove contaminants 
before it is used as fuel in a combustion turbine. 


The exhaust gas of the gas turbine raises steam in 
a heat recovery steam generator (HRSG) for a steam 
turbine-electric generator set. The combined cycle effi¬ 
ciency improves through the reduced effect of the 
steam condenser’s heat loss. As with combustion tech¬ 
nologies, higher efficiency results in lower emissions 
per MWh. 
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While the IGCC concept is being successfully dem¬ 
onstrated in two plants in the USA and two plants in 
Europe, utility power generation demands introduce 
new challenges that will require further RD&D to over¬ 
come. The gasification process operates best under 
steady-state conditions. The load change conditions 
associated with utility electricity generation will burden 
the technology. The many chemical syngas cleanup 
processes will have to respond to these changes on 
a real-time basis. In addition, the gasifier and associ¬ 
ated gas cleanup systems will be exposed to a much 
larger range of fuel quality than experience has dem¬ 
onstrated. Again, this variation introduces conditions 
that require more RD&D to commercialize. 

In the prevalent designs the gasifier operates at high 
pressure. Bituminous coal is pulverized and fed as coal- 
water slurry [7]. The temperature in the gasifier is 
above 1,300°C, so that the coal ash can be removed as 
liquid slag. The product syngas undergoes rigorous 
cleanup of particulates, sulfur and nitrogen oxides, 
and mercury, prior to entering the gas turbine (GT) 
combustor. Precombustion cleanup of the syngas is 
economically favorable compared to postcombustion 
flue gas cleanup because of the lower volume flow 
rate, undiluted by nitrogen, and elevated pressure. 
The syngas can be cooled prior to cleanup by radiant 
and convective heat exchangers which raise steam, and 
improve plant efficiency. However, because of the addi¬ 
tional cost and operational problems due to fouling, 
the convective heat exchanger is often omitted in recent 
designs at the detriment of the plant efficiency. 

Another element of efficient design is the integra¬ 
tion of subsystems with the main generating plant. For 
example, air can be fed from the main compressor of 
the GT to the Air Separation Unit that produces oxygen 
for the gasification process, and the nitrogen can be 
returned to the gas turbine combustor as a diluent to 
reduce NO x formation. The gain in improved effi¬ 
ciency, however, is weighed against operating a more 
complicated plant of somewhat reduced availability. 

The current IGCC units have, and next generation 
IGCC units are, expected to have electricity generating 
efficiencies that are less than or comparable to those of 
supercritical PC generating units [9]. 

Current units typically gasify high-heating value, 
high-carbon fuels. Polk IGCC with a Texaco-GE 
coal-water-slurry gasifier and radiant syngas cooling 


operates at 35.4% (HHV) generating efficiency. The 
Wabash River IGCC with a coal-water-slurry fed 
E-Gas gasifier, radiant and convective syngas cooling, 
and no integration operates at about 40% generating 
efficiency. The IGCC in Puertollano Spain with a 
dry-feed shell-type gasifier, radiant and convective 
heat exchangers, and combustion turbine-air separa¬ 
tion unit integration has a generating efficiency of 
about 40.5% (HHV). Supercritical PC units operate 
in the 38-40% efficiency range, and ultra-supercritical 
PC units in Europe and Japan are achieving 42-46% 
(HHV) generating efficiency. 

A National Research Council Panel reviewing the 
DOE’s Advanced Gasification Research Program esti¬ 
mated the improvements upon successful completion 
of the research. The research areas and anticipated 
benefits are listed in Tables 2 and 3 [15]. 

C0 2 Capture and Compression in PC and IGCC 
Plant 

C0 2 capture from pulverized coal combustion (PC) 
involves postcombustion cleanup, the separation and 
recovery of C0 2 that is at low concentration and low 
partial pressure in the exhaust gas. Chemical absorption 
with amines is presently the only commercially available 
technology. The C0 2 is first captured from the exhaust 
gas stream in an absorption tower. The absorbed C0 2 
must then be stripped from the amine solution using 
a large amount of steam, regenerating the solution for 
recycle to the absorption tower. The recovered C0 2 is 
cooled, dried, and compressed to a supercritical fluid. 
It is then ready to be piped to sequestration. 

The use of steam for C0 2 removal reduces the 
steam available for power generation. 

To maintain constant net power generation the coal 
input, boiler, steam turbine/generator, and emission 
control equipment must all be increased in size. 
The thermal energy required to recover C0 2 from the 
amine solution reduces the efficiency by 5 percentage 
points. The energy required to compress the C0 2 to 
a supercritical fluid is the next largest factor, reducing 
the efficiency by 3.5 percentage points. All other energy 
requirements amount to less than 1 percentage point. 

R&D is in progress pursuing the use of alternative 
sorbents, such as chilled ammonia to reduce the energy 
intensity of the C0 2 capture process. 
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C0 2 Reduction and Coal-Based Electricity Generation. Table 2 Evolutionary improvements due to DOE advanced 
gasification research 



Change in cost and performance of IGCC system due to R&D 


Increase in 
efficiency 

(%) 

Capital cost 

reduction 

($/kW) 

O&M 

reduction 

($/year) 

Reduction in 
COE ($/MWh) 

Availability 
improvement (%) 

Emissions 

DOE gasification project areas 

Warm gas cleanup 

1-2 

70-100 

Minimal 

1.8 

0 

500 ppb 
sulfur 3 

Instrumentation 

(temperature 

measurements) 13 

0.5-1 

0 

Minimal 

Minimal 

1-2 


Materials (refractory) 0 

0 

0 

2 million 

0.5 

4-6 

- 

Non-DOE or non-gasification technical areas 

Heat recovery 

3 






Industry learning and 
evolution 

1-2 







Source : National Research Council 

COE cost of electricity, O&M operation and maintenance, kW kilowatt, MWh megawatt-hour 
a Air Products & Chemicals, Inc. analysis 
internal report by Parsons 

c The DOE turbine program is considered a complementary program to advanced gasification program 


C0 2 Reduction and Coal-Based Electricity Generation. Table 3 Evolutionary improvements due to DOE Advanced 
Gasification Research 



Change in cost and performance of IGCC system due to R&D 


Increase in 
efficiency (%) 

Capital cost 
reduction ($/ 
kW) 

O&M 

reduction 

($/kWh) 

Reduction in 
COE ($/MWh) 

Availability 

improvement 

(%) 

Emissions 

DOE gasification project areas 

Ion transport 
membrane air 
separation 3 

1 

75 

Minimal 

1.4 

0 


Stamet pump (from 
backup material) 13 

0.5 

40-100 

Minimal 

1.4-1.8 

0 

- 

Non-DOE or non-gasification technical areas 

DOE turbine 
program 0 

2-3 (for combined 
cycle power island) 

60-100 






Source : Gary Stiegel [22], U.S. Department of Energy National Energy Technology Laboratory (for data on DOE gasification project area cost 
and performance) 

COE cost of electricity, O&M operation and maintenance, kW kilowatt, kWh kilowatt-hour, MWh megawatt-hour 
a Air Products & Chemicals, Inc. analysis 
internal report by Parsons 

c The DOE turbine program is considered a complementary program to advanced gasification program 
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The stakes are high because a successful solution 
would be applicable to both new plants and to the 
retrofit of existing plants with CCS [6]. 

IGCC lends itself favorably for efficient C0 2 capture 
and sequestration because C0 2 can be separated from 
a relatively small volume of fuel gas (syngas) at high 
pressure. Without CCS, IGCC is more expensive, and 
has lower efficiency and availability than PC/SC and 
PC/USC technology but, if CCS were available today, 
equipped with CCS, IGCC would be cheaper. By 
recirculating part of the liquid phase supercritical 
C0 2 the water in the slurry feed can be replaced by 
C0 2 . This could reduce the oxygen required for the 
vaporization of the water in the slurry and for 
the attainment of the high temperature necessary for 
trouble-free liquid slag removal from the gasifier. 

For coals of lower heating value such as subbitumi- 
nous coals or lignite, the COE gap is, however, substan¬ 
tially narrowed or is even reversed. 

It is noteworthy that there is significant cost and 
performance loss attached to the capture and compres¬ 
sion of C0 2 from both combustion and gasification 
plants in preparation of its sequestration [5]. 

Results of studies presented in Table 4 provide 
information on C0 2 emitted, efficiency, estimates of 
total plant cost and cost of electricity for IGCC, PC/ 
USC technologies without and with C0 2 capture and 
compression, and for SC/PC/Oxy. 

PC/Oxy/USC 

When oxygen, instead of air is used as oxidant 
for combustion, the mass flow rate of combustion 
products is significantly reduced and the flue gas C0 2 


concentration is greatly increased. In order to avoid 
unacceptably high temperatures in the boiler, combus¬ 
tion products, mainly C0 2 , are recirculated from the 
end of the boiler to the combustion chamber. This 
restores the furnace gas temperature to air combustion 
levels resulting in an 0 2 volume concentration of about 
30%, compared to 21% for air-blown combustion. This 
difference is due to the higher specific heat of C0 2 than 
that of the replaced nitrogen, and also, to C0 2 ’s high 
radiative emissivity. Flue gas recirculation (FGR) 
increases the C0 2 concentration in the flue gas to 
beyond 90%, the complement being N 2 , due to air 
leakage and about 3% 02 required for complete 
burn out of coal. This makes the flue gas ready for 
sequestration after the removal of condensables but 
without energy-intensive gas separation. If avoidance 
of corrosion in the compressor and pipeline requires 
further exhaust gas polishing, the fivefold reduced flue 
gas volume leads to strongly reduced capital and 
treatment costs relative to those for an air-blown 
combustion plant [12]. 

The presently available Cryogenic Air Separation 
process consumes a significant fraction of the generat¬ 
ing plant’s output and reduces its efficiency by 6.4 
percentage points. The development of membrane- 
type oxygen processes with greatly reduced energy 
requirements are urgent R&D targets 

CFBC with Oxy/FGR 

CFBC with an external heat exchanger lends itself 
favorably to oxy/fuel application because the solids 
circulation provides an effective means, additional 
to flue gas recycle, for controlling the combustion 
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C0 2 Reduction and Coal-Based Electricity Generation. Table 4 C0 2 emissions, efficiency, and cost estimates for 
advanced generation technologies without and with CCS 



PC/SC 

PC/USC 

IGCC 

SCPC/Oxy 

Without 

With 

Without 

With 

Without 

With 

With CCS 

C0 2 emitted g/kWh 

830 

109 

738 

94 

824 

101 

104 

Efficiency, % HHV 

38.5 

29.3 

43.4 

34.1 

38.4 

31.2 

30.6 

TCR $/kW 

2,800 

4,524 

2,865 

4,408 

3,016 

3,996 

4,016 

COE c/kWh 

6.00 

9.64 

5.93 

9.25 

6.44 

8.24 

8.82 


Source : C0 2 and efficiency data from MIT 2007. TCR and COE data modified after Booras [21] to include the effect of the recent increase in 
construction costs 
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temperature. Solids, consisting of sorbent, coal ash, 
and coal char particles, are precipitated from the gas 
stream exiting the combustor (riser) section of the CFB 
boiler and are split into two solids streams: one that is 
recirculated to the riser without cooling, and the other 
that is cooled in an external heat exchanger before 
recirculation. The cooling of the combustor by the 
cold solids permits the reduction of the rate of flue 
gas recirculation, with the result that the 0 2 concentra¬ 
tion in the feed stream can rise to above 40% without 
exceeding a limiting combustor temperature level 
of about 850°C required by the thermodynamic stabil¬ 
ity of CaS0 4 and by smooth fluidization. The 
corresponding lower gas mass flow leads to reduced 
size and cost of the boiler and of the post combustion 
plan. The advantages also include increased fuel flexi¬ 
bility including the capability of firing biomass, which 
may allow additional C0 2 credits. 

The oxygen-fired CFB concept has been validated in 
Alstom’s 3 MW th pilot scale test facility [16]. The 
results show that the technology can be developed 
from that of conventional CFB boilers, so that a 100- 
350 MW e oxy-fired demonstration plant could be 
developed in the short term followed by a commercial 
size plant in the midterm. 

Concluding Remarks 

Coal will continue to play a large and indispensable role 
in electricity generation, in a carbon-constrained world, 
under any scenario. The key enabling technology for C0 2 
emissions mitigation in coal combustion and gasifica¬ 
tion plants is C0 2 capture and sequestration (CCS). 

CCS has to be demonstrated at scale, integrated 
with power generation, and the legal framework of 
sequestration has to be developed before CCS becomes 
commercial, probably by the 2020-2025 period. 

Before the advent of CCS there will be about 
45 GW new coal-based electricity generating capacity 
constructed in the USA (about 1,000 GW worldwide) 
and the question arises of what the technology options 
are for these new plants. 

Increased efficiency of power generation is the 
most predictable and cost-effective method for C0 2 
emissions reduction. In coal plant without CCS it is 
also the only practical method for mitigating C0 2 
emissions now, and it remains important for future 


plants equipped with CCS to reduce the energy cost 
of C0 2 capture. 

Pulverized coal combustion in Rankine cycle 
steam plant is the prevailing utilization technology. 
Compared to average efficiency of the existing coal- 
based fleet, up to 24% reductions in C0 2 and pollutant 
emissions can be achieved today in commercial 
ultra-supercritical new plants. 

Coal gasification combined cycle (IGCC) plants are 
successfully demonstrated in the USA and in Europe. 
Today, without CCS, IGCC is more expensive, and has 
lower availability than high efficiency PC/SC and PC/ 
USC plants but, if CCS were available today, equipped 
with CCS IGCC would be cheaper. Also, IGCC has the 
capability of less expensively lowering further criteria 
pollutant emissions. 

Use of oxygen instead of air in combustion signif¬ 
icantly reduces the mass flow rate of combustion prod¬ 
ucts with the concomitant reduction of flue gas capital 
and treatment costs. Because of the high concentration 
of C0 2 in the flue gas it is expected that following the 
removal of pollutants and agents that could be corro¬ 
sive in the compressor and the pipeline, the flue gas can 
be sequestered without the separation of C0 2 . 

The RD8cD challenge is in the development of the 
membrane oxygen system that is less energy intensive 
than the presently available cryogenic air separation. 

There are important technology developments 
in progress that can change the performance and eco¬ 
nomics of advanced technology options by the time 
CCS will be commercial. A broad portfolio of advanced 
clean coal technology RD8cD should be aggressively 
pursued to meet the CCS challenge. 

Demonstration of C0 2 sequestration at scale and 
integrated with power generation will give the public 
more confidence that a practical carbon emission con¬ 
trol option exists and maintains opportunities for the 
lowest cost, widely available energy source to be used to 
meet the world’s pressing energy needs in an environ¬ 
mentally responsible manner. 
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Glossary 

Anthracite Anthracite is the highest rank of coal 
because it has undergone the greatest degree of 
metamorphosis away from peat. It features low 
volatile matter (< 10%) and high carbon, giving it 
the highest energy content of all coals. Semi¬ 
anthracite is somewhere in the middle between 
low volatile bituminous coal and anthracite. 

Ash Inorganic residues remaining after combustion. It 
has less than the initial mineral matter content 
because of chemical changes during combustion, 
i.e., the loss of water, carbon dioxide, and sulfurous 
compounds. 

Bituminous coal Bituminous coal lies between subbi- 
tuminous coal and semi-anthracite in terms of 
rank. This rank of coal is commonly divided into 
additional subgroups dependent upon the content 
of volatile material. 

Calorific value Corresponds to the amount of heat per 
unit mass when combusted. Can be expressed as 
gross calorific value, which is the amount of heat 
liberated during combustion under standardized 
conditions at constant volume so that all of the 
water in the products remains in liquid form, or 
as net calorific value, which is the maximum 
achievable heat release obtainable in a furnace at 
constant pressure. 

Carboniferous A geologic period and system that 
extends from 360 to 300 million years ago. 
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Coal preparation Physical and mechanical processes 
often applied to coal to make it suitable for 
a particular use. 

Coalification The process that transforms vegetation 
to peat succeeded by the transformation of peat to 
lignite and later higher coal ranks. 

Cretaceous A geologic period and system spanning 
from 145 to 66 million years ago. 

Devonian A geologic period and system spanning 
from 416 to 360 million years ago. 

Exploration The examination of an area by means of 
surface geological mapping, seismic techniques, 
geophysical methods, drilling of boreholes and 
sampling of the underlying sediments and rocks. 

Jurassic A geologic period and system extending from 
200 to 145 million years ago. 

Macerals Microscopic organic constituents of coal 
that may differ significantly in composition and 
properties depending on rank and geology. 

Ordovician A geologic period and system spanning 
from 490 to 445 million years ago. 

Permian A geologic period and system extending 
from 300 to 250 million years ago. 

Reserves The recoverable volumes that are dependent 
on certain arbitrary limits in respect of thickness, 
depth, quality, economics, legal restrictions, and 
other factors. Reserves are always less than 
resources. 

Resources The amount of coal in place before exploi¬ 
tation; all resources may or may not be economi¬ 
cally or technically recoverable; see Reserves. 

Silurian A geologic period and system spanning from 
445 to 416 million years ago. 

Stripping ratio The ratio of the thickness of overbur¬ 
den to that of the total workable coal section. 

Tertiary A geologic period and system extending from 
66 to 2.6 million years ago. 

Triassic A geologic period and system spanning from 
250 to 200 million years ago. 

Volatile matter Non-moisture component of coal 
that is liberated at high temperature in the absence 
of air. 

Definition of the Subject 

Coal is the second most important fuel currently used 

by mankind, accounting for over 25% of the world’s 


primary energy supply. It provides 41% of global elec¬ 
tricity supplies and is a vital fuel or production input 
for the steel, cement, and chemical industries. How¬ 
ever, coal is a fossil fuel formed from organic material 
by geological processes over millions of years. Hence, 
coal is a finite resource in terms of human timescales 
and its continued availability is important to the 
world economy. 

Peat is a related substance, but is classified some¬ 
where between a fossil fuel and biomass. The energy 
sector uses peat as a fuel to generate electricity and heat. 
It also has applications in industrial, residential, and 
other sectors, but global consumption of peat is insig¬ 
nificant in comparison to coal. Peat shares many sim¬ 
ilarities with coal, and is increasingly often grouped 
with coal for resource estimates in reports and assess¬ 
ments by public agencies. 

Knowing how coal and peat are created is vital to 
understanding how deposits are formed and what their 
basic properties are. Geology provides models and 
methodologies for describing deposits and where to 
find them. Exploration, drilling, and surveys provide 
the data necessary to map deposits and assess the 
resources they contain. Classification schemes are also 
central to understanding how the terms relate to the 
underlying data. 

Future production of coal and peat is essential for 
the development of global energy supplies. It is only the 
produced volumes that can be used in human activities, 
and a detailed appreciation of the production process is 
essential in understanding future supply developments. 
Factors such as economy, technology, legal, and envi¬ 
ronmental constraints affect the recoverable share of 
the available resources, i.e., the reserves. Understanding 
the complexity and the greater whole of the production 
mechanism and the limitations that are imposed on it 
require a wide variety of approaches and conceptual 
infrastructures. 

Introduction 

Natural resources are vital in supporting the continued 
well-being of the world’s population. Metals, fossil 
fuels, minerals, clean water, and many other resources 
are critical components for many aspects of modern 
society. Of all available resources, few are as important 
as energy. In fact, energy has been described as the 
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ultimate resource by several studies [ 1-4] . In physics, 
energy is defined as “the ability to do work” Einstein 
even showed the equivalence between energy and mass 
[5], implying that everything in the universe is made 
from energy in various forms. Naturally, it follows that 
energy influences everything and cannot be substituted 
for other resources. 

Energy resources take many forms, ranging from 
fossil fuels and uranium to biomass, human labor, or 
wind power. Currently, over 80% of all primary energy 
used by mankind is derived from fossil fuels [6]. Petro¬ 
leum dominates, with roughly 35% of world primary 
energy being derived from this particular hydrocarbon, 
coal at number two (~26%), followed quickly by nat¬ 
ural gas (~21%). The contributions from other energy 
sources range from 2% to 10% (biomass, nuclear, 
hydropower) to less than 0.1% (wind, solar). 

To understand the conceptual importance of coal, 
one can begin by studying the exceptional expansion of 
fossil energy in the last 200 years (Fig. 1). The develop¬ 
ment of modern society, population, and the world 
economy under this brief period has been remarkable. 


Nearly all this growth has been powered by fossil energy 
in various forms, originally coal but later oil and natu¬ 
ral gas. The growth soared and mankind both pros¬ 
pered and multiplied in many places, all thanks to 
cheap and abundant fossil fuels that provided useful 
energy for a multitude of activities. 

The postwar boom also coincides with the green 
revolution in agriculture and the use of high-yield 
agricultural techniques based on synthetic fertilizers 
and high energy use. In fact, fossil energy has been 
described as the principal raw material of modern 
agriculture [7-10]. Furthermore, energy is intimately 
linked to economic growth and development [11, 12]. 
To summarize, nothing is as important to society as 
energy and this should never be overlooked when deal¬ 
ing with questions of sustainability. 

The rise of the industrial nations in the Western 
world was chiefly driven by the coal that powered steam 
engines, transportation, and industry as well as heating 
homes. Important industrial complexes and cities arose 
near coal-rich regions, such as the Ruhr or Pennsylvania. 
In the USA, coal had replaced biomass as the dominant 
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primary energy source by late nineteenth century [ 13] . 
Similar events occurred in UK, Germany, and other 
nations around the early twentieth century. Today, the 
world is witnessing the rise of China and India as new 
industrial powers, both countries growth fueled chiefly 
by coal. However, petroleum eventually surpassed coal 
as the dominant fuel globally in the 1950s, but coal has 
remained the second most important energy resource 
ever since. 

Sedimentary layers of coal and peat beds range in 
age from the Upper Paleozoic to the recent (geologi¬ 
cally), and are located all over the world. The layers are 
a result of the accumulation of vegetable matter in 
a specialized deposition environment. Local conditions 
and events can produce different qualities and ranks as 
well as different degrees of structural complexity. 

This chapter provides a fundamental overview of 
global peat and coal resources, their distribution, and 
some of the central characteristics in their recovery and 
utilization. In addition, it presents a brief summary of 
resource classification schemes and their importance in 
understanding the amounts that may be available for 
future supply. Finally, some resource and reserve sta¬ 
tistics will be presented with discussion. Estimates of 
global resources must be dated and can naturally 
change as more is learnt about their geology. The fig¬ 
ures presented here should be seen as a snapshot of the 
current state of our knowledge. Nevertheless, such esti¬ 
mates indicate where the major deposits and mining 
activity are currently located and from where future 
production might originate. 

Coal and Peat: Global Resources 

Formation of Coal and Peat 

Coal and peat derive from vegetable remains that have 
been deposited in various environments where it has 
been transformed by geological processes to potentially 
useful energy resources. The sedimentation of coal and 
peat has been extensively studied for over a century. 
This section describes the general mechanism behind 
the formation of peat and coal. 

Mires are the accepted general term for peat¬ 
forming ecosystems of all types. The cycling of matter 
in most ecosystems is relatively fast and complete. 
In contrast, mires are characterized by an incom¬ 
plete cycle, resulting in a positive carbon balance. 


Peat-forming areas are generally wet places where the 
annual input of dead organic matter from the photo- 
synthetic activity of the plant cover exceeds annual 
breakdown. Over time organic material accumulates 
and becomes preserved. Peat deposits typically occur 
in waterlogged environments where water prevents 
oxidation of the organic remains. The presence of 
water limits the diffusion rate of gasses that leads to 
low oxygen availability. The large heat capacity of 
water also induces a low temperature, which inhibits 
decomposing and decomposition-facilitating organ¬ 
isms and leads to reduced rates of decay. 

Peat formation is typically initiated by two mecha¬ 
nisms. One is called terrestrialization, which is when 
bodies of water are replaced by peat-forming mires. 
The other mechanism is paludification, which is 
replacement of dry land by mires due to rising ground- 
water table or some other hydrological change in 
the local environment. Peat-producing wetlands are 
divided into ombrogenous peatlands, which are rain- 
fed systems, and topogenous peatlands, which owe 
their origin to a place and its surface/groundwater 
regime [14]. Moore defined several important subcat¬ 
egories such as bogs, bog forests, marsh, fens, swamps, 
floating swamps, and swamp forests [15]. 

The plant debris undergoes several chemical 
changes due to bacterial decay, compaction, heat, and 
time, which eventually leads to formation of peat. 
Pristine plant parts such as roots, barks, or spores are 
possible components of peat and can be used to deter¬ 
mine the type of plants that originally made up the 
peatland. The peat harvested in the Northern Hemi¬ 
sphere was chiefly formed during the Holocene epoch 
just after the retreat of the colossal glaciers that covered 
most of Europe. Consequently, most of the constituent 
peat-producing plant species are still thriving and con¬ 
tinue to produce peat as of today. 

The peat must be buried by additional sediments in 
order to be transformed to coal. Burial leads to com¬ 
paction of the peat and squeezes out water during the 
initial stages. A compaction ratio of 9:1 or more is not 
uncommon [ 16] . Continued burial and the addition of 
heat and time will break down the complex hydrocar¬ 
bon compounds in the peat through a series of possible 
reactions. Methane and other gaseous alternation 
products are typically expelled from the deposit, while 
other elements disperse. The result is that the deposit 
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Figure 2 

Principle mechanism of the formation of coal and peat 
(From Steve Greb and Kentucky Geological Survey. 
Reproduced with permission) 


becomes increasingly carbon-rich. The process begins 
with plant debris and continues through peat, lignite, 
subbituminous coal, bituminous coal, anthracite coal, 
to the pure carbon mineral graphite. This process is 
referred to as coalification. 

Some perspectives on the necessary timescales 
behind the formation of peat and coal resources are 
indispensable for a better understanding of the whole 
picture. The surface part of peat below the living 
ground layer, being less than 300 years old, amounts 
on average to 10.2% of the total peat carbon volume 
[17]. Only the deeper and basal parts of the peat are 
thousands of years old. In comparison, coal requires 
much longer time periods to form. Analysis of thick 
bituminous coal bed layers indicates that optimum 
peat-forming conditions must have been maintained 
for 5,000-10,000 years for every meter of clean bitumi¬ 
nous coal [18]. Other studies have shown that the 
global consumption of fossil fuel in 1997 was created 
from ancient plant matter corresponding to over 
400 times the net primary production of the world’s 
current biota [19]. Coal cannot be formed at the same 
rate society uses it and is therefore a nonrenewable fuel. 
Peat requires shorter time scales, but is still slowly 
formed in terms of human time scales. 

Studies have shown similarities between the exten¬ 
sive coastal plain peat deposits in Southeast Asia and 
the North American/European carboniferous coal¬ 
bearing sequences in terms of sediment accumulation, 
mineralogy, and geochemistry [20-23]. Several models 
provide a good understanding of the cyclic nature of 
coal sequences, the lateral continuity of coal beds as 
well as physical and chemical properties. However, no 
single model can provide a comprehensive explanation 
of all aspects regarding occurrence, development, and 
types of coal. 

There are many books and articles on peat and coal 
geology that can provide more detailed descriptions of 
sedimentation, structural changes, and other factors 
and how they influence the formation process. The 
formation of coal and peat can be described in a sim¬ 
plified manner (Fig. 2). 

Occurrences of Coal and Peat 

The oldest known land plant remains have been dated 
to the Ordovician/Silurian Period [24], but did not 


produce any noteworthy accumulations. Certain 
deposits of Devonian age are the oldest known coal 
accumulations, but it was not until the Carboniferous/ 
Permian Period that the world’s plant cover was suffi¬ 
ciently rich to produce and preserve significant 
amounts of coal. Although coal and peat are continu¬ 
ously being formed, there are a few major episodes of 
coal accumulation throughout history. In total, three 
major coal-formation periods have occurred and these 
coals form the bulk of the world’s coal reserves. They 
occurred during the Carboniferous/Permian Period, 
Jurassic/Cretaceous Period, and the Tertiary Period. 

Plate tectonics and climate have been described as 
important factors for answering when and where coal is 
deposited [25]. Places with favorable conditions have 
changed over time as continental plates have drifted or 
when climatic properties have been altered over the 
long time scales involved in the formation process. 
Changes in vegetation types can also be traced to cer¬ 
tain coal components. Geological age distributions of 
the world’s principal black coal and lignite deposits 
have been compiled by Walker [26]. 

During the Carboniferous Period, most of what is 
today the Northern Hemisphere was located near the 
equator and consisted of tropical peat mires, which 
later formed the coal basins of western and eastern 
Europe, the eastern USA, and former Soviet Union. 
Coals were also formed in the areas that today make 
up South America, southern Africa, India, Australia, 
and Antarctica, but this formation occurred under 
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cooler, more temperate conditions. Coals formed at 
this time now make up the bulk of the world’s black 
coal reserves. They are generally high in rank, but may 
have undergone significant structural change. 

The climate changed during the Permian Period 
and the lush vegetation of the northern areas eventually 
ceased to deposit coal-forming sediments. New, but less 
abundant coal formation occurred during the Triassic, 
Jurassic, and Cretaceous Periods. Such coals can chiefly 
be found in Canada, China, Russia, and the USA. 
Changes in floral types, such as the onset of angiosperm 
floras, as well the variety of deposition environments 
generally lead to the formation of coals that are more 
complex in their chemical make-up. Analysis of 
macerals and other constituents in coal provide insight 
in the specific conditions that were present during their 
accumulation. 

The third major period of coal formation took place 
during the Tertiary Period, and those coals form the 
bulk of the world’s brown coal reserves as well as 
important parts of the black coals currently being 
mined. Tertiary coals are mostly lignite, but in some 
areas temperature change has resulted in higher¬ 
ranking coals. 

The age and stratigraphy of all major coal deposits 
have been studied extensively, in particular those 
formations that are of economic interest. Meticulous 
surveys on individual fields are important for develop¬ 
ment and actual production, but such details for 
a single field can easily fill an entire bookshelf! More 
general overviews of the world’s most important coal 
regions have been made by others [26, 27]. Reviews on 
various pollutants in coal, such as arsenic or metals, can 
also be found in more specialized literature. 

Sometimes it is convenient to express coal deposits 
in energy terms, especially if they are going to be used 
as energy sources. Depending on specific properties of 
the coal in a deposit, the actual energy content may 
vary significantly. Coal is divided into different ranks 
depending on the degree of coalification as it matures 
from peat to anthracite. Coal is broadly divided into 
these four ranks, namely anthracite, bituminous, sub- 
bituminous, and lignite [28, 29], although much more 
detailed systems can be found. 

There are many classification systems used all over 
the world, each with their own set of definitions. The 
terms brown coal, black coals, hard coals, stone coals, 


thermal coal, and steam coal are commonly used, but 
their meaning may vary from country to country. Lig¬ 
nite and subbituminous coals are commonly seen as 
low-ranking coals, while bituminous coal and anthra¬ 
cite are considered high-ranking coals. Caution should 
always be exercised when analyzing global coal data to 
avoid mixing up different classification schemes. 

It should be noted that the ranks tend to overlap in 
terms of energy and that the calorific value can vary 
within a rank depending on local geological properties. 
Relatively high moisture levels and a low carbon con¬ 
tent and calorific value characterize low-rank coals, 
such as lignite and subbituminous coal. Higher-rank 
coals, such as anthracite and bituminous coal, have 
a higher carbon and energy content and a lower level 
of moisture. The following energy contents of coal 
ranks are common: 

Anthracite: 30 MJ/kg 
Bituminous coal: 18.8-29.3 MJ/kg 
Subbituminous coal: 8.3-25 MJ/kg, 

Lignite: 5.5-14.3 MJ/kg 

Some of the world’s present peat deposits were 
formed as early as a few hundred thousand years ago, 
although the main part of the global peat resources was 
formed during the last 10,000 years [30]. Occurrences 
of peat are linked to climate, vegetation, local environ¬ 
ment, and precipitation that allow peat-forming con¬ 
ditions to arise. There are also less peat occurrences in 
the Southern Hemisphere, as there is less land there. 
Present-day peat occurrences can be predicted well by 
suitable models [31]. 

Peat is present all over the world and estimates 
conclude that 2-3% of the global surface is covered by 
peat-forming areas, equivalent to about 4 million km 2 . 
This figure includes some 200,000 km 2 of tropical 
peats in Southeast Asia. A major proportion of the 
world’s peatlands are located in North America and 
the northern parts of Asia, but significant deposits are 
also found in northern and central Europe. In total, 
over 90% of the world peatlands are in the temperate 
and cold belt in the Northern Hemisphere. The 
remainder is located in tropical and subtropical lati¬ 
tudes and mostly located under forests. Such accumu¬ 
lations have been identified in the tropical regions 
of Africa, Latin America, as well as Indonesia and 
southern/eastern Asia. 
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Estimates of the total area of the world’s peatlands 
vary considerably from study to study. The basic prob¬ 
lem is to separate areas where the peat layer is 30 cm 
thick at minimum, i.e., peatlands, from areas only 
covered with wetland vegetation, such as floodplains, 
coastal lagoons, mangroves, and swamps. Hence, only 
a few countries have performed accurate peatland map¬ 
pings. More comprehensive discussions on peat and 
closer description of specific formations are available 
in reference literature, scholarly journals, and reports 
from various geological surveys. 

Resource and Reserve Classifications 

It is first necessary to look closer at classifications and 
what the terms actually mean in practice before defin¬ 
ing and discussing global resources of coal and peat. Far 
from all coal and peat being available for utilization by 
society, some formations are technically unfeasible or 
just too challenging economically to recover or may 
even prove to be undesirable to consumers, even 
though they are geologically available. As a result, it is 
important to use precise terms when talking about 
future supply of coal and peat. Especially in outlooks 
related to sustainable utilization as possible energy 
sources for society. 

Concepts such as “endowment” or “potential” are 
multifaceted and the meaning of such terms is disputed 
by different schools of thought. One school argues that 
“resource endowment” is crustal abundance, while 
another school maintains that the concept is relatively 
meaningless for supply assessments and should be 
avoided. As always, the truth is located somewhere in 
between the two extremes. Crustal abundance is solely 
determined by geological processes, which are reason¬ 
ably well understood by geologists. Furthermore, the 
physical in situ amounts will also be virtually fixed as 
a result of the slow underlying formation processes, 
even though in situ estimates might change in reports 
as a result of new information. 

Available coal accumulations can be divided into 
reserves and resources. Reserves are defined as being 
proved and recoverable under present technological 
and economic conditions. Resources include addi¬ 
tional discovered, but uneconomic or unavailable 
amounts together with undiscovered inferred, 
assumed, and speculative quantities. This distinction 


is justified by the idea that over time, and perhaps with 
new technology and higher prices, production and 
exploration activities will allow a reclassification of 
some resources into reserves [32]. The terms resources 
and reserves are often used synonymously by lay peo¬ 
ple, but it should be absolutely clear that this is a 
completely erroneous and misleading practice. Being 
specific and using the right term is essential to paint 
a realistic picture in any supply assessment or when 
planning for a sustainable future. Studies have 
highlighted this problem and the need to include non- 
geological factors in future supply estimates [13, 33]. 

It is necessary to investigate and explore any coal 
or peat deposits to ascertain whether something can 
be mined economically and that a product can be 
obtained that is marketable. Two common points of 
view are typically used to assess resources and reserves: 

1. The degree of geologic assurance 

2. The degree of economic feasibility 

The method of estimation along with the definition 
and expression of geological assurance is a key factor 
for the reliance of coal tonnage estimates. Details sur¬ 
rounding the topography, tectonic variability, struc¬ 
tural alteration, and postdepositional erosion provide 
uncertainties and should be taken into account in 
resource mappings. For example, heavily disturbed 
coals with great variation in thickness will need many 
more data points for an accurate and reliable assess¬ 
ment than relatively undisturbed coals of constant 
thickness. Data point spacing criteria and how far 
trends are extrapolated from measurement points are 
essential for the degree of geologic assurance (Fig. 3). 
Economic feasibility is dependent on the content 
of sulfur and other unwanted pollutants, thickness, 
depth, environmental constraints, coal price versus 
mining costs, and other parameters that affect recovery 
and consumer desirability. 

Sadly there is no international and uniform method 
for assessment, categorization and designation of coal 
resources and reserves. Several agencies, countries, and 
organizations have developed their own sets of defini¬ 
tions and methods to calculate resources and reserves. 
However, it is possible to roughly convert assessments 
made within one system to another and form aggre¬ 
gated pictures of the entire world situation. This sec¬ 
tion will focus on the system used by the United States 
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Figure 3 

Reliability categories based solely on distance from drilling 
measurement points. Measured and indicated coal 
together form the demonstrated coal category. Based on 
USGS resource classification system [34] 

Geological Survey (USGS), while closer descriptions on 
other systems can be found in more specialized litera¬ 
ture. For comparison, the Australian and Russian sys¬ 
tems are conceptually similar but differ in details. In 
contrast, the UK system is excellent for mining opera¬ 
tion, but not very functional for large areas that may 
contain coal but are not mined. USGS use the following 
definition for coal resources [34]: 

► Coal resources are naturally occurring concentrations 
or deposits of coal in the Earth's crust, in such forms 
and amounts that economic extraction is currently or 
potentially feasible. 

This category is divided into a number of subdivi¬ 
sions, such as inferred, indicated, or measured resources 
depending on geological certainty, depth, and thickness. 
Technology and proper investment can, at least in the¬ 
ory, recover all available in situ coal if the required 
economic conditions exist. However, resource numbers 
are inadequate as they do not take into account eco¬ 
nomics, the fundamental utilization properties, or even 
legal issues surrounding extraction in a geographical 


region. Recoverability is the crucial property for actual 
production and this leads to a variety of attempts to 
express the recoverable volumes of coal that can be 
extracted from the available resources. 

Far from all coal being recovered from a coal bed, 
some amounts will be unavoidably lost in the recovery 
process for technical reasons. This includes coal that is 
(1) left to support underground mine roofs, (2) used as 
barrier pillars adjacent to the mine or property bound¬ 
aries, or (3) simply left because for a variety of other 
reasons [34]. Mining and production practices also 
place constraints based on thickness, depth, rank, and 
other properties on the recoverable amount of the in 
situ resource. USGS has chosen the following definition 
for the reserve base: 

► Reserve base: those parts of the identified resources 
that meet specified minimum physical and chemical 
criteria related to current mining and production prac¬ 
tices, including those for quality, depth, thickness, rank, 
and distance from points of measurement. The reserve 
base is the in-place demonstrated (measured plus indi¬ 
cated) resource from which reserves are estimated. The 
reserve base may encompass those parts of a resource 
that have a reasonable potential for becoming eco¬ 
nomically available within planning horizons beyond 
those that assume proven technology and current eco¬ 
nomics. The reserve base includes those resources that 
are currently economic, marginally economic, some of 
those that are currently sub-economic, and some of the 
resources that have been or will be lost-in-mining but 
whose attributes indicate possible future recovery. 

Recoverable volumes are also affected by many 
other factors that make them dynamic. Higher prices, 
new exploration, or improved technology yield 
increased amounts of recoverable coal. In a similar 
way, increased freight rates, extraction costs, or intro¬ 
duction of various restrictions can decrease recoverable 
volumes. Likewise, political influences can greatly 
influence demand patterns or even cause direct physi¬ 
cal restrictions for mining. USGS [34] finally uses the 
following terminology to denote reserves: 

► Reserves: Virgin and/or accessed parts of a coal reserve 
base which could be economically extracted or pro¬ 
duced at the time of determination considering envi¬ 
ronmental, legal, and technologic constraints. 
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Important restrictions are legal prohibitions on coal 
mining in certain areas and/or the use of a particular 
mining methodology, such as strip mining. This can 
prevent the extraction of coal, despite the fact that the 
coal might be technically and economically recover¬ 
able. For example, land-use regulations and federal 
laws prevent mining near homes, public buildings, or 
federally funded highways [30]. Accessibility, availabil¬ 
ity, and restriction factors are linked to the society and 
its legal and political climate. This can have a major 
influence on the volumes practically available for 
recovery. Detailed discussions of how availability of 
recoverable coal is estimated by the USGS can be 
found in other works [35-37] . The Energy Information 
Administration (EIA), the energy authority of the USA, 
utilizes the following definition to include restrictions 
and accessibility in their reports and assessments of 
American coal commonly used by non-geologists, 
planners, and policymakers: 

► Estimated recoverable reserves: cover the coal in the 
demonstrated reserve base considered recoverable 
after excluding coal estimated to be unavailable due 
to land use restrictions or coal currently economically 
unattractive for mining (and after applying assumed 
mining recovery rates). 

EIA creates this category by applying economic 
feasibility criteria factoring downward from the reserve 
base. The “coal reserve” term used by EIA may better 
be described as “potential coal reserves,” as they are 
generally not proved by any detailed drilling [38]. 
A full discussion of coal resources and reserve termi¬ 
nology, as used by EIA, USGS, and US Bureau of Mines, 
can be obtained from the EIA [39]. The estimated 
recoverable reserves category corresponds to “proved 
reserves” discussed in the BP Statistical Review of 
World Energy [40], “proved recoverable reserves” 
used by World Energy Council (WEC) [41], or 
“reserves” as used by the German Federal Institute of 
Geosciences (BGR) [42]. 

Coal and Peat Resources in the World 

To make accurate coal and peat resource estimates, 
exploration geologists make detailed studies of relevant 
accumulations. They take measurements at appropri¬ 
ate intervals and extrapolate using the appropriate 


classification categories, knowledge of the local geol¬ 
ogy, and good judgment. In theory, there is no funda¬ 
mental difference between underground or surface 
formation maps. However, shallow drilling is cheaper 
and makes it possible to drill more holes, which makes 
surface mappings generally more reliable. Because deep 
boreholes are costly (which means less are drilled, thus 
making the assessment less reliable) the exploration 
geologist uses various geophysical methods to supple¬ 
ment the drilling data. 

Resources are commonly expressed as in situ ton¬ 
nage, which is the total amount occurring in place. It 
can be calculated from the thickness of the deposit, the 
area of extent, and the relative density. Thickness is 
commonly determined at each point of measurement, 
while the area is measured on a map or plan. The 
relative density is normally taken from the section 
that will be mined, or alternatively it might be esti¬ 
mated using the known average ash content of a coal or 
peat seam. The formula for in situ tonnage is: 

thickness(m) x area(m 2 ) 

x relative density (tons/m 3 ) = in situ tonnage 

At regular intervals, scientists and governmental 
agencies conduct and publish world coal resource 
assessments. In particular, since its establishment in 
1923, the World Energy Council (WEC) has reviewed 
all forms of energy resources including coal and peat. 
Data from the WEC comes from reports by the WEC 
member committees and constitute a sample that 
reflects the information available in particular coun¬ 
tries. German Federal Institute for Geosciences and 
Natural Resources (BGR) has also released a number 
of assessments on global coal resources. These agencies 
will serve as the main data source for the resource 
figures presented here. 

Peatland resource estimates are commonly quoted 
on an area basis because initial quantification normally 
arises through soil survey programs or via remotely 
sensed data that focuses on geographical extent 
(Tables 1 and 2). Estimates of the world’s peatland 
area vary considerably. This is partly due to different 
wetland classification systems and partly due to insuf¬ 
ficient survey data. Peat resources have been estimated 
to be 5,000-6,000 billion cubic meters, but are hard to 
quantify in any detail [30]. WEC gives an estimate of 


c 




2182 


c 


Coal and Peat: Global Resources and Future Supply 


Coal and Peat: Global Resources and Future Supply. 
Table 1 The distribution of peatlands and other terrestrial 
wetland ecosystems in the world. Data source: [30] 


Region 

Peatlands 

[km 2 ] 

Wetlands 

[km 2 ] 

Total 

[km 2 ] 

Africa 

58,000 

282,000 

340,000 

Asia 

1,119,000 

1,149,000 

2,268,000 

Australia and 
Oceania 

14,000 

10,000 

24,000 

Central and South 
America 

102,000 

330,000 

432,000 

Europe 

957,000 

n. a. 

957,000 

North America 

1,735,000 

657,000 

2,392,000 

World 

3,985,000 

2,428,000 

6,413,000 


3,500-4,000 billion cubic meters [41]. Uncertainties in 
water and ash content and many other factors influence 
the result significantly when converting to tonnage. 
This can be seen as a reason for circumventing tonnage 
figures and preferring volumetric units in resource 
assessments. However, it is clear that the world pos¬ 
sesses a huge tonnage of peat overall. The important 
question is: how much of those resources can be real¬ 
istically recovered in the future? 

World coal resource estimates can be traced back to 
1936, when the WEC gave a world resource estimate of 
around 16,000 billion tons (reported as recoverable 
reserves plus estimated additional in situ amounts). 
In subsequent assessments, their world coal resources 
have varied from 10,000 to 14,000 billion tons, with 
most estimates closer to the lower part of the range. In 
comparison, BGR estimates of world coal resources 
have varied from 6,000 to 15,500 billion tons. Using 
either estimate, it is clear that global coal resources are 
large and the world production of coal (~6 billion 
tons) is seemingly insignificant in comparison. 

However, there are reasons to doubt these agencies’ 
resource assessments. First, there are large fluctuations 
in the reported resource amounts as definitions and 
reporting practices change, coal-bearing areas are 
added or subtracted and new geological information 
becomes available. 

For instance, total coal resources in the USA 
decreased by more than 60% between 1913 and 1962. 


Coal and Peat: Global Resources and Future Supply. 
Table 2 Distribution of peat resources for selected 
countries and regions. Data source: [41] 


Region 

Country 

Peatlands 

[km 2 ] 

Africa 

Guinea 

5,250 


Nigeria 

7,000 


South Africa 

9,500 


Uganda 

14,200 


Zambia 

11,060 


Total Africa 

58,410 

Asia 

China 

10,440 


Indonesia 

270,000 


South Korea 

6,300 


Malaysia 

25,360 


Burma 

9,650 


Total Asia 

331,880 

Australia and Oceania 

Australia 

150 


New Zealand 

2,600 


Papua New Guinea 

6,850 


Total Australia and 
Oceania 

9,640 

Central and South 

Brazil 

15,000 

America 

Chile 

10,470 


Falkland Islands 

11,510 


Guyana 

8,140 


Venezuela 

10,000 


Total South America 

62,400 

Europe 

Belarus 

23,970 


Estonia 

9,020 


Finland 

89,000 


Germany 

14,200 


Iceland 

10,000 


Ireland 

11,800 


Latvia 

6,400 


Norway 

23,700 


Poland 

12,000 


Russia 

568,000 


Sweden 

64,000 


Ukraine 

10,080 


UK 

19,260 


Total Europe 

876,510 
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Table 2 (Continued) 


Region 

Country 

Peatlands 

[km 2 ] 

North America 

Canada 

1,113,280 


Cuba 

6,580 


Mexico 

10,000 


USA 

214,000 


Total North America 

1,354,220 


In contrast, between 1968 and 1974 coal estimates were 
revised upward, in part due to improved accuracy in 
assessing the amount of lower-rank coal [33] and also 
by including undiscovered resources in the estimates. 
Further exploration may show that a substantial part of 
the undiscovered resources do not exist [33]. Similar 
modifications, as well as some brief reasoning behind 
the revisions, can be found in recent world coal 
resource estimates from the BGR (Table 3). 

In 2005, BGR reported that they changed coal 
resource estimates because there was insufficient sub¬ 
stantiating information for many countries and many 
countries had skipped reporting years. In 2006, they 
incorporated new resources in China and former Soviet 
Union that lead to a 115% increase in world hard coal 
resources and a 200% increase in lignite resources. In 
2007, BGR made yet another upward revision, increas¬ 
ing world hard coal resources by 68% and lignite 
resources by 36%. In both cases the majority of the 
increase came from new resources identified in the USA 
by a new USGS study [43]. 

Another important observation and fact of this 
world is that the peatlands are unevenly distributed 
with the vast majority concentrated in a few countries 
(Table 2). This trait can also be seen in the assessment 
literature of world coal resources (Table 3). This 
inequality is a sad fact of this world and a function of 
favorable geology in certain parts of the world, 
bestowing significant deposits of peat and coal on rel¬ 
atively few countries. However, uncertainties are pre¬ 
sent and there is a problem with a lack of available data 
for some countries and regions. 

To summarize, resource estimates are important 
from a geological point of view and are vital for under¬ 
standing where recoverable amounts may be located. 
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Table 3 Estimated total coal resources (recoverable 
reserves plus other in situ resources), as published 
by the BGR. All numbers in million metric tons (Mt). Data 
source: [42] 


Country 

2004 

2005 

2006 

2007 

USA 

1,090,404 

1,057,921 

1,359,404 

6,719,682 

China 

1,089,787 

1,089,789 

4,367,000 

4,367,000 

India 

259,217 

291,863 

252,359 

229,867 

FSU 

2,295,275 

2,326,121 

2,915,146 

2,931,293 

Australia 

298,400 

304,801 

193,700 

148,200 

South 

Africa 

163,851 

163,851 

48,751 

28,559 

Germany 

87,827 

91,497 

84,573 

83,065 

Poland 

70,262 

101,281 

179,459 

179,459 

UK 

5,220 

7,220 

190,005 

187,132 

Indonesia 

10,581 

64,829 

26,839 

23,948 

World 

NA 

6,038,991 

9,553,840 

15,510,618 


However, it is essential to understand that society does 
not have economic access to all the reported resources. 
In situ resources include all amounts, even those 
located deep or of poor quality, making resource esti¬ 
mates largely an academic figure [33]. Resources 
must be converted into reserves by a detailed evaluation 
of the degree of certainty of the size of the particular 
coal resource, and the economic, legal, and environ¬ 
mental constraints placed on its exploitation. Vast 
resources do not necessarily mean significant future 
production, as production is dependent on many 
more factors than just geological availability. This 
naturally leads to the next section in which reserves 
are discussed. 

Coal and Peat Reserves in the World 

Reserves are usually calculated from resources by 
applying suitable and relevant constraints. Legal, envi¬ 
ronmental, and geological factors prohibit the recovery 
of the entire resource so reserve estimates will always be 
less than resource estimates. Rather than perform 
a detailed constraint analysis, it is possible to assume 
a universal recovery factor based on historic recovery 
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factors for similar deposits. The US Bureau of Mines 
performs this type of calculation regularly [33]. How¬ 
ever, the most reliable method is a deposit-by-deposit 
analysis because it can incorporate local economic con¬ 
ditions as well as the technical recoverability of the 
minable coal. 

Coal resources are primarily of academic interest, 
and unfortunately provide little information about 
what quantity of the resource can be practically recov¬ 
ered. For example, let us look at the Gillette coal field in 
Wyoming and USGS’ comprehensive investigation of 
its coal supply. Luppens et al. [44] assessed the Gillette 
coalfield and its 11 coal beds and estimated the original 
coal in place to be 182 Gt, with no restrictions applied. 
Available coal resources, which are a part of the original 
resource that is accessible for potential mine develop¬ 
ment after subtracting all restrictions, are about 148 Gt. 
Recoverable coal, which is the portion of available coal 
remaining after subtracting mining and processing 
losses, was determined for a stripping ratio of 10:1 or 
less and resulted in a total of 70 Gt. Applying current 
economic constraints and using a discounted cash flow 
at 8% rate of return, the coal reserve estimate for the 
Gillette coalfield is only 9.1 Gt. While economic con¬ 
ditions definitely have the largest impact, reductions 
caused by technical and availability restrictions are 
also significant. 

Extraction of peat and coal is primarily an eco¬ 
nomic activity that recovers valuable products from 
certain deposits with a profit. Prevailing and antici¬ 
pated price, location of the deposit relative to the 
market, transportation costs, specific use, and the pro¬ 
duction cost are all important factors for determining 
the feasibility of exploitation. Forecasting costs, esti¬ 
mating revenues, and handling inflation over long time 
periods is an uncertain business. Additional uncer¬ 
tainty comes from the regulatory environment, partic¬ 
ularly environmental laws, tax policies, and similar. In 
essence, profitability is a dynamic factor and can 
quickly change. To obtain an accurate estimate of the 
recoverable volume of a deposit requires a meticulous 
economic study in combination with comprehensive 
physical surveys. 

An often overlooked but fundamental factor is the 
net energy analysis. Hubbert [45] wrote about net 
energy in relation to oil but the same principle applies 
to all energy resources: “there is a different and more 


fundamental cost that is independent of the monetary 
price. That is the energy cost of exploration and pro¬ 
duction. So long as oil is used as a source of energy, 
when the energy cost of recovering a barrel of oil 
becomes greater than the energy content of the oil, 
production will cease no matter what the monetary 
price may be.” In the case of coal and peat, a poor net 
energy return will prevent deposits that are too deep, 
too complicated, or too thin from being usable as 
energy sources. Mining companies leave those 
resources behind because they require more energy to 
exploit than they can yield in return. Several researchers 
have used net energy analysis to determine mining 
boundaries in an objective manner [46]. 

Rohrbacher et al. developed a methodology for 
describing recoverability using geological, technical, 
and economic factors [47]. They concluded that past 
coal estimates did not address the amount that might 
realistically be available for production after environ¬ 
mental and technological restraints are considered. 
They showed how air quality regulations, mine recla¬ 
mation legislation and regulations, new safety and 
health standards, and advancements in mining tech¬ 
nology had dramatically changed the economic com¬ 
petitiveness of the coal fields they studied. More 
importantly, they demonstrated that the generous EIA 
reserve base would be far less than the reported figure 
using their methodology. The EIA estimate did not 
include the effects of mined-out prime reserves, envi¬ 
ronmental, industrial, economic, and social consider¬ 
ations. Nor did it include the technical aspects of the 
minability and washability of the individual seams. 
In contrast, second-generation assessments of avail¬ 
able coal amounts in the Pittsburgh coal bed were 
performed and described by Watson et al. [48]. Their 
assessment was that half of the original coal resource in 
place had already been extracted leaving only 15 Gt and 
those resources were increasingly being restricted by 
new environmental laws, encroaching population, 
and affected by technological factors. 

Several organizations regularly compile peat and 
coal reserve estimates. Many national energy agencies 
and geological surveys keep reserve records for their 
own country. The World Energy Council and the BGR 
collect country level data and publish global coal 
reserve estimates. The BP Statistical Review of World 
Energy [40] is another commonly referenced source of 
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reserve estimates but it only reproduces the numbers 
compiled by World Energy Council without backdating 
the estimates. Table 4 shows the latest reserve figures 
from these three sources. Several studies have criticized 
the reserve estimates found in published literature for 
not adequately accounting for all resource constraints 
[13, 33, 49-51]. 

Changes in policy can also cause rapid and signifi¬ 
cant changes to reserve estimates. For example, when 
Germany removed their mining subsidy Germany’s 
coal reserves were greatly reduced [52, 53]. In Russia’s 
case, Malyshev [54] shows that according to the best 
estimates by experienced geologists, mining engineers, 
and economists around 40% of the Russian coal 
reserves cannot be regarded as of any practical interest 
because they are highly unprofitable. 
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Table 4 Estimated coal reserves for selected countries. 


Note how WEC and BP numbers are identical, except for 
the fact that BP dates them differently. All numbers in 
million metric tons (Mt). Data sources: [40-42] 


Country or 
region 

WEC 

Proved at 
end 2007 

BGR 

Proved at 
end 2007 

BP 

Proved at 
end 2008 

Australia 

76,200 

76,900 

76,200 

Brazil 

7,059 

6,596 

7,059 

Canada 

6,578 

6,578 

6,578 

China 

114,500 

192,000 

114,500 

Germany 

6,708 

40,936 

6,708 

India 

58,600 

73,510 

58,600 

Indonesia 

4,328 

6,981 

4,328 

Japan 

355 

355 

355 

Kazakhstan 

31,300 

18,947 

31,300 

Poland 

7,502 

16,329 

7,502 

Russia 

157,010 

161,553 

157,010 

South Africa 

30,408 

28,559 

30,408 

Ukraine 

33,873 

34,375 

33,873 

UK 

155 

432 

155 

USA 

238,308 

262,971 

238,308 

World 

826,001 

989,913 

826,001 


Highly reliable estimates of coal reserves are rare in 
part because the factors that influence recoverability 
change frequently. Changes in classification systems, 
economics, regulations, and other factors can cause sig¬ 
nificant changes in reserves over time. In the case of the 
USA, significant downward revisions of the reserves 
occurred when mining restrictions and other non- 
geological factors decreased recoverability [13]. Despite 
the problems with reserve estimates, they are still the 
best available estimates of how much of a resource that 
can be economically and technically recovered. 

Regulations can directly affect coal recoverability, 
such as prohibitions due to land-use conflicts, or indi¬ 
rectly, such as taxation. In the USA, the Surface Mining 
Control and Reclamation Act of 1977 directly prohibits 
surface mining in national parks, forests, wildlife ref¬ 
uges, trails, wild and scenic rivers, wilderness, and 
recreation areas. Many national parks have been 
established over the years, further limiting land avail¬ 
able for coal production. Petsch [55] studied Europe 
and the Ruhr area and concluded that people have 
become sensitive to intrusions made on their environ¬ 
ment, for instance from mining or colliery spoil. Phys¬ 
ical limitations from land-use regulations have 
generally increased over time in many countries. His¬ 
torically, air quality regulations have increased the 
demand for low-sulfur coal [13, 56-58]. Expanded 
environmental awareness in the 1960s and 1970s had 
lead to heated debates and stricter regulations. 

In 2009, President Obama stated that “at a time of 
such great challenge for America, no single issue is as 
fundamental to our future as energy.” Part of that 
challenge is reducing the dependence on imported oil 
and political pressure to limit greenhouse gas emis¬ 
sions. Governments may very well tighten coal regula¬ 
tions and pursue future efforts for addressing 
anthropogenic climate change and C0 2 emissions. 
Generally, the air-quality regulations in the USA have 
been an environmental success story although it has 
done little to mitigate C0 2 emissions [59]. Interest¬ 
ingly, tight regulations are not the sole factor in future 
emissions. Regulatory uncertainty can be significant as 
businesses delay investments until they can make solid 
return on investment decisions [60]. Exactly how 
future C0 2 policies will affect coal reserves remains to 
be seen but it is likely that such policies will reduce the 
recoverable proportion of coal resources. 
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Sustainable development, environmental steward¬ 
ship, and the importance of natural resources for the 
continued well-being of mankind are driving forces for 
future policies, at least in any foreseeable outlook. 
Historical trends point toward increased restrictions 
and this implies that future energy policies will result 
in significantly decreased coal availability. However, 
politics are constantly shifting and sudden change in 
any direction cannot be ruled out. To summarize, 
recoverability is a complex issue and future supply of 
coal will be dependent on many factors ranging from 
geological availability and quality to socioeconomic 
acceptability and energy policies. 

Because peat is a small energy source, peat reserve 
estimates are difficult to find and there is no known 
comprehensive global study. Volumetric data can be 
converted to tonnage by using the average bulk density 
of produced peat. USGS Mineral Commodity Survey 
on peat [61] contains some reserve estimates (Table 5). 
Only a small portion of peatlands are available for 
extraction due to environmental constraints and 
other factors. 

Only a few countries have a peat industry of signif¬ 
icant importance, so the global peat reserves should be 
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Table 5 Peat reserves for selected countries. All numbers 
in million metric tons (Mt). Data source: [61] 


Country 

Peat reserves as of 2008 [Mt] 

Belarus 

400 

Canada 

720 

Estonia 

60 

Finland 

6,000 

Ireland 

NA 

Latvia 

76 

Lithuania 

190 

Moldova 

NA 

Russia 

1,000 

Sweden 

NA 

Ukraine 

NA 

USA 

150 

Other countries 

10,000 


regarded as highly uncertain. Economics, demand, 
environmental regulations, and other factors are still 
uncertain for many regions. The world possesses large 
amounts of peat, but that should not be seen as guar¬ 
antee for future production. If demand for peat 
increases, reserve estimates will in all likelihood 
become more accurate. 

As with coal resources, global peat reserves are very 
unevenly distributed. A small number of nations con¬ 
trol the vast majority of the world’s coal reserves. 
The USA, Russia, China, Australia, India, and South 
Africa control over 80% of the world’s coal reserves 
(Table 4) - a remarkable concentration of resources. 
Peat shows a similar distribution with just a few coun¬ 
tries as the world’s major reserve holders (Table 5). 

Global Coal and Peat Production 

Production is the activity that actually transforms coal 
and peat into something useful for consumers and 
society. Some of the produced volumes are inherently 
lost in preparation, washing, or transformation to spe¬ 
cialized products such as briquettes to meet customer 
needs. Generally coal and peat are burned to create 
heat, the heat is transformed into steam, and then the 
steam into electricity. They are also used in industrial 
chemical processes, steel-making, and horticulture. 

The world produced 6.8 Gt of coal in 2008 [40] and 
a comparatively tiny amount of peat - only around 25 
million tons [61]. Peat’s role as an energy source is 
negligible in the global energy context, while coal is 
the second most important fuel, second only to oil. 
As a result, this section will briefly cover peat and 
coal mining and highlight some important aspects 
of production. 

Currently, just a few countries are the principal 
producers of peat and all those nations are located in 
Europe. Finland, Ireland, Belarus, Russia, Sweden, 
Ukraine, and the Baltic states are the largest producers 
of peat. Finland is perhaps the most significant peat 
consumer and derives 6% of its total energy from peat 
as a way to replace imported fuels [62] . In Finland, over 
90% of the total peat production goes to energy pro¬ 
duction. In contrast, the USA uses most of its peat for 
horticultural uses [61]. Peat is often used as a practical 
alternative for smaller heat and/or power plants. There 
is also significant trade flows of peat among certain 
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countries, but most of the peat is still being consumed 
by the same country that produced it. 

Wetlands that contain peat have to be drained 
before they can be exploited. Peat in situ contains 
roughly 90% water and drainage can reduce this sig¬ 
nificantly. Additional drying by the wind and sun will 
further reduce moisture content down to 40-50%. The 
traditional production method was to hand-cut peat 
into sods or strips, but this has been replaced by 
mechanical means. Another production technique 
yields fine peat granules by using a mechanical miller 
to disturb and granulate the top layer of the peatland. 
Some of the produced peat goes to the production of 
peat briquettes, which is a more practical fuel for 
households. WEC [41] states that the bulk of peat 
production is obtained from milling, which is more 
easily used for electricity and heat plants. 

Producing coal requires more complex operations 
and often underground mining methods. Coal mining 
can be done from the surface by using draglines and 
other machines to remove the overburden and expose 
the coal for recovery. Surface mining methods can 
generally recover a high percentage of the coal and are 
relatively cost-effective. However, there are limits to the 


amount of overburden that can be practically removed 
to obtain access to the coal. Above certain stripping 
ratios (the amount of overburden that must be 
removed divided by the amount of recoverable coal), 
the surface method becomes unworkable and is 
replaced by deep mining. 

In comparison, underground mining is more com¬ 
plicated and expensive and recovers less coal since 
certain parts must be left to support the roof. Up to 
40% of the total coal in a coal seam may be left as 
supporting pillars, although this sometimes can be 
recovered at a later stage. Worldwide, about 40% of 
the production originates from surface mines, while 
60% stems from underground operations. Surface 
mining dominates in some important producing coun¬ 
tries [63]. The chosen production method will have 
a significant impact on the economic as well as techni¬ 
cal restrictions that affect the recoverability of the coal. 

Since the 1950s, the six largest coal reserve holders 
have increased their share of world production (Fig. 4). 
China in particular has rapidly increased its coal pro¬ 
duction and now accounts for over 40% of world coal 
production. The USA is the second largest coal pro¬ 
ducer, followed by India, Australia, Russia, and South 
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Coal and Peat: Global Resources and Future Supply. Figure 4 

Total world production 1950-2008. Prior to 1985, the coal production from the Donetsk basin is not included in Russia. 
In 2008, just six countries accounted for roughly 80% of the total world coal production. These countries will determine 
the shape of future global coal production. Data source:[40, 64] 
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Africa. Even though coal is mined in over 50 different 
countries [63], the bulk of production comes from 
these six nations. 

It is impossible for coal and peat production to 
avoid affecting and disturbing the local environment. 
Measures that restrict or even prevent mining in envi¬ 
ronmentally sensitive and other protected areas are 
affecting legal and economic feasibility of new mines. 
Mine tailings, acid mine drainage, leakage of associated 
substances like sodium and heavy metals, land subsi¬ 
dence, impact on local wildlife and landscape changes 
are problems that are intrinsically associated with pro¬ 
duction. Avoiding addressing these issues will stir up 
political or legal controversies [13, 55]. 

In addition, safety regulations are another compli¬ 
cating factor. Mining companies must agree to follow 
specific safety laws to receive a mining permit. Some 
major hazards are explosions (caused by methane gas 
and dust), roof collapse, and machine failure. Surface 
mining is generally safer since the risk of explosions is 
lower and the risk of roof collapses is almost completely 
absent. Since many coal deposits are not reachable by 
surface mining, the company must resort to under¬ 
ground mining. Closer discussions on safety related 
issues have been made by others [65, 66]. Good safety 
brings additional costs, while poor safety will hurt the 
public acceptance of the mining operations as well as 
potentially risking the demand for the “unsafe” coal. 
Poor safety may also result in governmental interven¬ 
tion and the closure of mining operations if required 
safety regulations are not followed. 

New technology and recovery methods can 
change or potentially even revolutionize production. 
Improved mechanization, use of long-wall techniques 
that minimize the coal left behind to support the roof, 
and other measures have improved productivity and 
economic competitiveness of mining operations [67, 
68]. These new mining methods have made previously 
unreachable deposits feasible for development, espe¬ 
cially when they reduce production costs. New inven¬ 
tions will likely continue to improve production even 
further as research and development continues. 

However, there is a danger in expecting too much 
from new technology. Increasing depletion of the best 
and most easily accessible coal seams will force future 
production to be derived from less favorable deposits. 
This will increase mining costs, which in turn will make 


energy more expensive across the whole economy. Some 
studies even show that technological progress will 
not meaningfully be able to offset depletion [69-71]. 
The study authors reason that new technologies and 
mining methods that boost efficiency and decrease 
labor requirements will be unable to offset the higher 
production costs of more complicated and labor- 
intensive deposits. 

Coal has been produced in many countries for 
many decades, which enables us to perform longitudi¬ 
nal studies of production data. Several countries are 
currently producing only a small share of what they 
have produced historically (Fig. 5). Much of the 
reduced production can be explained by the move to 
other energy sources like petroleum or uranium, deple¬ 
tion of commercially feasible deposits, energy policy 
changes, and similar reasons. 

Countries such as the UK still have a significant 
amount of coal resources left (Table 3), but not enough 
to suggest that coal will be the fuel of choice again. The 
remaining coal is generally located deeper than the 
previously mined coal and will be more expensive to 
mine in terms of labor and capital investment. Given 
the drawbacks of coal, especially in terms of climate 
change because of its high carbon content, it is not at all 
certain that demand for coal will grow again. This 
naturally leads to the question of future production of 
coal and peat. 

Future Production 

Future production of peat and coal is an open and 
complex question. The resources might be vast, but 
only a small amount is recoverable once legal, environ¬ 
mental, and economic constraints are accounted for. In 
particular, it would be a mistake to think that the huge 
worldwide resource estimates mean that coal and peat 
will provide a steady energy supply in to the future. In 
addition to the previous factors, production also 
depends on social acceptability (i.e., who would want 
to see mountain-top removal), technical restrictions, 
the development and competitiveness of alternative 
energy sources, and expected demand. Although the 
following overview is unable to discuss all the factors 
that will affect future production, it will illuminate 
some of the complexity and leading indicators of future 
production projections. 
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Coal and Peat: Global Resources and Future Supply. Figure 5 

Historical coal production of UK, France, and Japan. All countries experienced production disturbances around the two 
world wars and the great depression. Data source: [40, 64] 


There are many methodologies for creating produc¬ 
tion and supply projections for coal or peat. Some 
methods are quantitative, and rely on field data, while 
others are qualitative. Some analysts base their pro¬ 
jections on economic theories, others use physical or 
statistical models derived from natural sciences. The 
life of the reserves and whether production can match 
demand are important questions for planning and 
sustainability. 

The economist Adelman [72] has said that the 
amount of mineral in the earth is an irrelevant, 
nonbinding constraint to production. He further says 
that the amount of resource in the ground is probably 
unknowable but surely is unimportant and of no eco¬ 
nomic interest. This is correct in the very limited sense 
that producers will never recover the very last amounts 
of coal or any other resource from the earth. The 
ultimate recovery is undefined and governed by non- 
geological circumstances. However, although the 
recovery factor might be undefined that does not 
make it unlimited. It is bound between 0% and 100% 
of the resource in the earth’s crust, which is clearly 
defined but not known exactly. Thus, the fact that the 
amount of recoverable coal is undefined and unknown 
cannot be an argument against the existence of future 


production limitations. The slow creation of coal and 
peat in comparison to their rapid extraction from min¬ 
ing imposes an inherent finiteness to the amount of 
coal and peat available. 

Hubbert [73] proposed a model that extrapolates 
the production curves of finite resources such as petro¬ 
leum or coal. His model assumes that production 
begins at zero, just before production has started, and 
ends at zero, when the resource has been exhausted. In 
between, the production curve passes through one or 
more maxima. The actual shape of the production 
curves may vary, but they are ultimately limited by 
the amount of resource that was deemed to be recov¬ 
erable. Hubbert initially proposed a bell-shaped curve 
for an idealized production behavior, representing var¬ 
ious development stages without giving any mathemat¬ 
ical description for it. Later, he derived a mathematical 
framework chiefly based on the logistic function and 
curve fitting to production and discovery data [74]. 

Figure 6 shows a simple example of how actual 
production can be described by mathematical curves. 
When doing long-term projections, a suitable curve is 
fitted against historical production and constrained by 
a reasonable estimate on the recoverable volumes that 
may be ultimately produced. The total resource in the 
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* Historical production — Logistic fit Hubbert fit — 5 year moving average 


Coal and Peat: Global Resources and Future Supply. Figure 6 

Historical production of anthracite in Pennsylvania fitted with both logistic and Hubbert curves. Data source: [13] 


earth’s crust will ultimately place a limit on future 
production, but the real limit will be lower since it is 
unrealistic to assume that all geologically existing vol¬ 
umes can be recovered. 

Hubbert himself [73] used 2,500 Gt as the ultimate 
world coal reserves and placed the production peak 
around 2,150 at an annual output of 6 Gt. This implies 
that there are no near-term problems for coal produc¬ 
tion. However, his simple approach does not consider 
all aspects of recoverability. It also agrees poorly with 
actual production history since current production 
reached 6 Gt per year in 2008 and production is still 
increasing. 

Hubbert’s method is sensitive to the assumed ulti¬ 
mate reserve that constrains the curve so it is essential 
to have the best possible resource estimate. It has been 
described as a reasonable model provided that it is 
acknowledged that reserves are dynamic and that 
other factors also influence production [75]. Growth 
curve methods can provide insight into the general 
long-term flow of finite energy resources, but pertur¬ 
bations caused by sudden and unforeseen near-term 
economic or political changes cannot be predicted. 
Therefore, long-term life-cycle projections should not 


be used as a substitute for meticulous economic studies 
that forecast perturbations in coal production over the 
upcoming years or decades [76]. Studies have also used 
the development of regulations, restrictions, and recov¬ 
erable volume estimates to create production projec¬ 
tions [13]. 

Other quantitative assessments that have relied on 
reported reserve and resource assessments found that 
coal output can increase until 2050 or even 2100 
[32, 76]. Thielemann et al. [32] assumed that resources 
will simply be converted into reserves and then pro¬ 
duced due the self-regulating price-driven supply cycle. 
This model gives no foreseeable production limitations 
within the next century. Mohr and Evans [77] point out 
that shrinking coal resource and reserve estimates 
over the last 25 years seem to indicate that factors 
other than the self-regulating supply mechanism are 
playing a role. In any case, future production will still 
be limited to the countries and regions where the 
resources are located. 

Watson et al. [48] concluded that future coal pro¬ 
duction from the Pittsburgh coal area is dependent 
on a complex set of factors, including future trans¬ 
portation costs and environmental policies. Without 
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a holistic view of all factors affecting recoverability and 
geological availability, no reasonable supply projection 
can be made. Although predictions of future produc¬ 
tion will always be uncertain, it is important to remem¬ 
ber that production is dependent on much more 
than just geological existence. Claims that abundant 
resources will automatically provide future energy sup¬ 
ply need to be treated with suspicion - they oversim¬ 
plify the production situation. 

For example, there are significant unexploited coal 
reserves located in eastern Siberia and far eastern Russia. 
However, these deposits are far from major population 
centers and will need major investments in infrastruc¬ 
ture before they can be exploited. How fast this can 
occur is an open question and depends primarily on 
economic and political factors. Also, Russia might never 
develop those deposits and may use other energy 
sources instead. A similar point can be made for the 
USA and the vast coal resources locked up in Alaska 
[43]. Underlying assumptions can greatly affect future 
production projections and one must always remem¬ 
ber that production is a complex issue with many 
factors involved. 

Recent concern for the C0 2 emissions that come 
from burning fossil fuels may turn coal and peat into 
undesired fuels for the future in some countries. On 
the other hand, countries that care less about the envi¬ 
ronment might turn to coal and peat as cheap, but 
polluting, alternatives to imported oil. Clean coal tech¬ 
nologies and carbon capture and storage might allevi¬ 
ate much of the emission problems from combustion, 
which can improve coal’s attractiveness. Cheap and 
reliable wind and solar energy or improved nuclear 
power might also deliver cheaper energy to consumers 
than coal and peat if carbon emissions are taxed. 
Increasing global competition for the remaining petro¬ 
leum might cause countries to increase their reliance 
on domestic energy sources. In summary, there are 
many factors that can affect future production and 
without a holistic approach it is difficult to make reli¬ 
able projections. 

The current world dependence on fossil fuels also 
brings about another problem. Since fossil fuels dom¬ 
inate the existing energy system, they will have to 
power any possible shift to renewable or other energy 
sources [78]. The effect this has on fossil fuel reserves 
will depend largely on the size and speed of the shift, 


since fossil fuel energy used will be additional to that 
already in use. 

In conclusion, there is a lot of coal and peat avail¬ 
able in terms of resources. However, far from all the 
resources are suitable for production, and future pro¬ 
duction will largely depend on the choices people 
make. The model and assumptions about recoverabil¬ 
ity have a significant impact on future production pro¬ 
jections. The key point is that huge worldwide resource 
estimates for coal and peat do not automatically trans¬ 
late into an assured supply. Vast resources are present if 
society demands coal and peat and is prepared to pay 
the price for recovering them. 

Future Directions 

This section should start with a frank admission that 
coal will remain one of the most important energy 
sources for mankind for the foreseeable future. Future 
development of coal production will be linked to the 
development of global energy supply, which will affect 
virtually all aspects of society in some way. However, 
production is a complex issue that requires a holistic 
treatment. This chapter has attempted to highlight 
several of the indicators and parameters that affect 
future production of coal and peat. 

It is also obvious that more work needs to be done 
in exploration and geology to better understand the 
occurrence of coal and peat. Better survey data and 
increased geologic certainty would also help to improve 
the accuracy of resource estimates - especially in those 
countries and regions where few assessments have been 
made or where only old data is available. There is no 
substitute for a good geological understanding of depo- 
sitional systems and detailed surveys when making 
resource estimates. 

Estimation of reserves and recoverable volumes 
involve consideration of numerous economic, techni¬ 
cal, legal, and social parameters relative to specific 
markets and expected demand during a present and 
period with economic and regulatory uncertainty. 
Improved understanding and handling of this complex 
set of factors are required for more accurate and reliable 
assessments in the future. Recoverability is not a topic 
that should be taken lightly or oversimplified. Instead it 
needs holistic treatment to provide reasonable outlooks 
for future world supply of coal and peat. 
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Glossary 

Ash A measure of coal purity that consists of the 
noncombustible residue that remains after coal is 
completely burned. 

Cut size The particle size that has an equal probability 
of reporting to either the oversize or undersize 
product of a sizing device such as a screen or 
hydraulic classifier. 

Cut point The density corresponding to a particle that 
has an equal probability of reporting to either the 
clean coal or reject product of a density-based 
separator. 

Dense medium An aqueous suspension of high- 
density micron-sized particles (usually magnetite) 
used to separate coal from rock based on differences 
in density. 

Organic efficiency An indicator of coal cleaning per¬ 
formance calculated by dividing actual clean coal 
yield by the theoretical maximum yield attainable 
at the same ash content according to washability 
analysis. 

Washability A laboratory procedure that uses dense 
liquids (usually organic) to partition particles 
into various density fractions that represent the 
ideal separation potential of coal (also called 
float-sink analysis). 

Yield An indicator of separation performance (usually 
reported as a percentage) that is calculated by divid¬ 
ing the clean coal tonnage by the feed coal tonnage. 

Definition of the Subject 

Coal preparation, which may also be called washing, 
cleaning or processing, is the methodology by which 
coal feedstocks are upgraded in order to reduce freight 
costs, improve utilization properties and minimize 
environmental impacts. The upgrading, which occurs 
after mining and before transport of the cleaned prod¬ 
uct to market, is achieved using a variety of low-cost 
solid-solid and solid-liquid separation processes. 
Examples of processing technologies used by this 
industry include screening, classification, dense 
medium separation, gravity concentration, froth flota¬ 
tion, centrifugation, filtration and thickening. Several 
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of these processes also play an important role in envi¬ 
ronmental control for the preparation facility. 

Introduction 

According to the annual census of coal preparation 
plants conducted by Coal Age [1], the USA operates 
286 coal preparation plants in 12 states. This number is 
relatively small by comparison to the worldwide fleet 
which is estimated to be 2,283 plants [2]. The capacity 
of the plants can range from less than 200 t/h for small 
operations to 6,000 t/h or more for large industrial 
plants. The need for such facilities can be attributed 
to the fact that freshly mined coals contain a heteroge¬ 
neous mixture of both organic (carbonaceous) and 
inorganic (mineral) matter. The inorganic matter 
includes noncombustible materials such as shale, slate 
and clay. These impurities reduce the coal heating 
value, leave behind an undesirable ash residue, and 
increase the cost of transporting coal to market. The 
presence of unwanted surface moisture also reduces the 
heating value and can lead to handling and freezing 
issues for consumers. Therefore, essentially all coal 
supply agreements with electrical power stations 
impose strict limitations on the specific energy (heat), 
ash and moisture contents of purchased coal. More¬ 
over, as the first step in the power cycle, coal prepara¬ 
tion plants improve the environmental acceptability of 
coal by removing impurities that may be transformed 
into harmful gaseous or particulate pollutants when 
burned. These pollutants typically include particulates 
(fly ash) and sulfur dioxide (S0 2 ) as well as toxic trace 
elements such as mercury. The presence of mineral 
impurities can also influence the suitability of coal for 
high-end uses such as the manufacture of metallurgical 
coke or generation of petrochemicals and synthetic 
fuels. As such, coal preparation is typically needed to 
achieve the high levels of coal purity demanded by 
these secondary markets. 

This entry provides a brief overview of some of the 
technological systems used in coal preparation. In 
addition, this entry highlights the benefits resulting 
from coal preparation activities as well as the issues 
associated with the sustained operation of coal 
processing facilities. This review suggests that coal 
preparation will continue to have a significant impact 
on the cost, recovery and quality of coal supplies. 


However, future improvements in separation technol¬ 
ogy and practices are needed to provide further 
reductions in waste generation and downstream envi¬ 
ronmental impacts. In many cases, these improvements 
may also generate incremental revenue due to the 
recovery of usable coal from waste streams, which 
could provide a financial incentive for pursuing 
these activities. 

Assessment of Cleaning Characteristics 

The capability of coal preparation to improve coal 
quality varies widely due to inherent differences in the 
liberation characteristics of mined coals. The degree of 
liberation is determined by the relative proportion of 
composite particles (i.e., particles of intermixed coal 
and rock that are locked together) present in a partic¬ 
ular coal. The composite particles make it impossible to 
physically separate all of the carbonaceous matter from 
the host inorganic matter. Consequently, plant opera¬ 
tors are often forced to sacrifice coal recovery by 
discarding some composite particles as waste in order 
to improve clean coal quality to a level that can meet 
customer specifications. 

The theoretical trade-off between coal recovery and 
quality can be quantified in the laboratory using wash- 
ability (float-sink) analysis. An example of experimen¬ 
tal data collected from a float-sink analysis is shown in 
Fig. 1. The analysis is performed by sequentially passing 
a coal sample through containers containing liquids 
(usually organic) of increasingly higher densities [3]. 
Pure coal has a relatively low specific gravity (SG < 1.3) 
and is collected as a float product from the first con¬ 
tainer, whereas pure rock is much denser (SG > 2.2) 
and is collected as a sink product from the last con¬ 
tainer. Composite particles report as float products in 
the intermediate containers filled with liquids having 
densities between that of the first and last containers. 
After density partitioning, the products from this pro¬ 
cedure are dried, weighed and analyzed for quality (ash, 
sulfur, mercury, etc.). Washability data are very useful 
for predicting and analyzing the performance of coal 
preparation plants since most cleaning processes sepa¬ 
rate coal and rock based on differences in density. 

Coal washability has a tremendous impact on 
how effectively a preparation plant can upgrade a par¬ 
ticular coal feedstock. However, the types of processes 
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Example: A perfect separation at 1.6 SG would 
provide 72.2% yield of clean coal at 9.0% ash 
and 27.8% yield of reject at 78.9% ash. Thus, a 
plant operating with a 90% organic efficiency 
would be expected to provide a 65.0% clean coal 
yield at 9.0% ash (i.e., 72.2% x 0.9 = 65.0%).] 
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Coal Preparation. Figure 1 

Float-sink (washability) analysis for a 28.4% ash run-of-mine coal 


employed and practices used for operation and main¬ 
tenance (O&M) can also greatly influence the perfor¬ 
mance of the preparation facility. This effectiveness is 
typically reported as an organic efficiency, which is 
defined as the yield of clean coal actually produced by 
the separation divided by the theoretical maximum 
yield of clean coal that could be achieved at the same 
ash content according to washability analysis. Organic 
efficiencies are often in the high 90 percentiles for well 
designed and well run operations, although values well 
below this desired level are not uncommon for prob¬ 
lematic plants. Inefficient processes or practices, which 
misplace significant amounts of clean coal into waste 
and/or rock into the clean product, have a large nega¬ 
tive impact on profitability. For example, a decrease in 
organic efficiency of just one percentage point can 
potentially represent a reduction in plant profitability 
of 20% or more. 


Coal Processing Operations 

During the early part of the last century, coal mines 
made use of labor intensive sorting methods such as 
manual hand-picking to remove unwanted rock from 
mined coal. This inefficient approach was soon 
replaced by simple mechanical sorting processes that 
reduced misplacement and provided higher levels of 
productivity. These early plants typically cleaned only 
the coarser particles (usually larger than >5-10 mm) 
and either recombined the untreated fine particles back 
into the washed product or discarded the fines as waste. 
These inefficient systems dumped large tonnages of 
coal into waste piles, some of which are being re¬ 
mined and reprocessed today. These historic periods 
were followed by many decades of technology develop¬ 
ment that ultimately led to the design and operation of 
relatively efficient plants that are capable of complete or 
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partial upgrading of the entire size range of mined 
coals. Many modern coal preparation plants operating 
today are as complex as industrial facilities once 
employed only by the chemical processing industry. 

Figure 2 shows a simplified process flowsheet for 
a modern plant that includes operations for particle 
sizing, cleaning and dewatering. This sequence of 
operations, which is called a circuit, may be repeated 
for different size fractions since the processes used 
in coal preparation have a limited range of applicability 
in terms of particle size (Fig. 3). Modern plants may 
include as many as four separate processing circuits for 
treating the coarse (plus 10 mm), small (10 x 1 mm), 
fine (1 x 0.15 mm), and ultrafine (minus 0.15 mm) 


feed material. Although many commonalities exist, the 
final selection of what number of circuits to use, which 
types of unit operations to employ, and how they 
should be configured, is highly subjective and depen¬ 
dent on the characteristic properties of the feed coal in 
terms of size and composition. 

Unit Operations for Particle Sizing 

Coal produced by mechanized mining operations con¬ 
tains particles as small as fine powder and as large as 
several hundred millimeters. Particles too large to pass 
into the plant are crushed to an appropriate upper size, 
or rejected where insufficient recoverable coal is pre¬ 
sent in the coarse size fractions. The crushed material is 


c 



Notes : 1 -Raw coat and destimescreen decks may be on the same machine, 2 -D&R screens primarily used to recover 
dense medium, 3 -Sieve bends may also be combined with dean coal classifying cyclones. 


Coal Preparation. Figure 2 

Simplified flowsheet for a modem coal preparation plant showing unit operations for sizing, cleaning and dewatering 
of four different size fractions required for efficiency coal upgrading 
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Coal Preparation. Figure 3 

Effective range of particle sizes treated by various coal preparation processes 


then segregated into groups having well defined 
maximum and minimum sizes. Screens are typically 
employed for sizing coarser particles, while combina¬ 
tions of stationary sieves and classifying cyclones are 
used for sizing finer particles. Figure 3a shows the 
typical sizes of particles that can be effectively handled 
by common types of industrial sizing equipment. 

Screens and Sieves Screens are mechanical sizing 
devices that use a mesh or perforated plate to sort 
particles into fine (particles that pass through the 
screen openings) and coarse (particles that are retained 
on the screen surface). Screens are used for a wide range 
of purposes including the extraction of large rock and 
trash prior to treatment, the production of optimum 
size fractions for separation devices and the rinsing/ 
dewatering of products from cleaning operations. The 
types of screens commonly used in coal preparation 
include grizzly, vibrating and high-frequency screens. 
A grizzly is a series of evenly spaced parallel bars that 
are positioned in the direction of material flow. This 
simple device is used only for gross sizing of very coarse 
particles (>50 mm) and is generally applied only for 
the protection of equipment from large oversize 
material. Vibrating screens are the most common 


equipment for sizing particles and are used in almost 
every aspect of coal preparation. Vibrating screens 
(Fig. 4) utilize an out-of-balance rotating mechanism 
to create a vibrating motion to sort particles and to 
move material along the screen surface. High-frequency 
screens (Fig. 5) are normally used for dewatering fine 
coal or rock by retaining slurry between two opposing 
inclines. As the name implies, this type of screen 
operates at a high vibrating frequency, which is benefi¬ 
cial for dewatering fine solids. 

The sizing of fine coal is particularly difficult due to 
the increased likelihood of plugging and the large num¬ 
ber of particles that must pass through the screen 
surface. As such, fine sizing is performed in many 
cases using sieve bends. A sieve bend (Fig. 6) consists 
of a curved panel constructed from inverted trapezoi¬ 
dal bars (profile or wedge wire) placed at right angles to 
fluid flow. The capacity of the sieve is relatively high 
since the leading edge of the trapezoidal bars “slice” 
material from the flowing stream. However, due to the 
unique geometry, the particle size retained by the sieve 
is smaller than the spacing between bars. Sieve bends 
are commonly used for fine particle dewatering and 
may also be used ahead of vibrating screens to enhance 
sizing performance. 
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Coal Preparation. Figure 4 Coa l preparation. Figure 6 

Vibrating screen used for feed coal sizing Sieve bend used to remove fine "slimes" from coal 



Coal Preparation. Figure 5 

High-frequency screen used for dewatering of fine rock 


Classifying Cyclones Classification is used in coal 
preparation for fine sizing where conventional screen¬ 
ing and sieving becomes impractical or too expensive. 
Classification exploits differences in the settling rates of 


particles of different size (i.e., smaller particles settle 
slower than larger particles). Classification is usually 
applied to dilute suspensions of < 1 mm diameter par¬ 
ticles. This technique is generally less efficient than 
screening or sieving since the separation can be 
influenced by particle shape and density as well as 
particle size. The most common type of classification 
device used in the coal industry is the classifying 
cyclone (Fig. 7). In this device, dilute slurry is pumped 
under pressure to multiple parallel units (banks) of 
cyclones. The rotating flow field within the cyclone 
creates a centrifugal force that increases the settling 
rate of the particles. Larger particles are forced to the 
wall of the cyclone where they are carried to the dis¬ 
charge port (apex) at the bottom of the cyclone, while 
finer particles exit out the top of the cyclone (vortex 
finder). Classifying cyclones are commonly applied to 
obtain cut sizes of 0.10-0.15 mm and represent the only 
practical option for sizing ultrafine particles (cut size of 
50 pm or less). Smaller cyclones are typically used to 
size finer particles since these units provide a higher 
centrifugal force. Classifying cyclones are very popular 
in the coal industry since they offer a high unit capacity, 
operational flexibility and low maintenance. 
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Coal Preparation. Figure 7 

Classifying cyclone used to hydraulically size fine coal 

Unit Operations for Coal Cleaning 

The primary purpose of coal preparation is to separate 
valuable carbonaceous material from contaminants in 
the surrounding host rock. The separation is typically 
accomplished using low-cost processes that exploit dif¬ 
ferences in physical properties that vary according to 
mineralogical composition of individual particles. 
Some of the common properties that are used to sep¬ 
arate coal and rock include size, density and wettability. 
As shown in Fig. 3b, the effectiveness of different types 
of separators is limited to a relatively narrow range of 
particles sizes due to limitations associated with equip¬ 
ment efficiency and geometry. 

Dense Medium Separators The most common pro¬ 
cess for upgrading coarse coal (>12.5 mm) is the dense 
medium vessel. While a wide variety of different 
designs exist in industry, one of the most popular is 
the chain-and-flight vessel (Fig. 8). This density-based 
separator consists of a large open tank through which 
an aqueous suspension of finely pulverized magnetite 
(SG = 5.2) is circulated. The magnetite creates a 
suspension with an artificial density that is between 
that of coal and rock. Consequently, lighter coal parti¬ 
cles introduced into the medium float to the surface 



Coal Preparation. Figure 8 

Chain-and-flight dense medium vessel used to separate 
coarse coal and rock 

of the vessel where they are transported by the 
overflowing medium to a collection screen. Waste 
rock, which is much denser, sinks to the bottom of 
the vessel where it is collected by a series of mechanical 
scrapers called flights. The flights travel across the 
bottom of the vessel and eventually drag the waste 
rock over a discharge lip at one end of the separator. 
The unit is highly flexible since the quality of the clean 
coal product can be readily adjusted by varying the 
density of the medium from 1.3 to 1.7 SG by control¬ 
ling the amount of magnetite and water introduced 
into the suspension. 

Particles smaller than about 0.5 mm float or sink 
too slowly in a static vessel to be effectively separated. 
Therefore, dense medium cyclones (DMCs) are typi¬ 
cally used to clean coal and rock in the size range 
between 0.5 and 12.5 mm. These high-capacity devices 
(Fig. 9) make use of the same basic principle as dense 
medium vessels, i.e., an artificial magnetite-water 
medium is used to separate low-density coal from 
high-density rock. In this case, however, the rate of 
separation is greatly increased by the gravitational 
effect created by passing medium and coal through 
one or more cyclones. Dense particles of rock sink in 
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Coal Preparation. Figure 9 

Dense medium cyclone used to separate small coal and 
rock 



Coal Preparation. Figure 10 

Bank of twin water-only cyclones used to separate fine coal 
and rock 


the rotating medium and are pushed toward the wall of 
the cyclone where they are eventually rejected out the 
bottom. Lighter particles of coal rise to the surface of 
the rotating medium and are carried with the bulk of 
the medium flow out the top of the cyclone. Recent 
advances in cyclone technology have allowed the upper 
particle size limit for DMCs to increase in many current 
applications to 60-75 mm. This development has made 
it possible to simplify plant circuitry in some facilities 
by eliminating the need for a dense medium vessel. 

The primary disadvantage of dense medium pro¬ 
cesses is that they require considerable ancillary 
equipment to support the operation. A typical circuit 
requires that the clean coal and rock products pass over 
drain-and-rinse (D&R) screens to wash the medium 
from the surfaces of the products and to dewater the 
particles. The medium recovered from the drain sec¬ 
tion is circulated back to the separator by a pump, 
while medium from the rinse section is diluted by 
spray water and must be concentrated before being 
fed back. The concentration is achieved using magnetic 
separations that effectively recover magnetite from 
nonmagnetic particles that would otherwise contami¬ 
nate the circulating medium. 


Water-Based Separators A wide variety of water- 
based density separators, which avoid the need to uti¬ 
lize a artificial dense medium, are also available for 
separating coal and rock in the particle size range 
between 0.15 and 1 mm. The most common methods 
include water-only cyclones (WOCs) and spirals. 
A WOC is similar to a classifying cyclone, but typically 
has a stubby wide-angled conical bottom (Fig. 10). 
Separations occur in a WOC due to differences in the 
settling rates of coal and rock in the centrifugal field 
within the cyclone. The separation is also enhanced by 
the formation of autogenous medium created by the 
natural fines already in the feed slurry. These units are 
often employed in two stages or in combination with 
other water-based separators to improve performance. 

Another water-based separator for treating fine coal 
that has grown tremendously in popularity is the spiral 
concentrator (Fig. 11). A spiral consists of a corkscrew 
shaped conduit with a modified semicircular cross- 
section. During operation, feed slurry is introduced to 
the top of the spiral and is permitted to flow by gravity 
along the helical path to the bottom of the unit. Parti¬ 
cles in the flowing film are stratified such that lighter 
coal particles are forced to the outer wall of the spiral, 
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whereas heavier particles are forced inward to the cen¬ 
ter of the spiral. The segregated bands of heavy to light 
materials are collected at the bottom of the spiral. 
Adjustable diverters (called splitters) are used to con¬ 
trol the proportions of particles that report to the 
various products. A three-product split is usually pro¬ 
duced, giving rise to three primary products containing 
clean coal product, refuse and misplaced “middlings.” 
Because of the low unit capacity (2-4 t/h), spirals are 
usually arranged in groups fed by an overhead radial 
distributor. To save space, several spirals (two or three) 
may be intertwined along a single central axis. Spirals 
have been successfully utilized in combination with 
WOCs to improve the efficiency of separating fine coal. 

Froth Flotation Separators The smallest size fraction 
of coal (<0.15 mm) cannot be effectively separated 
using density-based separators. For these particles, 
a process known as froth flotation must be used. Flo¬ 
tation is a physicochemical process that separates par¬ 
ticles based on differences in surface wettability. 
Flotation takes place by passing finely dispersed air 
bubbles through an aqueous suspension of particles. 
A chemical reagent, called a frother, is added to pro¬ 
mote the formation of small bubbles. Typical addition 



Coal Preparation. Figure 11 

Bank of spirals used to separate fine coal and rock 


rates are in the order of 0.1-0.5 lb of reagent per ton of 
coal feed. Coal particles, which are naturally hydropho¬ 
bic, selectively attach to air bubbles and are buoyed 
to the surface of the pulp where they are collected 
from a coal-laden froth bed that forms atop the cell. 
Most of the impurities that associate with coal are 
hydrophilic and remain suspended until they are 
discharged as a dilute slurry waste. Another chemical 
additive, called a collector, may be added to improve 
the adhesion between air bubbles and coal particles. 
Collectors are commonly hydrocarbon liquids such as 
diesel fuel or fuel oil. In some plants, clay slimes (<0.03 
mm) are removed ahead of flotation using small diam¬ 
eter classifying cyclones to minimize the carryover 
of the high ash slimes into the froth product with the 
process water. 

Most industrial installations make use of mechani¬ 
cal flotation machines (Fig. 12). These machines con¬ 
sist of a series of agitated tanks (4-6 cells) through 
which fine coal slurry is passed. The agitators are used 
to ensure that larger particles are kept in suspension 
and to disperse air that enters down through the rotat¬ 
ing shaft assembly. The air is either injected into the 
cell using a blower or drawn into the cell by the negative 
pressure created by the rotating impeller. Most 



Coal Preparation. Figure 12 

Conventional flotation bank used to clean ultrafine coal 
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commercial units are very similar, although some var¬ 
iations exist in terms of cell geometry and impeller 
design. A disadvantage of conventional flotation cells 
is that they allow a small amount of clay slimes to be 
recovered with the water that reports to the froth prod¬ 
uct. Therefore, in cases where high clay concentrations 
are expected, advanced flotation processes such as col¬ 
umn cells (Fig. 13) are often used. Column cells virtu¬ 
ally eliminate the recovery of clay slimes by washing the 
froth with a countercurrent flow of water. This feature 
allows columns to produce a higher purity coal product 
without sacrificing coal recovery. 

Unit Operations for Coal Dewatering 

Solid-liquid separators are used downstream of coal 
cleaning processes to remove unwanted surface mois¬ 
ture from the sorted products. Excess moisture lowers 
the heating value of the coal, may lead to handling/ 
freezing problems, and increases the overall costs of 
transporting the coal to customer sites. As shown in 
Fig. 3 c, several different types of mechanical dewatering 
methods are required depending on the size of particles 
to be treated. The removal of water from the surfaces of 
coarser (>5 mm) coal is predominantly carried out 



using screens, where the prime force involved is gravity. 
Finer particles have a higher surface area and tend to 
have correspondingly higher moisture content. There 
finer particles are typically dewatered using centrifugal 
methods or filtration systems. 

Centrifugal Dewatering Centrifugal dryers, which 
consist of a rotating screen or basket, use centrifugal 
force to pull water away from the surfaces of coal 
particles. These devices operate in much the same 
fashion as the spin cycle in a washing machine. For 
coarser fractions, two popular centrifugal dryers are the 
vibratory centrifuge and the screen-scroll centrifuge. 
The vibratory centrifuge (Fig. 14) uses a vibrating 
mechanism to move solids through the unit, while 
the screen-scroll design uses a screw conveyor to 
move the solids. For finer particles (<1 mm), both 
the screen-scroll centrifuge and another popular 
design, called a screen-bowl centrifuge, may be used. 
The screen-bowl centrifuge is a horizontal unit 
that consists of both a solid bowl section and a screen 
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Coal Preparation. Figure 13 Coal Preparation. Figure 14 

Column-type flotation cell used to clean ultrafine coal Vibratory centrifugal dryer used to dewater small/fine coal 
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Coal Preparation. Figure 15 

Screen-bowl centrifuge used to dewater fine coal 

section (Fig. 15). Feed slurry is introduced into the 
bowl section where most of the solids settle out under 
the influence of the centrifugal field. A rotating scroll 
transports the settled solids up a beach and across the 
screen section where additional dewatering occurs 
before the solids are discharged. Solids that pass 
through the screen section are typically added back to 
the machine with the fresh feed. This unit is capable of 
providing low product moistures, although some 
ultrafine solids are lost with the main effluent that is 
discarded from the solid bowl section. 

Filtration Dewatering Filtration processes may be 
used to dewater fine coal whenever a high coal recovery 
is desirable. Filtration involves the entrapment of fine 
solids as a cake against a porous filtering media. Tradi¬ 
tionally, flotation concentrates have been dewatered 
using some form of vacuum filtration. These units are 
capable of high coal recoveries (>97%) while generat¬ 
ing product moisture in the 20-30% range. The most 
popular type of vacuum filter is the disc filter (Fig. 16). 
This device consists of a series of circular filtration discs 
that are constructed from independent wedge shaped 
segments. At the beginning of a cycle, a segment rotates 
down into the filter tub filled with fine coal slurry. 



Coal Preparation. Figure 16 

Disc vacuum filter used to dewater fine coal 

Vacuum is applied to the segment so that water is 
drawn through the filter media and a particle cake is 
formed. The resultant cake continues to dry as the 
segment rotates out of the slurry and over the top of 
the machine. The drying cycle ends when the vacuum is 
removed and the cake is discharged using scrapers and 
a reverse flow of compressed air. 

Thermal Dewatering Thermal dryers have been suc¬ 
cessfully employed in the coal industry to reduce clean 
coal moisture to single-digit values whenever mechan¬ 
ical dewatering processes are incapable of meeting con¬ 
tract specifications. The most popular design is the 
fluidized bed dryer (Fig. 17), which uses coal, oil or 
gas as the fuel source to heat the incoming air stream. 
The amount of fuel required depends on the amount of 
water that must be evaporated which, in turn, depends 
on the amount of coal fed to the dryer and the percent¬ 
age of water in the dewatered product [4]. When oper¬ 
ating correctly, thermal dryers can reduce the clean coal 
moisture to less than 6% by weight at a unit cost 
proportional to the tonnage of water evaporated [5]. 
Unfortunately, thermal dryers require high capital 
costs, often in excess of several million or more. Dryers 
can also suffer from emission problems associated 
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Coal Preparation. Figure 17 

Thermal dryer used to dry coal to low moisture contents 

fugitive dust and poor opacity. Emissions of nitrous 
oxides, sulfur dioxide, volatile organic compounds 
(VOCs) may also present issues for some sites seeking 
operating permits. Moreover, thermal drying of com¬ 
bustible particles of coal can present safety hazards 
resulting from accidental fires or dust/gas explosions. 

Thickening and Clarification Another essential 
solid-liquid separation process used in coal prepara¬ 
tion is thickening. Thickening (or clarification) pro¬ 
cesses are used to treat the process water so that it can 
be recycled and reused within the plant. A thickener 
consists of a large tank in which particles are permitted 
to settle, thereby producing a clarified overflow and 
thickened underflow (Fig. 18). Thickeners are usually 
fed dilute slurry (<5-10% solids) that is rejected from 
the fine coal cleaning circuits. The clarified water that 
overflows along the circumference of the thickener is 
recovered and reused as process water. The thickened 
underflow, which often contains 20-35% solids, is 
transported to the center of the thickener by a series 
of rotating rakes. The thickened sludge is then pumped 
to an appropriate disposal area or is further dewatered 
prior to disposal. Chemical additives (coagulants and 
flocculants) are usually introduced ahead of the thick¬ 
ener to promote the aggregation of ultrafine particles 




Coal Preparation. Figure 18 

Conventional thickener used to clarify process water and 
thicken solids 


so as to enhance water clarity and increase settling 
rates. The pH levels must also be carefully monitored 
and controlled. Conventional thickeners, which are 
typically 50-200 ft diameter, require constant moni¬ 
toring to ensure that overflow clarity and underflow 
density is maintained and the rake mechanism is 
not overloaded. A torque sensor on the central drive 
is typically used to monitor rake loading, while a 
nuclear density gauge is often used to monitor 
underflow density. 

Waste Disposal and Storage 

The separation processes used in coal preparation often 
generate large volumes of waste rock and slurry. As 
a result, the final step in coal preparation involves the 
disposal and permanent storage of these waste 
products into various types of surface and/or under¬ 
ground repositories. Although many variations exist, 
a common approach is to use separate refuse piles and 
slurry impoundments. Refuse piles are designed to 
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receive coarse particles of waste rock that can be easily 
dewatered by screens or sieves. This material is rela¬ 
tively easy to handle and can be transported by truck or 
belt haulage systems to the disposal area. On the other 
hand, fine coal wastes are difficult to dewater and are 
typically discarded in slurry form. The waste slurry 
contains water, coal fines, silt, clay and other fine 
mineral particles from the processing plant. In most 
cases, the slurry is discarded into an impoundment 
(Fig. 19). An impoundment is an engineered structure 
consisting of a large volume earthen settling basin 
formed behind a man-made dam or embankment. 
The dam or embankment is usually constructed from 
compacted coarse refuse material. The waste slurry is 
transferred to the impoundment by pumping thick¬ 
ened underflow from the plant thickener through 
a pipeline. The volume of the impoundment must be 
sufficiently large to ensure that fine particles settle by 
gravity before the clarified water at the surface is 
recycled back to the plant for reuse. 

Beneficial Impacts of Coal Preparation 

Coal preparation provides several attractive benefits 
from both an economic and environmental perspective. 



Coal Preparation. Figure 19 

Active slurry impoundment used for fine waste disposal 
(before reclamation) 


The most recognized of these include increased coal 
reserves, lower transportation costs, improved utiliza¬ 
tion properties and abatement of pollution. 

Coal Reserves 

Perhaps no other technology has had a greater impact 
on expanding the reserve base of economically recov¬ 
erable coal than coal preparation. Preparation plants 
employ low-cost physical separation processes to con¬ 
vert run-of-mine coal resources into saleable coal 
reserves. The percentage of clean coal tonnage gener¬ 
ated from each ton of run-of-mine coal is commonly 
referred to as plant yield. This parameter is one of the 
most difficult to estimate when evaluating the reserve 
base for a previously undefined coal property. The yield 
can be influenced by several factors including the qual¬ 
ity of the in-place coal, the washability (separating 
characteristics) of the run-of-mine coal, the efficiency 
of the separation processes used by the preparation 
facility and the strictness of the quality demands 
imposed by the coal consumer. The quality of the in- 
place can be affected by small changes to mining prac¬ 
tices that directly influence out-of-seam dilution. 
Variations in washability, which reflect the selective 
liberation of composite particles of intermixed coal 
and rock, can also make estimations of coal yield 
unreliable when coal is subjected to size reduction. 

The average yield produced by coal preparation 
plants has steadily declined over the years due to 
a depletion of lower ash feeds and increased mining 
mechanization. For example, Fig. 20 shows the yield of 
clean coal currently obtained from a random survey of 
several major plants operating in the eastern USA. It is 
not uncommon for eastern operations to experience 
yields under 30-35%, thereby producing only 1 t of 
clean coal from three or more tons of mined product. 
An estimate complied from production records sup¬ 
plied by coal producers suggests that the average yield is 
now less than 50% (i.e., 49.8% zb 3.5%) for the total 
USA. This situation is expected to worsen as eastern 
reserves become thinner and more challenging to mine. 
A study reported by Weisenfluh et al. [6] indicated 
that nearly 52% of the remaining eastern Kentucky 
coalfield resources are located in coal seams that are 
0.35-0.70 m in thickness, while 31% are in 0.70- 
1.1 m thick seams. Likewise, a study of the Virginia 
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Coal Preparation. Figure 20 

Examples of clean coal yield values for a selected 
sampling of eastern US plants 


coalfields found that 30% of the total reserve base 
(more than 200 million tons) exists in seams with 
a thickness less than 0.70 m [7]. Consequently, ever 
increasing amounts of rock from out-of-seam dilution 
are being mined, loaded and hauled to preparation 
plants for removal and disposal. 

Coal Transportation 

Most of the coal consumed in the USA is used for the 
production of electricity. The cost of transporting the 
coal to the power station is usually borne by the utility 
and paid based on the delivered tonnage. The mine 
operator is also paid by the utility based on tonnage, 
although the unit price is typically adjusted up or down 
to account for the actual heat content of the supplied 
coal fuel. In most cases, this simple pricing structure 
provides the base economic justification for the oper¬ 
ation of coal preparation facilities. The high-ash rock 
rejected by coal preparation plants contains insufficient 


heating value to justify the shipment of this material to 
the utility. The savings in transportation costs are 
directly proportional to the increase in heating value. 

Cost-benefit studies suggest that the economics of 
coal production are more sensitive to transportation 
cost than to any other factor. For example, a study by 
Norton [8] was one of the first to demonstrate how 
unfavorable changes in the cost of mining, processing 
and transportation affect profitability. The study con¬ 
cluded that the cost of coal transportation had the 
greatest overall impact on total revenue. This study 
also indicated that the clean coal yield was second 
only to transportation cost in determining revenues. 
In contrast, increases in the capital and operating costs 
of the coal preparation plant had only a minor impact 
on revenue. Therefore, any steps taken to reduce prep¬ 
aration costs (fewer capital improvements, less main¬ 
tenance, workforce reductions, etc.) usually need to be 
closely examined to ensure that clean coal yield is not 
adversely impacted by these cost-cutting measures. 

Utility Performance 

The thermal efficiency of a power station is very impor¬ 
tant. A higher efficiency is obviously beneficial from 
an economic perspective since this provides a pro¬ 
portional improvement in generated revenue for the 
power station. Moreover, a higher efficiency also 
reduces the production of greenhouse gases and 
other pollutants of environmental concern since less 
coal fuel needs to be burned per unit of electrical 
power generated. 

One method for improving thermal efficiency is to 
use washed coals of higher quality can significantly 
improve the thermal efficiency of a boiler [9-12]. 
Higher quality coals are more reactive and require less 
excess air for effective combustion, thereby improving 
efficiency via a reduction in heat lost with the flue 
gas. Higher quality coals also improve efficiency by 
avoiding fouling/slagging problems in the boiler, 
which tend to raise the flue gas temperature and 
increase heat losses. The extent to which the proper 
application of coal preparation technology improves 
thermal efficiency is highly case specific and difficult 
to predict from purely theoretical considerations. The 
most reliable data for quantifying efficiency improve¬ 
ments are typically reported based on actual plant 
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studies. One such classic study [13] monitored 
improvements to boiler performance resulting from 
switching from an untreated coal (15% ash and 3.5% 
sulfur) to washed coal (9% ash and 2.8% sulfur) from 
the same mine. Despite the modest improvement in 
coal quality, the boiler efficiency increased from by 
about 1.5 absolute percentage points as a result burning 
better quality coal. The capacity factor also rose by 
almost 10% due to less fouling/slagging problems. 

The use of coal preparation technologies to 
improve boiler efficiency has also been a major benefit 
reported in other nations. The International Energy 
Agency (IEA) estimates that coal-fired power plants 
in India can increase thermal efficiency by up to 10 per¬ 
centage points by switching from unwashed to washed 
coal [ 14] . China also expects to make greater use of coal 
preparation technology to improve thermal efficiencies 
and environmental performance [15]. The average 
thermal efficiency in China has been reported to be 
less than 29% [ 16] , compared to around 38% in OECD 
countries. As such, coal preparation is expected to 
continue to have a large impact on the international 
community. 

Another benefit of coal preparation is that it 
removes impurities that have a significant influence 
on the O&M costs for a coal-fired boiler. Studies have 
demonstrated that the removal of abrasive mineral 
impurities such as pyrite and quartz can substantially 
reduce wear rates and increase the throughput capacity 
of utility pulverizers [17, 18]. The impacts associated 
with the abrasive wear and slagging/fouling of boiler 
tubes can also be mitigated to a large extent by utilizing 
washed coals that have been properly cleaned to 
remove unwanted mineral matter [19-21]. Vaninetti 
and Busch [22] have provided a detailed description 
of these problems and have developed empirical for¬ 
mula that can be used to assist in the evaluation of 
changes to coal quality in specific types of boilers. 

The handling characteristics of solid coal are also an 
important issue. A poor handling coal may hang in 
railcars, plug chutes and bins and stick to conveyor 
belts. These problems may result in unscheduled shut¬ 
downs, thereby reducing power station availability. 
Washed coals from preparation plants typically have 
superior handling characteristics to run-of-mine coals, 
especially if all or a portion of the ultrafines have been 
removed. These coals also typically present fewer 


problems in terms of unwanted dust generation, solids 
run-off during precipitation events and freezing prob¬ 
lems during colder months [23]. 

Pollution Abatement 

Coal preparation plays an important role in reducing 
the emissions of pollutants that associate with the min¬ 
eral matter contained in coal. These emissions nor¬ 
mally include solid particulate emissions such as fly 
ash as well as gaseous emissions of precursors associ¬ 
ated with acid-rain and air toxics. These emissions are 
strictly regulated for coal-fired utilities through various 
legislative acts such as the 1990 Clean Air Act Amend¬ 
ment (CAAA) and 2005 Clean Air Interstate Rule 
(CAIR). Moreover, coal preparation has a beneficial 
impact on reducing greenhouse gas emissions by 
increasing the thermal efficiencies. 

Noncombustible impurities present in the feed¬ 
stocks supplied to coal power stations generate waste 
steams as either bottom ash/slag or fly ash. Of these, the 
finest particles of fly ash emitted to the atmosphere are 
considered to be of greatest environmental concern due 
to their potential adverse impact on human respiratory 
health [24] . Power stations make use of several types of 
effective control technologies to minimize fine partic¬ 
ulate emissions. These postcombustion technologies 
include efficient processes such as electrostatic precip¬ 
itators (ESPs), fabric filters (FFs), cyclones and wet 
scrubbers. Modern control systems typically achieve 
better than 99.5% removal of all particulates and 
exceed 99.99% in some cases. However, standards for 
particulate emissions continue to become increasingly 
stringent as reflected by expanded regulations by the 
US EPA to include particles finer than 2.5 pm in new 
ambient air quality standards [25]. As such, there is 
continued interest in removing greater amounts of 
particulates upstream of other emission controls 
using coal preparation technologies. 

The separation processes used in coal preparation 
plants remove noncombustible minerals that ulti¬ 
mately affect the amount and type of particulate matter 
(PM) that passes downstream to emission control sys¬ 
tems. For these systems, proper levels of coal washing 
can be identified that effectively reduce ash loading 
and improve removal efficiencies. A recent presenta¬ 
tion by American Electric Power to the Asia Pacific 
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Partnership [26] concluded that “ Consistent and proper 
quality coal is best tool to improve plant operating per¬ 
formance and reduce PM and S0 2 emissions. Removal of 
some of the coal ash (includes rocks) at the mine is more 
economic than in the pulverizer, boiler, precipitator and 
scrubber Washing also minimizes the total amount of 
high-surface-area fly ash that is more hazardous to 
dispose due to its high reactivity. Even in cases where 
particulate controls are currently deemed adequate, 
greater use of coal preparation may be required in the 
future to compensate for deterioration in feedstock 
quality as higher quality coal reserves become depleted. 

Coal preparation has also played an important role 
in environmental compliance by reducing the sulfur 
contained in coal feedstocks prior to combustion 
[27]. Sulfur occurs in coal as three distinct forms 
(e.g., sulfate, organic and pyritic). Sulfate sulfur is 
present in very small quantities and is not considered 
a serious problem. Organic sulfur, which is part of the 
basic structure of coal, is not removable by conven¬ 
tional cleaning. The precombustion removal of organic 
sulfur is possible using chemical cleaning methods; 
however, these elaborate processes are not economi¬ 
cally competitive with flue gas scrubbing. Pyritic sulfur 
is present as discrete inclusions of iron sulfides distrib¬ 
uted within the coal matrix. As such, this form of sulfur 
has the potential to be removed by physical cleaning 
processes such as coal preparation [28]. Fortunately, 
many of the high sulfur coals in the USA also contain 
a high proportion of pyritic sulfur (Fig. 21). 

Coal preparation plants have been reported to 
remove up to 90% by weight of pyritic sulfur, although 
rejections are typically in the 30-70% range due to 
liberation constraints [28-30]. When compared to the 
postcombustion control of sulfur, coal preparation 
offers several distinct advantages including improved 
market flexibility, lower scrubber loading and concur¬ 
rent removal of other impurities (ash, trace elements, 
moisture, etc.). Although coal preparation does not 
directly impact the nitrogen content of coal, washing 
has been shown to help reduce NO* emissions by pro¬ 
viding a consistent high quality fuel that makes it easier 
to control the combustion environment [31]. 

Coal preparation also has a tremendous impact on 
the amount of air toxics released to the atmosphere. 
The 1990 CAAA contained provisions that established 
new emission standards for a variety of air toxics, which 
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Distribution of sulfur types typically in US coals 
(After Cavallaro et al. [29]) 


are commonly referred to as hazardous air pollutants 
(HAPs). Many of the HAPs identified in the CAAA are 
present as trace elements in coal [32, 33]. Some of the 
most noteworthy of these are antimony, arsenic, beryl¬ 
lium, cadmium, chlorine, chromium, cobalt, fluorine, 
lead, manganese, mercury, nickel, selenium and radio¬ 
nuclides. The concentrations of these elements are 
known to vary considerably from seam to seam, and 
in some cases within the same seam [34, 35]. During 
combustion at electrical utilities, these elements may 
be released to the atmosphere as solid compounds 
with the fly-ash and in the vapor phase with the flue¬ 
gas. Existing post-combustion control technologies, 
such as electrostatic precipitators, can be reasonably 
effective in reducing the concentration of trace ele¬ 
ments associated with fly-ash. These commonly 
include elements such as antimony, beryllium, cad¬ 
mium, cobalt, lead and manganese. Capture efficien¬ 
cies of >97% have been reported for electrostatic 
separators [36]. On the other hand, trace elements 
such as arsenic, chlorine, mercury and selenium have 
the potential to volatilize and are less effectively con¬ 
trolled by postcombustion methods. 

Studies have shown that many of the HAP precur¬ 
sors identified in the CAAA are associated with the 
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mineral matter commonly rejected by coal preparation 
plants. This approach to HAPs control is attractive 
since the waste rock rejected by coal preparation plants 
is coarser and has a lower reactivity than the high- 
surface-area ash generated by power stations [37]. In- 
plant sampling campaigns conducted by various 
researchers [36, 38] suggest a good correlation between 
the rejection of mineral matter and the removal of 
trace elements during physical cleaning. These find¬ 
ings are also supported by laboratory float-sink tests 
performed using a variety of eastern US coals [39-41]. 
These data suggest that trace elements are typically 
rejected at levels of 40-70% by weight using conven¬ 
tional preparation technologies. These values appear to 
be in good agreement with earlier values reported by 
Fonseca et al. [36], which showed an average trace 
element removal by conventional coal preparation of 
approximately 64% for six different coals (Fig. 22). 
On the other hand, the large degree of variability 
observed in the data from these and other studies 
suggest that the rejections of trace elements by coal 
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Coal Preparation. Figure 22 

Reduction in trace element content after coal 
preparation (After Fonseca et al. [36]) 


preparation are very site specific and need to be quan¬ 
tified on a case by case basis. 

Of the air toxics reduced by coal preparation, mer¬ 
cury is perhaps the trace element in coal of greatest 
environmental concern [35]. Mercury can be released 
during coal combustion and subsequently deposited in 
the environment. Ecological studies have shown that 
mercury bio-accumulates in the food chain as higher 
species consume lower life forms exposed to mercury 
contamination [42]. Data reported by the US Environ¬ 
mental Protection Agency (EPA) indicate that coal- 
fired utilities are currently the largest human-generated 
source of mercury releases in the USA. It is estimated 
that these plants release approximately 48 t annually 
[43]. In order to curb these emissions, EPA issued the 
world’s first-ever rule to cap and reduce mercury emis¬ 
sions for coal-fired power plants. Compliance options 
available to utilities include postcombustion capture of 
mercury by existing or new flue gas scrubbing technol¬ 
ogies as well as precombustion control of mercury by 
coal preparation and coal switching [44]. 

A recent study by Quick et al. [45] showed that the 
mercury content of coal delivered to utilities (based on 
ICR data) was lower than that of the in-ground coal 
resources in the USA (based on COALQUAL data). 
This comparison is shown in Fig. 23 for the primary 
eastern coal producing states. Based on this study, 
Quick et al. [45] concluded that “.. .selective mining 
and more extensive coal washing may accelerate the 
current trend toward lower mercury content in coal 
burned at US electric utilities...” and “.. .since recent 
reductions of sulfur emissions from coal-burning elec¬ 
tric utilities are largely due to a declining sulfur content 
of delivered coal, rather than from scrubbing combus¬ 
tion gases, these simple, low-cost approaches to reduce 
Hg emissions should not be overlooked.” 

According to Alderman [46], cleaning can reduce 
mercury by more than 50% in many eastern and west¬ 
ern coals and lignites, excluding southern Powder River 
basin coals. Greater rejections of mercury by coal prep¬ 
aration appear to be limited by inadequate liberation 
and/or the presence of organically associated mercury. 
Several studies have suggested that mercury has some 
degree of association with the iron sulfides present in 
many run-of-mine coals [47-49] . In fact, the data sum¬ 
marized in Fig. 24 suggest that the concentration of 
many metallic elements found in coal often correlate 
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Coal Preparation. Figure 23 

Comparison of mercury contents for delivered (ICR) and 
in-ground (COALQUAL) coals (after Quick et al. [45]) 
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energy consumed (and C0 2 generated) during coal 
transportation by increasing the specific heating value 
of the delivered coal. Smith [13] also found that freshly 
mined coals subjected to coal preparation processes 
tended to display slower releases of methane, another 
greenhouse gas. 
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Challenges for Coal Preparation 

Coal preparation offers may attractive benefits for coal- 
fired power generation. These typically include lower 
transportation costs, improved properties for coal uti¬ 
lization and reduced emissions of particulate and gas¬ 
eous pollutants. However, the industry also faces 
several challenges that need to be resolved to ensure 
that preparation plants can continue to operate at 
a profit without damage to the environment. These 
concerns can be generally classified as technical factors 
that relate to shortcomings in current processing 
systems, environmental factors that involve improving 
waste handling and disposal, or health and safety 
factors that may affect workers or citizens in the 
surrounding area. 


well with the presence of sulfide minerals [50]. As 
a result, the rejection level for mercury is often in the 
same range as the pyritic sulfur rejection for coals 
subjected to coal preparation. While liberation can be 
improved by reducing the topsize of the feed coal [51], 
this approach is difficult to justify in todays market¬ 
place due to the high costs associated with the fine 
grinding of coal [28]. Moreover, fine particles are also 
difficult and costly to upgrade, dewater and handle in 
existing coal preparation facilities. 

Another important impact of coal preparation is in 
the reduction of greenhouse gas emissions resulting 
from improvements in thermal efficiency, i.e., less 
C0 2 is produced per unit of electricity generated. 
Calculations indicate that a one percentage point 
improvement in thermal efficiency provides a two to 
three percentage point reduction in C0 2 emissions for 
a typical coal-fired utility. An investigation conducted 
by Couch [52] indicated that there are more than 4,000 
coal-fired boilers (>50 MW capacity) worldwide that 
could improve thermal efficiencies and reduce C0 2 
emissions by using coal preparation to improve coal 
quality. Moreover, von Hippel and Hayes [53] have 
noted that coal preparation reduces the amount of 


Technical Issues 

Cleaning of Fine Particles The freshly mined coals 
that are fed to coal preparation plants are typically 
crushed to liberate rock before washing and to limit 
the largest size of particles that enter the plant. Opera¬ 
tors prefer to keep the particles as large as possible 
(usually >50 mm) since fine coal processes are consid¬ 
erably less efficient and substantially more costly. This 
is unfortunate since the amount of high quality recov¬ 
erable coal within a given reserve can theoretically be 
increased by crushing. The size reduction improves the 
liberation by reducing the population of intermixed 
composite particles of coal and rock. A study 
conducted by DOE [30] indicates that the reserves of 
compliance coal in central Appalachia could be nearly 
doubled by efficiently cleaning at a particle size of 1 mm 
(Fig. 25). Although a systematic assessment has not 
been performed to date for trace elements, the size 
reduction would also be expected to substantially 
improve the removal of coal-related pollutants other 
than just ash and sulfur. 

Unfortunately, the solid-solid separation processes 
used industrially to treat fine coal represent the single 
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Coal Preparation. Figure 24 

Effect of sulfide mineral content on the concentration of trace metals in a sample of Pittsburgh seam coal 
(After Luttrell et al. [50]) 


greatest loss of potentially recoverable coal in a 
preparation facility. Field studies indicate that the 
froth flotation process, which is normally used 
to recover <0.2 mm coal, typically recovers only 
60-80% of the organic matter in this size range. This 
surface-based separation is inherently less effective in 
removing pyritic sulfur than density-based processes 
used to treat the coarser sizes of coal [54]. As such, the 
desulfurization of fine particles is also often poor in 
many operating preparation facilities. Therefore, the 
development of effective, low-cost processes for 
treating fine coal is a major need for the preparation 
industry. Effective solutions need to be found for 
improving the recovery, selectivity and capacity of 
froth flotation processes. This goal may be achieved 
through fundamental and applied studies that seek to 
understand and improve flotation chemistry, equip¬ 
ment design and process control. In addition, new 
types of density separators need to be developed for 
treating fine coal. Centrifugal separators such as 


enhanced gravity concentrators, which have been 
successfully applied in the gold industry, may prove 
useful for this purpose. These devices have the potential 
to reject significantly greater amounts of pyritic 
sulfur (and mercury) that is not efficiently removed 
using surface-based separators such as froth flotation 
[55, 56]. However, these machines have several lim¬ 
itations including low throughputs and control 
difficulties. 

Dewatering of Fine Particles The solid-solid sepa¬ 
ration processes employed by modern coal preparation 
plants require large amounts of process water. After 
cleaning, the unwanted water must be removed 
from the surfaces of the particles using mechanical 
dewatering equipment. Inefficient removal of moisture 
lowers the heating value, increases transport costs and 
creates handling/freezing problems for the cleaned 
coal. Coarse particles can be readily dewatered using 
simple screening systems, while finer particles require 
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Coal Preparation. Figure 25 

Effect of decreasing top size on low-sulfur coal 
availability (After Cavallero et al. [30]) 

more complicated unit operations such as centrifuges 
and filters. Unfortunately, the mechanical systems used 
to dewater fine coal are inefficient and costly [57] . Fines 
often represent as little as 10% of the total run-of-mine 
feed; however, this size fraction may contain one third 
or more of the total moisture in the delivered product. 

The availability of low-cost mechanical dewatering 
equipment that can efficiently remove moisture from 
fine coal is widely considered to be an important need 
for the coal preparation industry. Existing technologies 
for fine coal dewatering tend to produce unacceptably 
high moistures exceeding 25-35% by weight or inten¬ 
tionally sacrifice half or more of the ultrafines as a waste 
in an attempt to lower the product moisture. In addi¬ 
tion, these processes are typically the most energy 
intensive processes used in coal preparation, often con¬ 
suming 6-12 kW/t/h of dry solids processed [58]. 
Thermal dryer systems can effectively reduce mois¬ 
ture; however, these massive units require very large 
capital expenditures that are difficult to justify in the 
coal industry. Also, indirect thermal drying systems 
(e.g., Holoflite and Torus Disc) typically require 
200-400 kW/t/h of product for drying fine coal solids 
to single-digit moistures [59]. Moreover, stringent air 


quality standards make it impossible in many cases to 
obtain new operating permits for thermal dryers. 
Therefore, the coal preparation industry needs to 
develop new mechanical solid-liquid separation pro¬ 
cesses that are substantially more efficient in terms of 
removing moisture and less expensive to purchase, 
operate and maintain. Innovative systems are critically 
needed, which may require fundamental studies to 
identify controlling mechanisms that can lead to the 
development “breakthrough” technologies. 

Sizing of Fine Particles There are many technical 
challenges for the coal preparation industry that 
directly relate to size-size separations. Size-size sepa¬ 
rations are required ahead of solid-solid separations 
since these units are only effective within a narrow 
particle size range. Vibrating screens are generally effi¬ 
cient and cost effective for sizing and dewatering 
coarser particles. On the other hand, screening systems 
for finer particles, particularly those finer than 0.5 mm, 
tend to blind easily, wear quickly, and suffer from low 
throughput and low efficiency. The misplacement of 
incorrectly sized particles into equipment not designed 
to handle such sizes can have a large adverse impact on 
both the separating performance and maintenance 
requirements for a preparation plant. Another impor¬ 
tant issue with screening is the desliming of coal prod¬ 
ucts to remove ultrafine mineral sediments that are 
detrimental to quality and moisture. Many in the 
industry believe that the ability to screen ultrafine 
particles at sizes of 0.15 mm and smaller is particularly 
important. 

In addition to new screening systems, breakthrough 
technologies in ultrafine classification are also needed 
in the coal preparation industry [60]. Classification is 
the separation of particles due to differences in settling 
velocities, which depends not only on particle size, but 
also on particle density and shape. Firth and O’Brian 
[61] noted that while existing classification systems 
were adequate for the coal preparation industry of the 
past, “.. .it is apparent that further improvements in 
yield/ash/moisture relationship achieved by coal prep¬ 
aration plants will require increased efficiency in this 
size separation step.” The ability to classify and better 
utilize ultrafine particles will increase industry produc¬ 
tivity and reduce the generation of wastes. Many in the 
industry believe that currently there is a lack of efficient 
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ultrafine sizing and desliming technologies for separat¬ 
ing at cut sizes of 100 pm and finer. 

On-Line Analysis and Control Tremendous strides 
have been made in the automation and control of coal 
preparation plants during the past several decades [62] . 
The application of on-line sensors together with pro¬ 
grammable logic controllers (PLCs) has allowed mod¬ 
ern plants to operate more efficiently and to improve 
safety by reducing manpower requirements. On the 
other hand, the industry continues to struggle with 
the determination in real-time of the quality of their 
coal products [63]. Analyzers are commercially avail¬ 
able for the on-line analysis of many quality parameters 
for coal including ash, sulfur and moisture, although 
the measurement accuracy is often poor due to sam¬ 
pling and calibration issues. On-line analyzers do not 
currently exist that can be used to determine important 
data such as particle size distributions and washability 
in real-time. Therefore, improving automation, control 
and sensor technologies is a key challenge for the 
industry to overcome. Advances in on-line analysis 
would enhance the industry’s ability to maximize the 
recovered energy in a marketable product and mini¬ 
mize the generation of unwanted wastes. 

Environmental Issues 

Environmental compliance in the coal mining industry 
is strictly controlled by government regulations. His¬ 
tory suggests that these regulations were necessary in 
many cases to ensure uniform environmental steward¬ 
ship across the industry as a whole. Today, mining 
companies accept these environmental standards as 
good business practice and, in some cases, may go 
beyond simple compliance to promote goodwill and 
to set an example for others. Nevertheless, potential 
environmental issues associated with coal processing 
still exist. These issues have been generically classified 
by the World Bank to include air emissions, wastewater, 
hazardous materials, solid wastes and noise [64]. 

Coarse Waste Disposal Of the various challenges 
facing the coal preparation industry, perhaps none are 
as significant as those which relate to waste handling 
and disposal. This importance can be attributed to 
the fact that coal cleaning operations produce large 


volumes of waste that must be discarded into refuse 
piles or impoundments. Refuse piles are designed to 
receive coarse particles of waste rock that can be easily 
dewatered. This material is relatively easy to handle and 
can be safely transported by truck or belt haulage 
systems to the disposal area with little or no potential 
for environmental damage. On the other hand, the 
waste contains solid and liquid components that may 
present long-term disposal problems depending on 
the sizes, types and quantities of minerals present and 
the conditions under which the wastes are stored 
(dry vs wet, loose vs compacted, etc.). The factors 
play a key role in establishing the structural integrity 
(slope stability, surface water run-off, sediment con¬ 
tainment, seepage, etc.) and chemical nature (acid gen¬ 
eration, metal dissolution, etc.) of the wastes. Solid 
sediments and dissolved may be transported by rain¬ 
water where it would pollute streams or groundwater. 
Many of these issues can be effectively managed via 
proper disposal practices and monitoring programs. 
On the other hand, uncertainties related to the intricate 
biochemistry and complex hydrology of the waste war¬ 
rant continued investigation to fully assess the poten¬ 
tial for negative impacts associated with long-term 
disposal of coarse waste. Overall, improved waste char¬ 
acterization, including better methods to define the 
nature of wastes from coal preparation operations, is 
considered by many to be a high-priority need for the 
coal preparation industry. 

Slurry Disposal and Storage The handling and dis¬ 
posal of fine slurry waste is widely considered to be one 
of the most difficult challenges facing the coal prepa¬ 
ration industry. Fine wastes have historically been 
discarded into earthen impoundments for permanent 
disposal. An impoundment is an engineered structure 
consisting of a large volume settling basin formed 
behind a man-made dam or embankment. The waste, 
which is difficult to dewater, is normally pumped from 
the preparation plant thickener to the impoundment as 
slurry. The slurry contains water, coal fines, silt, clay 
and other fine mineral particulates from the processing 
plant. In most cases, the slurry is retained behind 
a man-made embankment (earthen dam) constructed 
from compacted refuse material. The impoundment is 
designed to have a volume that is sufficiently large to 
ensure that fine particles settle by gravity before the 
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clear water at the surface is recycled back to the plant 
for reuse. In some cases, chemical additives may be 
used to promote settling and to control pH. According 
to the National Research Council [65], the coal indus¬ 
try discards 70-90 million tons of fine wastes each year 
into existing impoundments. There are 713 active 
impoundments and ponds, most of which are located 
in central Appalachia. In addition, it has been esti¬ 
mated that more than 2.5 billion tons of fine coal 
waste has been discarded in the past into existing and 
abandoned impoundments. 

Impoundments, like any body of water contained 
behind a dam, can pose a safety and environmental 
risk if not properly constructed, monitored and 
maintained. Potential problems include structural fail¬ 
ures, seepage/piping, overtopping, acid drainage and 
black water discharges. Since the well-known Buffalo 
Creek dam failure in 1972, strict engineering standards 
have been mandated by government agencies to regu¬ 
late the design and operation of impoundments. 
A detailed report on the design and evaluation of tail¬ 
ings dams for the mining industry has been published 
by the EPA [66]. No failures of impoundment dams or 
overspills have occurred since this legislation was 
enacted. However, several breakthroughs of slurry 
into old mine workings beneath impoundments have 
occurred. The most notable was the Martin County 
incident, which released about 309 million gallons of 
slurry into streams and rivers in late 2000. A number of 
small accidental releases of black water have also been 
reported at various plant sites. 

Several alternatives to impoundments have been 
employed by the coal industry in an attempt to avoid 
any future potential for environmental damage. 
These alternatives have been examined extensively by 
Gardner et al. [67]. For example, some mines use new 
modes of slurry disposal such as slurry cells and under¬ 
ground injection wells. Slurry cells have been used 
successfully in some cases (albeit at higher cost), but 
limitations associated with maintaining less than 
20 acre-ft of settling area make this alternative diffi¬ 
cult to apply in all cases. Likewise, the use of injection 
wells has raised public concerns about groundwater 
contamination and well water quality [68, 69]. To 
overcome these problems, various types of mechan¬ 
ical solid-liquid separators have recently been investi¬ 
gated as a means of more fully dewatering the fine 


solids prior to disposal. Notable examples include 
deep-cone (paste) thickeners and different types of 
filters (pressure, vacuum, belt press, plate-and-frame, 
etc.). Deep-cone thickeners have been shown to be 
capable of producing a paste of 45-55% solids as 
underflow in waste coal applications [70]. Ideally, the 
paste can be discarded as a stacked pile, thereby 
avoiding the need for impoundments to handle waste 
slurry. This technology has already been demonstrated 
projects at two mining sites in the eastern USA [71, 
72]. Despite these advancements, the Committee on 
Coal Waste Impoundments [65] that recently exam¬ 
ined the issue of slurry disposal concluded that 
“.. .although there are alternatives to disposing of coal 
waste in impoundments , no specific alternative can be 
recommended in all cases .” Therefore, in the absence of a 
preferred disposal method, continued development of 
new and improved processes and practices for slurry 
disposal is critically needed in the coal preparation 
industry. 

Process Water Quality The overwhelming majority 
of cleaning processes used in coal preparation require 
large amounts of process water. Nearly all of the process 
water is supplied by thickening units, which settle out 
ultrafine suspended solids and recycle clarified water 
back into the plant. A small amount of fresh make-up 
water from an external source is usually required to 
satisfy the balance between moisture contents of solids 
entering and exiting with the plant. The clarification 
and recycling of process water provides an effective 
means of reducing fresh water demands and lowing 
environmental impacts. On the other hand, plant oper¬ 
ators are faced with the difficulty of avoiding the 
buildup of suspended solids and dissolved substances 
in the process water. The effects of these components 
on separating performance and plant maintenance 
(e.g., rusting and scaling) are not well understood. 
Deterioration of process water quality is known to 
reduce sizing efficiency, lower flotation recovery and 
increase magnetite losses [73]. Evidence also exists 
suggesting that dissolved ions also adversely impact 
the performance of dewatering processes [74]. Detailed 
studies are needed to better understand these prob¬ 
lems so that effective solutions can be identified and 
implemented before they impede preparation plant 
operations. 
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There are also growing concerns by the public that 
chemicals used in the process of coal may be harmful to 
the environment. The vast majority of coal is upgraded 
without being contacted with any chemical additives 
using density-based separation processes. On the other 
hand, finer coal particles (typically <0.2 mm) are 
processed using froth flotation circuits, which require 
small dosages of reagents known as collectors and 
frothers. Collectors consist of oily hydrocarbons, such 
as diesel fuel, kerosene and fuel oil, which are insoluble 
in water and coat fine coal particles. If required, dos¬ 
ages are typically less than 0.3-2 lbs of collector per ton 
of fine coal processed. Likewise, frothers are added to 
all flotation systems to promote the formation of small 
air bubbles and to create a stable froth. Frothers are 
typically various types of alcohol and/or polyglycol 
surfactants [75]. Addition rates for frothers are typi¬ 
cally less than 10-15 ppm in the process water. In 
addition, plants add water treatment chemicals into 
their thickening circuits to enhance settling rates and 
improve water clarity. These reagents commonly 
include various types of natural and synthetic coagu¬ 
lants and flocculants. Polymer flocculants, which are 
added in very small amounts, are diluted in strength to 
0.01-0.05% solutions with water to improve perfor¬ 
mance. These chemical are also widely employed in 
the purification of drinking water and food as well as 
in the treatment of sewage, paper-making, oil recovery, 
storm water run-off and many other types of industrial 
wastewater [76]. Finally, coal operations may occasion¬ 
ally use small amounts of dust suppressants or freeze 
inhibitors to address seasonal problems that may arise 
at the plant site. 

Several studies have been carried out in recent years 
to assess the ecotoxicological impacts of chemicals 
used by processing plants. For example, several studies 
have raised concerns that seepage from impound¬ 
ments containing mine influenced waters (e.g., resid¬ 
ual chemicals, particulates, acid drainage) may be 
impacting the population of fresh water muscles/ 
mollusks in tributaries around coal mining and 
processing facilities [77]. Many of these issues were 
addressed in a recent symposium hosted by the Nature 
Conservancy that addressed the coexistence of coal 
mining and healthy aquatic ecosystems [78]. Another 
recent study commissioned by the New Zealand Auck¬ 
land Regional Council (ARC) examined the effects of 


residual coagulants and flocculants on natural waters 
[79]. The study concluded that the negative impacts of 
these chemicals were “... low level and not likely to be 
significant in relation to other factors which govern the 
health of aquatic communities. The benefit of reduced 
sediment levels in discharges is considered to outweigh 
the risk of any low level impacts attributable to residual 
flocculants.” However, the study noted that improper 
application of these chemicals associated with misus- 
age or overdosing may create an environmental risk. 
Therefore, to avoid these concerns, the development of 
new technologies that eliminate and/or significantly 
reduce the additions of process reagents is desirable as 
a long-term solution to this potential problem. In the 
near-term, it is likely that chemical manufacturers will 
continue to develop low-risk “green” reagents for use in 
coal preparation facilities. Many plants that dispose of 
waste slurry via underground injection have success¬ 
fully switched to natural flotation collectors, such as 
blends of canola oil, vegetable oil, soybean oil, etc., as 
environmentally friendly replacements for petroleum- 
based hydrocarbon collectors [80]. 

Air Quality and Dust The US EPA promulgated 
standards of performance for air quality for all new 
and modified coal preparation plants under the 1976 
Clean Air Act. These New Source Performance Stan¬ 
dards (NSPS) address all types of particulate emissions, 
including fugitive dust, which may result from produc¬ 
ing, handling, transporting and storing coal. Opera¬ 
tions impacted by such regulation include crushers 
and breakers, sizing equipment, cleaning systems, ther¬ 
mal dryers, conveying systems, coal storage areas and 
coal transfer/loading systems that are directly part of 
the coal preparation facility. For some processes, such 
as thermal dryers, opacity less than 10% must be 
maintained [81]. Dusting problems are not uncom¬ 
mon around material handling transfer points where 
ultrafine particles have the opportunity to become 
airborne. These locations may include truck and railcar 
load-outs as well as conveyor and chute transfer points 
in and around the preparation plant. Many operations 
utilize water trucks and water sprays in combination 
with additions of various chemical and/or crusting 
agents to lower dusting problems. In addition, some 
operations make use of enclosures, such as silos and 
bins, to avoid exposure of coal to the environment. 
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The use of inflatable structures over coal storage piles 
have also been attempted with limited success [82]. 
Despite these efforts, fugitive dust still seems to be an 
issue for many plant sites, often leading to disputes 
with the public and creating concerns for worker 
health. Therefore, work is needed to develop better 
practices and control technologies for reducing dust 
emissions. The reconstitution of fines via agglomera¬ 
tion and briquetting technologies would be expected 
to substantially reduce dust emissions, but these pro¬ 
cesses are currently too expensive to be practical in the 
coal industry. 

Future Directions 

Dry Processing 

The low-sulfur coal reserves in the western states have 
become the most important supply of domestic fossil 
fuel in the USA during the past few decades. Histori¬ 
cally, the majority of coal mined in this region was of 
sufficient quality to not require any coal preparation 
except for simple crushing and sizing. More recently, 
however, increased levels of rock dilution have been 
noted for coals mined in this region, largely due to 
the mining of more challenging reserves and the use 
of larger mining equipment that are less selective [83]. 
This trend is pushing some coal producers to consider 
coal washing for the first time. In addition, new federal 
and state clean air quality requirements are pressuring 
utilities and coal companies to use precombustion 
cleaning as a means of reducing SO* and trace 
element emissions [84]. One example is the proposed 
Springerville power plant in New Mexico, which has 
been tentatively approved contingent upon the use of 
precombustion cleaning to improve the quality of their 
coal feedstock. 

There are many challenges in using precombustion 
cleaning to upgrade western coals. The processes tradi¬ 
tionally used to wash eastern coals cannot be readily 
adopted in the west since water resources are lacking 
and low rank coals often disintegrate in water. As 
a result, current industry efforts are focusing on the 
development of dry cleaning processes to upgrade these 
lower rank reserves. Dry coal separators, such as pneu¬ 
matic jigs and air tables, are already finding applica¬ 
tions at selected mine and utility sites [85-87]. Other 
developing technologies that may be applicable for this 


purpose include various types of electrostatic [88, 89] 
and magnetic separators [90, 91]. The continued devel¬ 
opment of these low-cost units is believed to be impor¬ 
tant for upgrading many of the western coal reserves 
where water is scarce. The development of automated 
sorters, which use optical, electromagnetic or X-ray 
detection, to identify and extract rock from coal, also 
show considerable promise for dry coal concentration 
[92]. Optical sorters are used for separating particles 
that are liberated at relatively large sizes (>10 mm). 
These devices are already being used for separating 
diamond, gold, uranium and sulfide ores as well as 
for separating plastic bottles in the recycling 
industry. Such devices are particularly useful for pre¬ 
concentration, which can increase throughput and 
result in energy savings for other types of solid-solid 
separators. Continued R8d3 is necessary to improve 
separation efficiency (particularly for finer sizes), 
increase reliability and lower costs for this new gener¬ 
ation of coal preparation technology. 

Conversion Processes 

Coal preparation activities have traditionally been 
limited to include only those processes that involve 
physical separations. These processes include unit 
operations for particle sizing, concentration of organic 
matter and dewatering/disposal of plant products. 
These processes are generally considered to be inher¬ 
ently benign since they do not alter the chemical 
structure of the individual particles contained in the 
coal. Conversion processes, which include carboniza¬ 
tion, gasification and liquefaction operations, are by 
this definition not considered to be part of the coal 
preparation industry. On the other hand, a new gen¬ 
eration of coal preparation technology is being 
developed and commercialized that bridges the gap 
between traditional coal cleaning and coal conversion 
processes. These gateway technologies have the poten¬ 
tial to reduce transportation costs and improve utili¬ 
zation properties for many of the low rank coals 
located in the Powder River basin. Under this 
expanded definition, conversion processes that are 
geared to the production of enhanced solid fuels 
may also be included under the coal preparation 
umbrella. Perhaps the best known of these 
processes include the K-Fuel, Encoal and SynCoal 
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processes [93-95]. Some of these solid fuel production 
facilities may also produce gaseous or liquid 
byproducts that are of value in the synthetic fuel 
market. The upgrading of low rank coals, which are 
abundant in the USA, is believed to be an attractive 
means of producing low-emission coals capable of 
meeting the 2010 CAIR standards. This approach is 
particularly attractive if the process simultaneously 
produces syncrude oil as a byproduct [96]. 

One of the most highly developed and well demon¬ 
strated technologies for upgrading low rank coal is the 
K-Fuel process. This technology is a decarboxylation 
process that uses heat and pressure to modify the struc¬ 
ture of subbituminous coals. By driving off moisture 
and oxygen, the process has been shown to be capable 
of reducing emissions of nitrous oxides by 10-20%, 
carbon dioxide by 8-12% and mercury by as much 
as 70% [97]. A commercial-size demonstration plant 
owned by Evergreen Energy is currently being operated 
in Gillette, Wyoming. Specific concerns associated with 
this technology include the high cost of thermal 
processing as well as problems related to disposal of 
process wastewater and spontaneous combustion of 
the treated products. 

Another widely discussed coal conversion technol¬ 
ogy is the Encoal Process developed by SMC Mining 
Company and SGI International. The technology uses 
a two-step thermal treatment process to produce an 
enhanced solid coal fuel (char) as well as some derived 
liquid fuel. In the first processing step, the low rank 
feed coal is heated until a completely dry solid is pro¬ 
duced. The temperature is then increased in a second 
processing step to promote decomposition and drive 
off gases via mild gasification. According to published 
reports [98], the Encoal process generates about one- 
half ton of solid fuel and one-half barrel of condensed 
liquids from each ton of feed coal supplied to the 
thermal reactor. The products, as alternatives to 
existing fuel sources, are capable of lowering sulfur 
emissions in coal-fired boilers nationwide [99]. The 
gaseous products that are not condensed into useful 
liquid are burned to supply thermal energy for the 
process. A 1,000 t/day demonstration plant was suc¬ 
cessfully operated between 1992 and 1998 under DOE 
sponsorship. However, concerns associated with this 
process include high treatment costs, excess fines 


production, dusting problems, wastewater generation 
and need for coal-char stabilization (to prevent spon¬ 
taneous combustion). 

Another intriguing conversion processes that cou¬ 
ples thermal upgrading with physical cleaning to 
improve the quality of low rank coals is the SynCoal 
technology. In this process, high moisture coal is 
processed through vibrating fluidized bed reactors 
in three sequential stages, two heating stages followed 
by an inert cooling stage. These reactors remove chem¬ 
ically bound water, carboxyl groups and volatile sulfur 
compounds. After thermal upgrading, the coal is put 
through a deep bed stratifier cleaning process to sepa¬ 
rate the pyrite rich ash from the coal. When fed 
a typical low rank western coal, the SynCoal process 
can provide a product with a heating value of up to 
12,000 BTU/lb with moisture and ash contents as low 
as 1% and 0.3%, respectively [100]. A demonstration 
plant (45 t/h) was successfully operated near Western 
Energy Company’s Rosebud coal mine near Colstrip, 
Montana. Although the plant closed in 2001, the facil¬ 
ity generated more than 2 million tons of products 
during its lifespan [101]. Similar to the other mild 
conversion technologies, concerns associated with this 
process include high treatment costs, dusting problems 
and product instability. 
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Glossary 

Coal-to-liquids (CTL) The conversion of coal to liq¬ 
uid fuels and/or chemicals 

Coprocessing (of coal) The simultaneous conversion 
of coal and waste carbonaceous feedstocks such as 
petroleum-based residual oil or tar, plastics, or 
rubbers via once-through direct liquefaction into 
liquid, solid, and gaseous hydrocarbonaceous 
materials intended primarily for use as fuel. 

Direct coal liquefaction (DCL) The conversion of coal 
to liquids via dissolution and/or hydroprocessing, 
without first gasifying the coal. 

DOE United States Department of Energy. 

Fischer-Tropsch reaction The catalytic conversion 
of synthesis gas to primarily hydrocarbons, the 
discovery being credited to Franz Fischer and 
Hans Tropsch. 

Gasification (of coal) The conversion of coal to pri¬ 
marily synthesis gas plus ash at elevated tempera¬ 
ture and pressure of oxygen and steam. 


Indirect coal liquefaction (ICL) The conversion of 
coal to liquids via the intermediacy of synthesis gas. 
Light distillate A distillation cut of low molecular 
weight and low boiling range, obtained from refin¬ 
ing of hydrocarbonaceous feedstocks, used to pro¬ 
duce liquefied petroleum gas (LPG), gasoline, and 
naphtha. 

Middle distillate A distillation cut of mid-range 
boiling point, obtained from refining hydrocar¬ 
bonaceous feedstocks, containing hydrocarbons 
ranging from C 5 through about C 2 o or C 22 . When 
further distilled, the portion of middle distillates 
containing C 5 through about C 15 is often referred 
to as naphtha, and the portion containing C 16 
through up to C 22 is referred to as diesel. The 
naphtha is often distilled further to produce gaso¬ 
line and kerosene/jet fuel, or can be used as feed for 
a naphtha cracker unit to make light olefins. (Less 
commonly, the gasoline cut is initially collected 
along with the light distillates). 

Pyrolysis (of coal) A mild gasification process wherein 
coal is heated in the absence of oxygen at sufficiently 
mildly elevated temperature as to produce a mixture 
of hydrocarbonaceous gases, liquids, and solids suit¬ 
able for use as chemical precursors or fuel. 

Synthesis gas (syngas) A mixture of primarily hydro¬ 
gen and carbon monoxide produced by gasification 
or reforming of hydrocarbonaceous materials, used 
to synthesize fuels or chemicals. 

Water gas shift reaction The chemically reversible, 
catalyzed conversion of water and carbon monox¬ 
ide to hydrogen and carbon dioxide. 

Definition of the Subject 

Like oil and natural gas, coal is a fossil resource and as 
such is not what first comes to mind when considering 
sustainable fuel sources. Coal contributes to sustain¬ 
ability as a complement to maintaining continuous 
availability of sufficient fuel to meet growing global 
demand [1-3]. 

As for other fossil sources, conversion of coal to 
transportation fuel is currently more affordable than 
conversion of renewable resources such as wind and 
solar, which are technically far away from availability at 
even a fraction of the scale required for sustainability 
over the coming decades. Given that proved coal 
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reserves in North America alone are currently 
estimated as capable of producing more than 
840,000,000,000 equivalent barrels of oil (the energy 
equivalent of 32 cubic miles of oil) [4], the abundance 
of coal is a boon to a sustained global liquid fuel supply 
if used wisely. 

Just as renewable fuel sources will be benchmarked 
against fossil sources in terms of economics, so also coal 
will be benchmarked against renewable fuel sources in 
terms of environmental sustainability. Today, modern 
coal-to-liquids (CTL) technology can produce synthetic 
fuels that burn more cleanly than conventionally pro¬ 
duced fuels. CTL technology can produce a virtually 
sulfur-free diesel fuel, much cleaner than conventional 
diesel - reducing smog and acid rain - and can also 
produce gasoline, jet fuel, or chemicals. 

Most coal used today is for production of power or 
for steelmaking. Coke plants use coal to produce high 
purity carbons for use in steel manufacture. Combus¬ 
tion of coal provides heat and power for industrial 
process operations. Production of liquid fuels and 
chemicals from coal are lower volume uses; these appli¬ 
cations will tend to increase in importance as the ease 
of oil recovery decreases and its price increases. 

Introduction 

Coal has been mined and used as a source of energy for 
centuries. The first recorded use of coal was in China, 
with indications that the Chinese began using coal for 
heating and smelting about 2,500 years ago, and most 
historians believe that coal was used in various parts of 
the world 3,000-4,000 years ago during the Bronze Age 
[5] . A system for mining coal was present in England by 
1180 [6]. Coal provided heat for homes and industry, 
fuel gas for town lighting, process gas for industry, coke 
for the iron industry, and fuel for steam locomotives 
and shipping during the first quarter of the twentieth 
century. Although electric power has supplanted coal 
gas for public lighting (town gas), the electric power 
industry remains significantly coal based today. 

Coal resources have been used to produce liquid 
transportation fuels by several process routes, collec¬ 
tively referred to as coal liquefaction or, more generally 
stated, as Coal to Liquids (CTL). Early records of coal 
conversion to liquid fuel date to 1913 and an extraction 
process developed by Friedrich Bergius in Germany [7] . 


Bergius treated a mixture of pulverized bituminous 
coal and coal-derived heavy oil with hydrogen, at high 
temperature and pressure, to give a liquid product 
consisting of heavy oils, middle distillates, gasoline, 
and gases. Various metal compounds have been used 
to catalyze the process. This type of CTL technology is 
called Direct Coal Liquefaction (DCL). 

DCL technology involves making a partially refined 
synthetic crude oil directly from coal, which is then 
further refined into synthetic gasoline and diesel as 
well as liquefied petroleum gas (LPG). Raw coal oil 
is highly aromatic and requires further upgrading. 
Heavy residues generated in the process, referred to as 
asphaltenes, also include ash from the coal and are 
either disposed of or gasified to produce part of the 
hydrogen required for the upgrading. The hydrocarbon 
fuel products of DCL are similar to hydrocarbon fuels 
derived from crude oil. A slate of partially refined 
gasoline-like and diesel-like products, as well as pro¬ 
pane and butane, are recovered from the synthetic 
crude oil mainly by distillation. Liquid yields in excess 
of 70% by weight of the dry, mineral matter-free coal 
feed have been demonstrated under favorable circum¬ 
stances. Overall thermal efficiencies (% calorific value 
of the input fuel converted to finished products) for 
modern processes are in the range of 60-70% if allow¬ 
ance is made for generating losses and other non-coal 
energy imports. These processes generally have been 
developed to process development unit (PDU) or pilot 
plant scale. However, no demonstration or commer¬ 
cial-scale plant had yet been built until very recently, 
primarily because petroleum fuels had been available at 
lower overall cost. 

Development of the DCL approach in the twentieth 
century proceeded with ambition and then reserve. 
DCL was first commercialized in the 1920s (the Pott- 
Broche/IG Farben Process). In the Second World War, 
Germany produced more than 4 million tons of fuels by 
DCL. Plants were also built during this time in the UK, 
France, and Japan. A Japanese-influenced pilot unit at 
Fushun Coal Mine was operated by the South Man¬ 
churian Railway Company from 1938 [8] until 1943 
and processed 20,000 t/year of coal. During this same 
period, the Korean Artificial Petroleum Company 
operated a 100 t/day plant at its Agochi factory [9]. 

DCL underwent a new phase of development in 
the 1970s in response to oil shocks. Major research 
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efforts in the USA during the 1970s and 1980s, in 
particular, involved work by Hydrocarbon Research 
Inc. (HRI, now Hydrocarbon Technologies Inc, HTI), 
cosponsored by the US Department of Energy (DOE) 
and including substantial R&D at a dedicated facility in 
Wilsonville, Alabama. 

Research on DCL in Japan began in 1974 with the 
Sunshine project and sponsorship by the New Energy 
and Industrial Technology Development Organization 
(NEDO) [ 10] . Early players included Sumitomo Metals 
and Hasaki, and Nippon Steel and Kimitsu. The pro¬ 
gram experienced delays in the next decade, presum¬ 
ably resulting from the drop in oil price. A 150 t/day 
pilot plant was built at Sumitomo Metal Industries 
Kashima Steelworks and operated during 1997 and 
1998, with a longest run of 1,921 h [11].' Research on 
DCL in Britain included the two-stage Liquid Solvent 
Extraction (LSE) and the Supercritical Gas Extraction 
(SGE) processes from British Coal. Research on DCL in 
Germany has included the single-stage Kohleol process, 
as improved with the Integrated Gross Oil Refining 
(IGOR+) development by Ruhrkohle AG and Veba Ol 
AG, as well as the single-stage Imhausen High-Pressure 
Process [12]. 

The main focus of DCL research and development 
continues to be making the process more economically 
competitive with crude oil refining by decreasing the 
cost of production, particularly the capital cost of the 
plant, and the related challenge of finding more effi¬ 
cient ways to deal with tar and solid by-products of the 
liquefaction. 

In addition to the price of oil, increasingly stringent 
environmental limitations on the content of aromatic 
compounds and sulfur in motor fuels have presented 
challenge. Fuels generated by DCL are rich in high- 
octane aromatics, which is an asset to engine perfor¬ 
mance but also an environmental issue. Coal-based 
processes also require desulfurization. The challenge 
of altering the product slate to meet environmental 
regulations is compounded by the resultant loss of 
benefits imparted by aromatics: in facilitating the 
liquefaction chemistry itself and then in promoting 
engine performance when using the final fuel product. 
Desulfurization presents its own performance chal¬ 
lenges, because sulfur functions as an engine lubricant. 

In the 1920s, Franz Fischer and Hans Tropsch of the 
Kaiser Wilhelm Institute developed a catalyst to 


convert hydrogen and carbon monoxide obtained 
from gasification of coal to hydrocarbon liquids. 
Because the coal is first gasified before conversion to 
liquid fuels, the Fischer-Tropsch (F-T) process for CTL 
is known as a type of Indirect Coal Liquefaction (ICL). 

Indirect coal liquefaction (ICL) is a completely dif¬ 
ferent approach from DCL for providing liquid fuels 
from coal. ICL processes first convert coal to a gaseous 
intermediate, which is then converted to liquid fuel. 
Coal gasification technologies rely on incomplete com¬ 
bustion, wherein coal is heated with a quantity of air 
that is insufficient to allow complete combustion (for¬ 
mation of carbon dioxide and water). The products are 
condensables and tars, flammable gases, and a solid fuel 
(or waste). Most coal-derived chemicals are made from 
the tars, although synthesis gas from coal gasification 
also can be used for the production of chemicals such 
as BTX (benzene, toluene, and xylenes), hydrogen, 
sulfur, methanol, and ammonia. Coal-derived fuels 
are obtained from the condensables, which are distilled 
and in some cases hydrotreated to give gasoline, diesel, 
and fuel oil cuts. 

Of particular utility is the gasification of coal to 
provide the process intermediate synthesis gas, or syn¬ 
gas, a mixture of primarily CO (carbon monoxide) and 
H 2 (hydrogen). The ratio of H 2 /CO varies with the 
source within the range of 0.7-1 for coal. Syngas can 
be used to generate liquid fuels via the aforementioned 
Fischer-Tropsch (F-T) technology, which provides 
hydrocarbon fuels that resemble crude oil derived 
products. The F-T processes produce synthetic middle 
distillates that can be used either directly as diesel or in 
blends with petroleum-derived diesel. 

The CTL Fischer-Tropsch process was first taken 
forward commercially in the 1930s by the German 
company Ruhrchemie, in conjunction with other part¬ 
ners including Lurgi, to produce gasoline and oil. By 

1938 there were nine German plants in operation hav¬ 
ing a total capacity of 660,000 t/year [13]. Individual 
reactors yielded about 5,000 L of gasoline/day, with 100 
such units used in the first German F-T plants. How¬ 
ever, the Bergius process DCL plants were chosen in 

1939 when Germany expanded production during 
World War II, because these were further developed 
and could be more readily scaled to larger size, eventu¬ 
ally processing up to 350 t of coal and yielding 
250,000 L of gasoline/day. 
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By the mid-1940s, natural gas and oil production 
had become more developed and cost-competitive with 
coal, and technology for production of synthetic trans¬ 
portation fuels was not considered economic after 
the Second World War. Since then, research on CTL 
processes has waxed and waned largely in response 
to surges and ebbs in the price of oil. However, 
a perception by some at the time that reserves of crude 
oil were limited led to continued low-level development 
of CTL. Sasol purchased rights from the German com¬ 
panies and developed its own CTL process. Construction 
began in 1950 and an F-T plant came on line in 
Sasolburg, South Africa during the mid-1950s using 
Lurgi coal gasifiers and Sasol Synthol reactors to produce 
gasoline and diesel fuels as well as chemical feedstocks 
from coal. Even so, in the late 1950s and 1960s the huge 
oil fields of the Middle East dominated fuel supply 
chains to the point that further development of CTL 
technology worldwide essentially disappeared. 

Shocks in the price of oil during the 1970s 
prompted renewed efforts; particularly in South Africa, 
the USA, Japan, and Britain; to promote an alternative 
to crude oil: a synthetic crude (syncrude) that would be 
cost-competitive with oil for refining to transportation 
fuels. Trade embargoes lasting until the mid-1980s 
effected large-scale application of F-T ICL in South 
Africa, at that point up to 60% of the country’s trans¬ 
portation fuel requirements being met by coal. With 
the benchmark price of oil at US $30 per barrel, Sasol 
brought two F-T ICL plants online, in 1980 and 1983, 
with the total capacity of the company’s three plants at 
6,000 t/year [14]. 

As a result of a drop in the price of crude oil, 
worldwide developments in indirect coal liquefaction 
were largely put on hold after the 1980s, with the 
exception of those in South Africa where Sasol has 
been using its F-T Advanced Synthol (SAS) reactors 
since 1989. The ICL approach underwent an R8d3 
resurgence in the 1990s, then in response to an increase 
in the price of oil [ 15] . All of these early plants utilized 
high-temperature Fischer-Tropsch (HTFT) technol¬ 
ogy with an iron-based catalyst. In 1993, Sasol success¬ 
fully proved its low temperature Fischer-Tropsch 
(LTFT) technology with an iron-based catalyst to con¬ 
vert coal-derived syngas to liquid products. The plant is 
a relatively small scale, 2,500 bpd commercial facility 
that was recently converted to use natural gas as feed. 


Sasol’s newest generation of LTFT utilizes high perfor¬ 
mance cobalt based catalysts in the Sasol Slurry Phase 
Distillate (SSPD) process [14], now used in Qatar to 
convert natural gas to middle distillates. 

Currently, the best option for using coal to produce 
a high cetane number diesel fuel virtually free of aro¬ 
matics and of S and N compounds is the F-T process 
coupled with the applicable downstream work-up of 
the crude F-T product [16]. Gasification of coal pro¬ 
vides syngas, which is converted to a range of hydro¬ 
carbons. Alternatively, the syngas can be converted to 
liquid specialty [17, 18] or commodity chemicals 
including methanol or dimethylether (DME), either 
or both of which also can be used as automotive fuels. 
Methanol can be used alone, or preferably as a bend 
with conventional gasoline, in a spark ignition internal 
combustion engine (but not in a diesel engine, because 
methanol’s cetane number is too low). A separate fuel 
distribution network for methanol also would be 
required as well as consideration of toxicity (and 
preventing people from drinking it as a replacement 
for ethanol). In contrast to methanol, DME can be used 
as a fuel or fuel additive for automotive transportation 
in a diesel engine. 

DME has an inherent combination of both oxygen¬ 
ate functionality, for antiknock characteristics, and 
saturated hydrocarbon functionality, for fuel energy 
content. DME also imparts improved cold-start 
characteristics as an additive to methanol fuel and has 
almost twice the fuel content of methanol on a volume 
basis. DME can be used as a utility fuel or can serve as 
a blending component or replacement fuel for diesel 
engines. DME is also an intermediate in production of 
other fuel, as in ExxonMobil’s methane-to-gasoline 
(MTG) process [19]. DME is a potential feedstock for 
novel oxygenated fuels [20] and is an alternative source 
of lower olefins such as propylene [21]. Using low CO 
content syngas, as derived from coal, a one-step DME 
process is more efficient and achieves higher syngas 
conversion than using two or three separate unit oper¬ 
ations. DME can be produced with methanol in 
a dedicated plant, or it can be used as the fuel’s pro¬ 
duction arm of a cogeneration plant to produce power 
and clean fuels (including even cooking fuel [22]) from 
coal [23]. 

ChemSystems Inc., the University of Ohio at Akron 
and the University of Missouri-Columbia, and Air 


c 




2226 


c 


Coal to Liquids Technologies 


Products and Chemicals Inc. (APCI) all performed 
primary early research toward one-step conversion of 
syngas to methanol in the liquid phase. This develop¬ 
ment and demonstration work in the USA was 
performed predominantly in the 1980s and 1990s. Air 
Products’ work with the Alternative Fuels Program of 
the US DOE additionally focused on dual catalysts for 
one-step DME production from coal-derived syngas 
and led to demonstration of processes for conversion 
of coal-derived syngas to fuels and chemicals, including 
demonstration at the DOE’s Alternative Fuels Devel¬ 
opment Unit (AFDU) at LaPorte, Texas [24]. 

In Japan, work on DME was intensified in the 1990s 
by NKK Corporation [25]. NKK first researched the 
one-step slurry DME synthesis at bench scale and then 
teamed with the Center for Coal Utilization, Taiheiyo 
Coal Mining, and Sumitomo Metal Industries to pilot 
a 5 t/day DME plant with funding from METI (the 
Ministry of Economy, Trade, and Industry). The long- 
range plan at that time was described as producing DME 
at the site of coal mining and then transporting it to user 
sites. In terms of fuel value per unit weight, this would be 
much more efficient than transporting coal itself. 

Substantial research and development efforts have 
been devoted to optimizing product yields and process 
efficiencies in conversion of syngas to fuels and 
chemicals. Mechanistic and kinetic studies have been 
conducted to interpret the fundamentals of specific 
conversion processes. Methods and hardware to impart 
temperature control and stability in conversion reac¬ 
tors are under continuous development because of the 
large excess heat of reaction. In the case of syngas 
conversions, this research has included discovery of 
catalytic systems having optimized formulations 
containing metals with high activity in combination 
with cocatalysts or additives that improve activity and 
selectivity for the desired type of product slate. Reactor 
design and engineering have proceeded in tandem with 
the catalyst developments in syngas conversions. 

Survey of Specific Coal Liquefaction Technologies 

A primary requirement of all coal liquefaction pro¬ 
cesses is that the fuel value inherent in the coal feed¬ 
stock must be increased to be more similar to that of 
conventional liquid fuels. The ratio of elemental hydro¬ 
gen to carbon, H/C, or even just simply the content of 


hydrogen itself, is important to determining the fit 
between feedstock and process options. Coal has 
a hydrogen content of about 5% whereas that of refined 
petroleum-based transportation fuel is in the range of 
12-15%. Fuel derived from petroleum has an average 
H/C ratio of about 2 whereas that of coal itself ranges 
from about 0.7-1. 

Table 1 shows the full range of process options that 
have been studied for converting coal to liquids [26]. 
These approaches include pyrolysis, direct liquefaction, 
coprocessing, and indirect liquefaction. 

Pyrolysis 

In pyrolysis processes, the H/C ratio of coal is raised by 
rejecting carbon. The liquid products have little value, 
but pyrolysis is used on a massive scale to turn coal into 
coke for metallurgy, especially steelmaking. The coking 
process involves heating coal in closed vessels to very 
high temperatures (up to 2,000°C) in the absence of 
oxygen. Without oxygen present to cause combustion, 
at these high temperatures the coal is broken down into 
lighter molecular weight substances that leave the vessel 
in the gaseous state. A hard, porous residue containing 
mostly carbon and inorganic ash is left behind. The 
amount of volatiles is typically 25-30 wt% of the coal. 
Pyrolysis is also used to make chemicals. 

In coking, the coal beds are deep and the temperatures 
are kept very high to promote resolidification of the 
decomposing coal. When making chemicals, the coal 
beds are thinner and temperatures are lower than for 
a more complete gasification near the melting or the 
decomposition temperature of coal to promote produc¬ 
tion of liquids and gases. In both of these processes, 
fuel gas containing carbon and hydrogen is formed in 
significant quantities and high quality as measured by 
the gas heating value (BTU/scf). Where possible, this gas 
has been used for process heat but any excess gas also 
could be sold. 

Chemical products can be obtained from high- 
temperature coking by-products, synthesis gas, and 
coal liquids. The by-products of the high-temperature 
coking process are fuel gas, crude tar, and light oils such 
as benzene, toluene, and xylene (BTX), naphtha, and 
ammonia. Most coal- derived chemicals today are 
obtained from the processing of tars. Coke manufac¬ 
turers burn crude coal tar in coke ovens, refine it 
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to tar-based products, or sell it to coal tar distillers. Tar 
derived products include ammonium sulfate, sulfuric 
acid, phenols, cresols, BTX, naphthalene, pyridine, 
phenanthrene, anthracene, creosote, road tars, roofing 
pitches, and pipeline enamels. Today, synthesis gas 
from coal gasification also can be used for the produc¬ 
tion of petrochemicals such as BTX, sulfur, methanol, 
and ammonia as well as hydrogen. 

The Liquids from Coal (LFC™) process uses mild 
coal gasification to upgrade low-rank coals to two fuels: 
a stable, low-sulfur, high-BTU solid fuel similar in 
composition and handling properties to bituminous 
coal (referred to as process-derived fuel, PDFTM) and 
a low-sulfur industrial fuel oil (referred to as coal- 
derived liquid, or CDL™) [27, 28]. The process was 
originally developed by SGI International and 
then taken further by the ENCOAL Corporation. 
Development was about 50% financially supported by 
$90 million from the US DOE. 

A PDU was constructed in 1986 and then a 1,000 t/day 
demonstration unit was constructed and operated for 
over 4 years near Gillette, WY in the mid-1990s. Later 
configurations included a vibrating fluidized bed (VFB) 
as part of a deactivation loop for solid product, where 
quenched solids are partially fluidized and exposed to 
oxidative deactivation to reduce the tendency for sponta¬ 
neous ignition. The residence time, oxygen content, and 
temperature of the gas stream are selected to deactivate 
the coal within the VFB unit. Hydration further stabilizes 
the solid PDF product, and after treatment in the VFB 
system the solids are cooled and a controlled amount 
of water is added to rehydrate the PDF to near its equi¬ 
librium moisture content. The CDL portion of the prod¬ 
uct is condensed. The technology appears to be moving 
forward, with the transformation of ENCOAL into the 
Saskatchewan-based company NuCoal, which has memos 
of understanding with several potential partners includ¬ 
ing Sinopec. 

Direct Liquefaction 

In DCL, the H/C ratio of coal is raised by adding hydro¬ 
gen. The objective of hydrogenation is to maximize the 
yield of distillate fractions that can be subsequently 
converted into fuels. Distillate materials comprise the 
naphtha fraction, typically boiling C 5 - 215°C, plus the 
middle distillate fraction typically boiling between 
150°C and 370°C, with the exact cuts depending on 


product specifications. The liquid products are then 
refined into diesel, gasoline, and other petroleum 
products. 

The traditional mechanistic view has been that upon 
heating, inherently weak bonds in coal are broken, and 
that the resulting and then hydrogen is rapidly trans¬ 
ferred from the solvent to cap the resultant free radicals. 
Rapid hydrogenation is indeed critical to liquid yield; 
preventing retrogressive reactions leading to formation 
of solid char. While this picture is largely true, in-depth 
research showed that it is insufficient. Concurrent with 
this pathway are solvent-mediated hydrogenolyses of 
strong C-C bonds - bonds too strong undergo simple 
thermolysis under reaction conditions - engendered 
by addition of a free hydrogen atom or a bimolecular 
H-transfer from solvent species and their radicals [29]. 
Inclusion of these pathways helps rationalize many 
otherwise inexplicable phenomena in coal liquefaction, 
and also helps identify conditions that maximize utili¬ 
zation of hydrogen for liquid production rather than 
produce the less desirable light gases. 

Hydrogen is also critical in the DCL process to 
reduce oxygen, sulfur, and nitrogen in the coal feed¬ 
stock. These elements are removed from the crude 
liquid product as water (H 2 0), hydrogen sulfide 
(H 2 S), and ammonia (NH 3 ), respectively. Oxygenates 
are reduced to hydrocarbons, because their oxygen 
content would otherwise lower the fuel value. The 
nitrogen and sulfur compounds are removed because 
they would otherwise poison the cracking catalysts in 
refining operations downstream of the DCL unit. DCL 
products are only partially refined at the DCL plant. 
They must be further refined into finished liquid fuel 
products at conventional refineries, where additional 
H 2 is added to bring the H/C ratio up to about 2 for the 
fuel products. 

Many factors affect the rate and extent of coal 
liquefaction, including temperature, hydrogen partial 
pressure, residence time, coal type and properties, sol¬ 
vent properties, solvent to coal ratio, ash composition, 
and catalyst. In general, liquefaction is promoted by 
increasing the temperature, hydrogen partial pressure, 
and residence time. However, if the temperature is too 
high, gas yield is increased and coking can occur. Sol¬ 
vent to coal ratio is important. If the ratio is too low, 
there will be insufficient hydrogen transfer activity. If 
the ratio is high, a larger reactor will be necessary to 
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provide the required residence time. Typical operating 
conditions are listed in Table 2. 

For the most highly developed processes, coal con¬ 
version can be as high as 90% on a mineral ash free 
basis, with a C 5+ distillate yield up to 75% and hydro¬ 
gen consumption of 5-7% wt. When an external cata¬ 
lyst is used, it is typically some combination of cobalt, 
nickel, and molybdenum on a solid acid support such 
as silica-alumina [30]. In slurry hydrogenation pro¬ 
cesses, catalyst lifetime is fairly short because of the 
large number of potential poisons present in the 
system. 

DCL process concepts have been based on a variety of 
operating conditions and catalysts and can be performed 
in a single stage or two stages. In a typical single-stage 
process, coal is contacted with hydrogen or with hydro¬ 
genated solvent at ca. 450° C. The yield of fuel oil is about 
3 barrels of oil/t of coal with bituminous coals. The light 
oil fraction is recycled. A high yield of light hydrocarbons 
is obtained and the efficiency of hydrogen utilization is 
low. The product is difficult to refine because of its high 
aromaticity and nitrogen content. Single stage liquefac¬ 
tion is not as effective for subbituminous coals as for 
higher rank coal. 

Most current DCL processes have settled on 
a multistage approach. Multistage DCL typically uses 
dispersed catalysts in the liquefaction stage followed by 
use of supported catalysts in subsequent upgrading 
stages, with different pressures and temperatures for 
the different stages. The technologies were demon¬ 
strated in several laboratory and pilot plants and the 
products were more easily refined than many crude 
oils. There are many variations of two-stage DCL 


Coal to Liquids Technologies. Table 2 Typical operating 
conditions for direct coal liquefaction 


Operating parameter 

Typical operating 
conditions 

Temperature (°F) 

750-850 

Pressure (psia) 

1,500-3,000 

Residence time (h) 

0.5 

Solvent to coal ratio 
(Ib/lb) 

1.5/2 


processes, including use of catalyst in the two stages 
(both stages being catalyzed, versus thermal and cata¬ 
lyzed stages), distillation between stages, whether solids 
are separated between the two stages or only after the 
second stage, recycle of ashy bottoms, and the operat¬ 
ing hydrogen balance. 

A block flow diagram (BFD) for a typical direct coal 
liquefaction plant using coal slurry hydrogenation is 
shown below in Fig. 1. Coal is ground and slurried with 
a process-derived solvent, mixed with a hydrogen-rich 
gas stream, preheated, and sent to a two-stage liquefac¬ 
tion reactor system where the organic fraction of the 
coal dissolves in the solvent and the dissolved frag¬ 
ments react with hydrogen to form liquid and gaseous 
products. Sulfur in the coal is converted to hydrogen 
sulfide, nitrogen to ammonia, and oxygen to water. The 
reactor products go to vapor/liquid separation. The gas 
is cleaned, and after removal of a purge stream to 
prevent buildup of inert materials, the gas is mixed 
with fresh hydrogen and recycled. The liquid is sent to 
vacuum fractionation for recovery of distillates. Heavy 
gasoil is recycled as process solvent and vacuum 
bottoms are gasified for hydrogen production. 

Ash from the gasifier is sent to disposal. Heavy 
direct liquefaction products contain polynuclear aro¬ 
matics that are carcinogenic. This problem can be 
solved, however, by recycling to extinction all material 
boiling above the desired product end point. A recent 
development, shown in Fig. 1, is to put the ash residue 
and carbon from recycled solvent directly through the 
gasifier to produce hydrogen (and power). 

The longest experience base with DCL in the USA 
was a PDU at the R8cD facility of Hydrocarbon Tech¬ 
nology Inc. (HTI) that consumed 3 metric tons of coal/ 
day. Among the many liquefaction processes developed 
and tested at pilot plant scale have been Kohleol (by 
Ruhrkohle, Germany); Exxon Donor Solvent (by 
Exxon, now ExxonMobil, USA); Solvent Refined 
Coal-I (SRC-I) and SRC-II (by Gulf Oil, USA.); 
H-Coal (by HRI, now HTI, and DOE, USA); Catalytic 
Two-Stage Liquefaction (CTSL, by HRI and DOE, 
USA); Liquid Solvent Extraction (LSE, by British Coal 
Corporation, UK); Brown Coal Liquefaction (BCL, by 
NEDO, Japan) and NEDOL (by NEDO, Japan). Only 
the CTSL, LSE, BCL, and NEDOL processes continued 
in development at pilot plant scale beyond the late 
1980s. Between 1975 and 2000, the USA alone invested 
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Coal to Liquids Technologies. Figure 1 

Block flow diagram of a typical direct coal liquefaction process [31] 


$3.6 billion in direct liquefaction technologies; some 
are now licensed to China [31]. 

H-Coal was an early single-stage DCL process 
piloted by HRI and Ashland Synthetic Fuels, Inc. in 
a 600 t/day pilot plant adjacent to Ashland’s refinery in 
Cattlesburg, Kentucky [32]. The process consists of 
slurry preparation followed by catalytic hydrogena¬ 
tion/hydrocracking at a single temperature and pres¬ 
sure, 450°C and 2,200 psia respectively, in an ebullating 
bed reactor. The ebullating bed is a backmixed three- 
phase isothermal reactor that keeps catalyst in constant 
motion. It uses the upward flow of liquid to expand the 
catalyst bed and to distribute liquid, gas, and catalyst 
evenly across the reactor. The reactor has the charac¬ 
teristics of a uniform temperature similar to continu¬ 
ous stirred tank reactor. 

The H-Coal process was a starting basis for subse¬ 
quent process development sponsored by the US 
DOE over about 3 decades time. A principal focus of 
this long-term project was R&D of catalysts that were 
tolerant of, and would not be rapidly deactivated by, 
coal-derived mineral matter in the reactor. Typical 


early catalysts showed rapid deactivation because of 
coking and metals laydown with loss of surface area. 
A later process version sought to address this deactiva¬ 
tion problem by using two coupled reactor stages, one 
operating at 400° C and the other operating at 420- 
425°C. The catalyst was also changed from cobalt/ 
molybdenum/alumina to nickel/molybdenum on 
bimodal alumina. This Catalytic Two-Stage Liquefac¬ 
tion (CTSL) process is shown in a block flow diagram 
in Fig. 2. The role of catalyst in the first stage of the 
CTSL process is to promote hydrogenation of the sol¬ 
vent, stabilization of the primary liquefaction products, 
and hydrogenation of the primary and recycle resid. In 
the second stage, the catalyst promotes heteroatom 
removal and thus product quality improvement, con¬ 
version of resid to distillate, and secondary conversion 
to lighter products. The catalyst also aids in avoiding 
dehydrogenation. 

In the first ebullating bed reactor, coal is dissolved 
with minimal catalysis at about 400° C for bituminous 
coal (or 435-460°C for subbituminous coal) and 
170 bar. The catalyst for this first reaction is 
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Coal to Liquids Technologies. Figure 2 

Process flow schematic of two-stage CTSL direct liquefaction process [33] 


typically nickel-molybdenum on alumina and aids in 
re-hydrogenating the recycled process solvent. The 
resultant heavy coal liquids are fed to a second 
ebullating bed reactor and hydrogenated at about 
440° C and 170 bar with a catalyst such as that used in 
the first reactor, or a high-activity iron catalyst. After 
separation and depressurization, the product of the 
second reactor is sent to a distillation column at atmo¬ 
spheric pressure, where distillate up to 400° C is 
removed, with yield from coal of up to 65%. The 
distillation residue is unconverted coal and asphalt, 
which are typically gasified for captive fuel value or 
feed hydrogen. 

The CTSL process was further developed over 
almost 15 years in a pilot plant at the Wilsonville 
liquefaction test facility. Beginning in 1973, a 6 t/day 
coal liquefaction plant was built by the Edison Electric 
Institute (EEI) and Southern Company Services, and 
the US DOE became the primary sponsor in 1976. 
Amoco Oil Co. (now BP) joined the project in 1984. 

Developments at the Wilsonville Facility in the USA 

Because of the role of Wilsonville in consistently dem¬ 
onstrating success or failure of a large number of con¬ 
cepts, particular attention has been paid to the results 


from this facility. The plant began operation in 1974 in 
a single-stage mode referred to as Solvent Refined Coal 
(SRC-II, which made liquid fuel as opposed to SRC-I, 
which made a solid fuel from coal) but evolved to 
a two-stage operation utilizing two ebullating bed cat¬ 
alytic reactors. Initial efforts in two-stage liquefaction 
focused on catalytic upgrading of the thermal products, 
referred to as Nonintegrated Two-Stage Liquefaction 
(NTSL). This configuration was termed nonintegrated 
because the coal-derived resid hydrocracking step did 
not interact with the primary thermal part of the plant. 
The NTSL concept proved excessively inefficient 
because of the high hydrogen consumption associated 
with the thermal part of the operation. 

Coal throughput, expressed as space velocity per 
unit reactor volume, was substantially improved in 
going from NTSL to Integrated Two-Stage Liquefaction 
(ITSL) mode. In ITSL, a short contact time thermal 
reactor is closely coupled to an ebullating bed catalytic 
reactor where hydrocracking occurs. The thermal resid 
produced in the ITSL at short contact times is more 
reactive toward expanded-bed hydrocracking, which 
permits operation of the hydrocracker at lower tem¬ 
perature and minimizes gas make. Minimizing forma¬ 
tion of light hydrocarbon gases in turn would be 
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expected to reduce the rate of catalyst coking, improve 
hydrogen utilization efficiency, and increase selectivity 
for higher distillate. A 35% increased yield from C 5+ 
distillation was observed. 

From 1985 to 1992, work at Wilsonville focused on 
development of the CTSL, utilizing ebullating bed cat¬ 
alytic reactors in both stages. The ebullating bed allows 
use of the substantial heat of reaction to preheat coal 
slurry feeds, and this improves process thermal effi¬ 
ciency. The converted and refined products are sepa¬ 
rated at high pressure and temperature into lighter oils 
and gases, and heavy oils and solids. Following pressure 
letdown and heat exchange, the lower boiling oils 
are fractionated while the heavier oils are separated 
from the unconverted coal and ash and recycled as 
the slurry solvent for the coal. Initial work indicated 
that distillate yields as high as 78% could be obtained 
by operating the first stage at low severity (399°C) and 
by using a large pore bimodal Ni-Mo catalyst. Signifi¬ 
cant improvement in distillate production can be 
achieved but again at the cost of increased hydrogen 
consumption. 

Also explored were Reconfigured Integrated Two- 
Stage Liquefaction (RITSL), where solvent de-ashing 
was practiced after the hydrocracking stem, and Close- 
Coupled Integrated Two-Stage Liquefaction (CC- 
ITSL), where two reactors (thermal/catalytic) were 
linked directly without any intervening processing 
steps. Incremental improvements in distillate yield 
and selectivity were realized by changing the process 
configuration, but at the expense of increased hydrogen 
consumption. 

The Wilsonville facility was closed in 1992 and 
was decommissioned shortly thereafter, but develop¬ 
ment work funded by the DOE continued at the 
Lawrenceville, N.J. facility [34-36]. The process has 
now evolved into a generic composite of much of the 
liquefaction development work funded by the DOE in 
the 1980s and 1990s. Most recently, a close-coupled 
configuration has been adopted in which both stages 
use an active supported catalyst, and HTI has included 
an in-line hydrotreater after the second-stage reactor to 
improve product quality. In 1997, with support from 
the DOE, HTI signed a 2-year agreement with the 
Chinese Coal Research Institute (CCRI) to carry out 
a feasibility study for a direct coal liquefaction plant 
using Shenhua coal [37]. 


The Kerr-McGee critical solvent de-ashing (CSD) 
technology (also termed the Residual Oil Solvent 
Extraction, or ROSE, process) was also tested at 
Wilsonville, resulting in the development of alternative 
methods for solids removal from primary liquefaction 
products. Other DCL process developments in the USA 
have included the single-stage Exxon Donor Solvent 
(EDS) process, and two-stage processes have included 
Lummus’s Integrated Two-Stage Liquefaction (ITSL), 
Kerr-McGee’s ITSL, Chevron Coal Liquefaction 
(CCLP), Consol Synthetic Fuel (CSF), and Amoco’s 
(now BP’s) Close-Coupled Two-Stage Liquefaction 
(CC-TSL) process. 

In 1996, Sandia National Labs published results of 
its study on the efficacy of hydrogenation for removing 
nitrogen and sulfur from the CTSL crude product 
[38]. The kerosene fraction from HTI’s first proof- 
of-concept run was used and denitrogenation and 
desulfurization were achieved under relatively mild 
conditions. Processing at the highest severity condition 
(388°C, 103 psig H 2 , 1.5 g/h/g-cat) decreased nitrogen 
from 645 ppm in the feed to 7.9 ppm in the 
hydrotreated product. Sulfur content of 239 ppm in 
the feed was decreased by hydrotreating to less than 
20 ppm. 

In the early the 1990s, HTI was engaged in DOE- 
sponsored research on coprocessing of coal and plastics 
waste using DCL. 

DCL Developments in Europe The Kohleol DCL 
process [39] is based on technology used at commercial 
scale in Germany, until after the Second World War, 
and then improved with the advent of the Integrated 
Gross Oil Refining (IGOR + ) process [40]. Coal is slur¬ 
ried with a process-derived recycle solvent and a “red 
mud” disposable iron catalyst. The pressurized and 
preheated slurry is sent to an up-flow tubular reactor 
operated at about 470°C and 300 bar H 2 . Products 
from the top of the reactor pass to a hot separator, the 
overheads from which are hydrotreated at 350-420°C 
in a fixed-bed reactor maintained at the same pressure 
as the main reactor. The hydrotreated products are 
depressurized and cooled in two stages. The liquid 
product from the first stage is recycled for use as solvent 
in the slurry step. The liquid product from the second 
stage is routed to an atmospheric distillation column 
to give light oil (C 5 - 200°C bp) and a medium oil 
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(200-325°C bp) product. The bottoms from the orig¬ 
inal hot separator pass to a vacuum distillation column 
to recover distillable liquids. These are added to the 
hydrotreating reactor feed and are subsequently largely 
recycled as solvent. The vacuum column bottoms con¬ 
sist of pitch, mineral matter, unreacted coal, and cata¬ 
lyst, which in commercial operation would be used as 
a gasifier feedstock for hydrogen production. Greater 
than 90% conversion can be obtained with bituminous 
coals, with liquid yields in the range 50-60% on a dry 
ash-free coal basis. The product distribution is about 
19% C!-C 4 hydrocarbons, 25% light oil (C 5 through 
200°C bp), 33% medium oil (200-325°C bp) and 22 % 
unreacted coal and pitch. 

The LSE process was developed more fully than 
other European DCL processes, by British Coal Corpo¬ 
ration between 1973 and 1995 [12]. In the LSE process, 
coal is slurried with a process-derived recycle solvent, 
preheated, and passed to a noncatalytic digestion step, 
which consists of two or more continuous-stirred 
tank reactors. These reactors operate in series at 
a temperature of 410-440° C and 10-20 bar. The sol¬ 
vent acts as a hydrogen donor, transferring up to 2 % by 
weight of hydrogen to the coal. The digester product is 
partially cooled, filtered to remove unreacted coal and 
ash, and dried. The filtered coal extract is distilled to 
recover the light oil wash solvent and is then preheated, 
mixed with H 2 , and sent to one or more ebullating bed 
reactors in series. There is no interstage separation and 
the reactors operate at 400-440°C and about 200 bar. 
The cooled, depressurized reactor products are distilled 
with cut-point adjusted to maintain solvent balance, 
resulting in a product that typically boils below 300° C. 
The column bottoms are routed partially to a vacuum 
distillation column, used to control the level of pitch in 
the recycle solvent. The overheads from this column are 
recombined with the main atmospheric column bot¬ 
toms stream and recycled as the solvent to the slurrying 
step. The product distribution is about 15% Q-C 4 
hydrocarbons, 50% light oil (C 5 through 300°C bp), 
50% solvent surplus (300-450°C bp), 24% filter cake 
organics, and less than 1% pitch. A 2.5 t/day pilot plant 
was built and operated for 4 years at Point of Ayr, North 
Wales but has since been decommissioned. 

DCL Developments in Japan The Sunshine project 
pursued in Japan examined three approaches to direct 


liquefaction of bituminous coal: solvolysis, solvent 
extraction, and direct hydrogenation of coal. From 
these studies, NEDO found that increasing catalyst 
activity correlated with increased liquid yield; that 
hydrogenation was most effective for liquefaction at 
pressures in the 150-250 atm range and temperatures 
in the 400-450°C range; and that increased solvent 
viscosity aided production of light oil as fuel. The 
approaches were consolidated over a time span of 
18 years, with demonstration of the resultant NEDOL 
process completed in 2000 . The process uses 170 atm 
H 2 in the liquefaction reactor at 450°C. A process flow 
diagram of the NEDOL process is shown in Fig. 3. [11]. 

Like H-Coal, the NEDOL process is single-stage, 
meaning that all liquefaction reactors are operated at 
a single temperature and pressure. The NEDOL process 
has been further developed, however, and is said to 
have low severity conditions and high distillate yield 
comparable to those of the CC-ITSL and IGOR + 
processes. 

NEDO also developed the BCL process, in the early 
1980s, to use brown coal (lower rank, wetter coal), to 
the scale of 50 t/day in a pilot-plant constructed at 
Morwell in Victoria, Australia [41]. It was operated 
during 1985-1990, processing a total of about 
60,000 t of coal. Operations ceased in October 1990. 
In the 1990s, NEDO worked with Indonesia on devel¬ 
oping technology for use with that country’s brown 
coal. Also in Japan, Mitsubishi Heavy Industries 
(MHI) researched a two-stage DCL process referred 
to as Mitsubishi Solvolysis. 

The development of DCL is depicted from 
a historical process economic perspective in Fig. 4, 
which shows the cost that the oil product would need 
to sell for to make specific DCL processes economically 
competitive. As the process technology advanced the 
liquids yield from bituminous coals increased from 2.7 
to 4.3 bbl/t. The increase in the yield of liquids are 
partly due to the fact that the later process designs 
used steam reforming of natural gas to provide hydro¬ 
gen instead of gasifying a portion of the coal. The costs 
decreased concomitantly, but today direct coal lique¬ 
faction still remains highly capital-intensive. 

Overcoming the techno-economic difficulties of 
DCL could enable a lower cost overall CTL process 
than ICL, given that DCL does not require the gasifi¬ 
cation step. Although largely dormant for over a decade 
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Coal to Liquids Technologies. Figure 3 

Flowchart for the NEDOL process for direct liquefaction of bituminous coal 
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Coal to Liquids Technologies. Figure 4 

Relative economics for representative DCL processes [36] 
(O&M is operations and maintenance) 


in the USA, Europe, and Japan; direct liquefaction is 
undergoing renewed research effort, most actively 
pursued in the People’s Republic of China (PRC). 

Recent DCL Developments in China Shenhua has 
recently constructed the world’s first commercial DCL 
facility, in the Inner Mongolia Autonomous Region. 
Shenhua’s direct liquefaction process combines ele¬ 
ments of German, Japanese, and, primarily American 
technologies with its own innovations. The single-train 
reactor system and process plant were designed by HTI 
and licensed from Headwaters Inc. (HTI is currently 
a subsidiary of Headwaters), and operations of its first 


production line commenced in 2008. Shenhua has 
planned expansion in two phases, for a total of ten 
production lines with projected oil equivalent products 
of 10 million t/year, including approximately 150,000 
barrels/day of middle distillate fuels (naphtha + diesel). 
The engineering design of the plant, called SH-I, calls 
for a daily input of 7,000 metric tons of Shenhua’s 
subbituminous coal. Also, Shenhua Group, Shanghai 
Huayuan Group, and Shanghai Electric Group have set 
up a coal liquefaction research center in Shanghai. The 
center will mainly explore and develop direct and 
indirect coal liquefaction technologies. 

In summary, advantages of DCL include: 

• High-octane gasoline is produced 

• More energy efficient than ICL 

• Products have a higher energy density than indirect 
conversion 

Disadvantages of DCL include: 

• Processing difficulties associated with solids 
separation 

• High aromatic content 

• Low cetane number diesel (45) 

• Potential water and air emissions issues 

• High capital intensity 

• May have higher operation expenses than ICL 
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• Lower feedstock flexibility than ICL 

• High requirements for external hydrogen supply 
(which has to be provided by, e.g., additional coal 
gasification or natural gas) 

Coprocessing 

Coprocessing combines waste materials having high 
hydrogen content (relative to coal) with coal as feed 
for conversion to liquid fuel products. Over 44 billion 
pounds of plastic waste materials, 280 million automo¬ 
tive tires, and about 1.2 billion barrels of waste oil are 
generated in the USA each year [42] . Coprocessing with 
coal would allow capture of energy content in these 
waste materials instead of simply disposing of them. 
It has frequently been stated that synergistic interac¬ 
tions between coal and heavy oil can result in distillate 
yields and coal conversion levels greater than expected 
from individually processing the coal and heavy oil. 
Several companies researched coprocessing, including 
Chevron s 6 t/day coprocessing plant run in 1983 at the 
company’s Richmond, California site. The research 
included synergistic incorporation of pretreated petro¬ 
leum vacuum resids with coal as feedstocks, and 
demetallation of resids was also performed [43]. The 
process was said to have good operability. 

The poorer solvent properties of heavy oils 
detracted from the conceptual simplicity and led 
to use of forcing conditions and increased hydrogen 
consumption that have adversely affected economics 
of coprocessing [44]. Although developments in 
coprocessing have been essentially on hiatus for over 
a decade, the topics of energy and reuse of materials 
remain prominent. Improvements in technologies for 
gasification, solvation, and C0 2 capture might lead to 
a resurgence of this CTL approach. 

Indirect Liquefaction 

Indirect coal liquefaction typically refers to processes 
that first generate synthesis gas by gasification of coal 


followed by production of solid, liquid, and gaseous 
hydrocarbons via catalytic reduction of CO in subse¬ 
quent stages of the process. Frequently, the term ICL 
refers to the synthesis of liquid transportation fuels by 
means of Fischer-Tropsch synthesis catalyst. Other ICL 
technologies include methanol and/or dimethylether 
from syngas, or methanol-to-gasoline (MTG). MTG 
was commercialized in 1986 and uses ZMS-5 zeolite 
catalyst to convert methanol to gasoline. 

A basic block flow diagram of key process steps in 
F-T ICL is shown below in Fig. 5. 

After mining and coal preparation, beneficiated 
coal (coal that has been subjected to physical grinding 
and chemical cleaning methods during coal prepara¬ 
tion) is first gasified with oxygen and steam to produce 
syngas. This syngas is cleaned of all impurities and sent 
to the F-T reactors where most is converted to liquid 
hydrocarbon fuels. The liquid products from F-T ICL 
contain zero sulfur and essentially zero aromatic 
hydrocarbons, and therefore ultra clean diesel or jet 
fuels can be produced after additional refining. Carbon 
dioxide is produced mainly during water gas shift (vide 
infra) and the F-T synthesis reactions, and can be cap¬ 
tured for subsequent storage or use. The unconverted 
synthesis gas can be either recycled back to the F-T 
reactors, to maximize liquid fuels production, or 
combusted in a gas turbine combined cycle power 
plant to generate internal and external power. Thus 
ICL plants can be configured to produce liquid fuels 
only, on a freestanding basis, or to produce a combi¬ 
nation of liquid fuels and power for export. 

The other major ICL approach produces methanol 
from the syngas, via a different set of chemistries from 
F-T, and uses the methanol as a chemical or as a fuel, 
typically as a blend with petroleum derived gasoline. 
The methanol serves as oxygenate and fuel extender. 
Dimethylether (DME) also can be produced from syn¬ 
gas, either directly or via dehydration of methanol. 

Synthesis of fuels and chemicals from syngas 
does not occur in the absence of appropriate catalysts. 
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Key process steps for indirect coal liquefaction [45] 
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The basic concept of a catalytic reaction is that reac¬ 
tants absorb onto the catalyst surface, rearrange atoms, 
and combine into products that desorb from the sur¬ 
face. One fundamental functional difference between 
different syngas conversion catalysts is whether or not 
the adsorbed CO molecule is cleaved during reaction at 
the catalyst surface. In Fischer-Tropsch synthesis, C-O 
bond cleavage is a necessary reaction step. In methanol 
synthesis, the CO bond remains intact. Hydrogen plays 
two roles in catalytic syngas conversions, as a reactant 
needed for hydrogenation of CO to give the hydrocar¬ 
bon moieties, and as a reductant to activate and regen¬ 
erate the metal surface. 

Much of the research and process development on 
coal liquefaction after the 1990s was aimed at matching 
the fuels synthesis conditions with modern, efficient 
coal gasifiers such as those developed by Uhde, Texaco 
(now GE), Shell, and ConocoPhillips. A block flow 
diagram of a typical F-T CTL process to produce naph¬ 
tha and diesel middle distillate cuts is shown below in 
Fig. 6, with the liquefaction end of the process indi¬ 
cated within the dash-lined boundary. Coal feed is 
prepared in the milling and drying section and then 
sent to the gasifier where it is fully converted to raw 
synthesis gas (or a coal slurry gasifier may also be 
employed instead). 

In the F-T synthesis step, the syngas is converted to 
raw product. The lighter ends of the raw F-T product 


are partially recycled internally and the remainder is 
sent to the heavy ends recovery unit. Oxygen for coal 
gasification is supplied by an air separation unit (ASU, 
not shown). A part of the raw synthesis gas is treated in 
the water gas CO-shift unit to convert steam and CO to 
hydrogen and C0 2 , after which this treated gas is 
remixed with the untreated portion. The split between 
the portions is controlled such that the required H 2 /CO 
ratio of the F-T synthesis is always met. The shifted 
syngas is then submitted to an acid gas removal (AGR) 
step, where COS, H 2 S, and C0 2 are removed to pro¬ 
duce clean syngas feed for the F-T synthesis unit. The 
Rectisol adsorption process is commercially proven in 
CTL applications and is typically deployed for AGR, 
applying methanol at low temperatures as a solvent. 
However, a more energy efficient system, such as poly¬ 
ethylene glycol (PEG)- or glycol diether-based pro¬ 
cesses (e.g., Selexol or Genosorb) may be used instead 
of Rectisol, although in these processes the COS has to 
be removed in a separate, COS hydrolysis step. A small 
fraction of the clean syngas is sent to a pressure 
swing adsorption (PSA) unit for production of pure 
hydrogen, as H 2 is required in the product workup 
section, and the major portion of the syngas is routed 
to the F-T plant. 

The tail gas from the heavy ends recovery unit is 
sent to a combined cycle plant to produce sufficient 
power for the entire CTL complex. The F-T waxes and 
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Fischer-Tropsch coal to liquids process [15] (Figure courtesy Uhde Corporation of America) 
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FT condensate products are sent to the product 
upgrade section, where they are hydroprocessed with 
hydrogen (from the PSA unit) to produce primarily 
diesel product. A significant amount of water is 
coproduced with the hydrocarbons during F-T synthe¬ 
sis hydrocarbons. This water fraction contains impuri¬ 
ties such as oxygenates, which have to be removed in 
a water treatment step. Sour gas (H 2 S, COS) removed 
in the AGR unit is sent to a sulfur recovery unit, a Claus 
plant, where it is converted into elementary sulfur. The 
tail gas from the Claus plant is hydrogenated and 
recycled to the acid gas removal unit, or it can be 
treated in a SCOT (Shell Claus Off-gas Treating) treat¬ 
ment unit. Using the SCOT process, the sulfur compo¬ 
nents in the Claus tailgas are catalytically converted to 
hydrogen sulfide, which is removed by solvent extrac¬ 
tion and recycled to the Claus plant, and the off gas 
from the absorber is incinerated. As an alternative to 
the recovery of elemental sulfur, sulfuric acid can be 
produced from the sour gas components. 

The yield of products in a fully self-sufficient CTL 
plant based on hard coal gasification is approximately 
1,875 barrels per ton of coal. The amount of sulfur 
produced depends on the concentration of sulfur in 
the coal. 

Gasification Gasification breaks down coal into mol¬ 
ecules of much lower molecular weight, CO and H 2 , 
usually using high temperature in the presence of 
oxygen at high pressure. Gasification technologies 
introduce air and/or oxygen, often with steam to mod¬ 
erate the temperature in the combustion zone of the 
gasifier. The product is syngas, a mixture of hydrogen 
and carbon monoxide. The syngas is subsequently 
converted to liquid fuels and chemicals in ICL pro¬ 
cesses. The three main types of gasifiers are the moving 
bed, fluidized bed, and entrained flow reactors. Gasifi¬ 
cation technology alternatives are grouped under the 
three technology types in Table 3. All three types work 
well and are in commercial use. 

Moving Bed Gasifiers The moving bed may use co¬ 
current or counter-current flows and is sometimes 
referred to as a fixed bed because the coal bed is kept 
at a fixed, constant height. The moving bed gasifier first 
provided a convenient means to handle solid residuals 
(ash) from the gasification. Moving bed gasifiers can 


Coal to Liquids Technologies. Table 3 Types of coal 
gasification technology 


Gasification 

type 

Technology 

Moving bed 

Lurgi - Dry Gas (Germany) 

FBDB - Sasol (South Africa) 

British Gas and Lurgi - BGL (UK) 

Fluidized bed 

High Temperature Winkler (HTW) - RWE, 
formerly Rheinbraun - marketing 
byUhde (Germany) 

Kvaerner (Finland) 

Foster-Wheeler (Switzerland) 

GTI/SES - U-gas (USA) 

Entrained 

SCGP - Shell (USA) 

Koppers-Totzek - Uhde (Germany) 

PRENFLO™ - Uhde (Germany) 

SFG - Siemens (Germany) 

GE (USA) 

ConocoPhillips (USA) 


accommodate coal with moisture content as high as 
35%, as long as the ash content is less than 10%. 
Depending on the temperature at the base of the bed 
of coal, the ash is either dry solid or a molten slag. If 
excess steam is added, the temperature can be kept 
below the ash fusion point. The coal bed typically 
rests on a rotating grate through which dry ash 
residue falls for removal. To reduce steam usage, 
British Gas and Lurgi developed a slagging bottom 
gasifier (BGL gasifier) in which the ash is allowed to 
melt and drain off through a slag tap. This gasifier 
has over twice the capacity per unit of cross-sectional 
area compared to the dry bottom gasifier. Allied 
Syngas, Advantica, and Envirotherm offer the BGL 
technology commercially. 

Fluidized Bed Gasifiers In the fluidized bed gasifier, 
the feed is fluidized in air, or sometimes oxygen, and 
steam with upward, turbulent flow for efficient mixing. 
The temperature is normally below ash melting 
temperature. The ash is removed as dry particles, 
which is advantageous if the ash is highly corrosive 
and would damage the reactor if removed as a molten 
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slag (a problem when gasifying high-sulfur coal or 
biomass). Throughput of the fluidized bed gasifier is 
higher than for the moving bed. The low operating 
temperature of the circulating fluidized bed reactor 
technologies of Kvaerner and Foster Wheeler are 
particularly appropriate for low-rank coal (or other 
reactive feed such as biomass). The higher pressure of 
the HTW gasifier technology is suitable for coal (and 
biomass) [46].GTI developed an agglomerating ash 
fluidized bed gasifier for its U-Gas process in which 
crushed limestone for sulfur capture is injected with the 
coal, and adding the limestone aids in processing ashes 
having very high fusion temperatures. Char and ash 
that exit the gasifier with the product gas are recycled 
to the hot agglomerating and jetting zone, where 
temperatures are high enough to pyrolyze freshly 
introduced coal, gasify the char, and soften the ash 
particles. The ash particles stick together and fall to 
the base of the gasifier where they are cooled and 
removed. The agglomerating fluid bed gasifier can be 
blown by either air or oxygen. The U-Gas technology is 
licensed by Synthesis Energy Systems (SES). 

Alkaline metal compounds, especially chlorides and 
carbonates, can be used as catalysts for coal gasification 
in a fluidized bed including such compounds as potas¬ 
sium carbonate, zinc chloride, and sodium carbonate. 

Entrained Flow Gasifiers Entrained flow technologies 
particularly target issues of solids handling in coal 
gasification. When bituminous coals are heated to 
300-350°C, the particles tend to swell and 
agglomerate in producing a consolidated cake. The 
handling of this caking char and the heavy tars that 
accompany it has been critical to the development of 
gasification processes. The agglomerate that forms can 
disrupt gas flow patterns and lower thermal efficiency. 

In the entrained flow reactor technology, coal is fed 
as a dry, pulverized solid or as a fine slurry with liquid 
and is most commonly gasified with oxygen (rather 
than air) in co-current flow. It is operated above the 
ash-melting point, which means that the molten ash 
forms a liquid slag inside the gasifier that runs down 
the walls of the gasifier and is removed through 
a bottom hole. The use of oxygen at high pressure 
and temperature is a source of both high throughput 
and versatility. Typically a dry-feed gasifier such as 
SCGP or PRENFLO can process a wide range of coals, 


from low-grade soft brown coal to top-grade hard 
anthracite. These dry-fed entrained-flow gasifiers have 
single-train capacities of about 3,500-4,000 t/day. Most 
of the ash is removed as slag. 

Uhde’s PRENFLO entrained gasifier technology has 
operated successfully for the last 12 years in the world’s 
largest dry-fed IGCC plant, located in the Elcogas plant 
in Puertollano, Spain. PRENFLO employs dry-feeding, 
multiple side-fired burners, and a cooled membrane 
wall for robustness under the slagging conditions inside 
the gasifier. The raw gas leaving the PRENFLO™ gas¬ 
ifier is first cooled down to approximately 900°C in 
a gas quench system with a recycle gas stream. The 
sensible heat of the raw gas is used to produce saturated 
high pressure steam, and saturated medium pressure 
steam, both of which are ultimately used in the steam 
turbine of the power block to produce power. The raw 
gas is next sent to gas conditioning operations; 
consisting of filtration to remove particulates, scrub¬ 
bing with water to remove chlorine and other soluble 
components, removal of COS and HCN in a hydro¬ 
lysis reactor, and removal of H 2 S using a methyl- 
diethanolamine (MDEA) solution. The H 2 S fraction 
from the desulfurization unit is treated in the Claus 
unit with tail-gas recycle, where liquid elemental sulfur 
is produced for sale on the local market. Electrical 
power is generated by a single combustion turbine 
and a single steam turbine. 

A slurry feed may be used with low rank coals 
as these can exhibit serious particle decrepitation, 
dusting, and even spontaneous combustion when 
exposed to drying conditions upon storage, crushing, 
and handling. A typical coal water slurry contains about 
70% mass of coal, pulverized to minus 200 mesh or 
finer, and water with about 1% mass of an additive to 
alter the rheological properties of the liquid mixture 
and thus allow a larger coal loading [47]. Slurry-fed 
entrained flow gasifiers include the GE gasifier and the 
E-Gas gasifier (now owned by ConocoPhillips). The GE 
gasifier coal slurry is fed through single-burner at the 
top of the gasifier, which is a refractory lined pressure 
vessel. Refractory stability under slagging conditions 
depends largely on the ash content and ash properties. 
Furthermore the burner lifetime of slurry-fed burners 
is considerably lower than that of dry-fed burners. 
Reactor exit gas is cooled either by direct water injec¬ 
tion or by a radiative cooler located directly below the 
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reactor. The E-Gas gasifier differs from other systems in 
that it uses a two-stage reactor, also involving refractory 
lined pressure vessels. The bulk of the feed slurry and all 
the oxygen are sent to the first (horizontal) stage, where 
the coal is gasified. Hot gas flows into the second 
(vertical) stage, where the remainder of the coal slurry 
is injected. Hot fuel gas is cooled in a fire tube boiler gas 
cooler and then sent to the F-T unit. The process pro¬ 
duces no ash and recycles by-products for uses such as 
road construction materials. 

Today, both solid (dry) and slurry-fed gasifiers are 
likely to be entrained flow gasification systems. The 
newer entrained flow gasifiers are considerably more 
efficient, albeit at the price of making product gas 
having a higher content of syngas, total H 2 and CO, 
but a lower H 2 /CO ratio [48]. A critical aspect of long¬ 
term sustainability of any gasification technology is 
that it be versatile enough to accommodate various 
ranks of coal as well as other feeds such as biomass. 


The first commercial scale and currently longest 
running coal gasification plant in the USA was put 
into service in 1983 by Eastman at its Kingsport, 
Tennessee facility. A schematic of the gasification 
plant is shown in Fig. 7. The process uses two high- 
pressure (1,000 psig) GE gasifiers, one being a spare, to 
convert about 1,250 t/day of bituminous coal into 
acetyl chemicals via methanol and CO. Over 99.9% of 
the sulfur (as H 2 S) is removed from the syngas via 
a Rectisol unit designed by Linde AG and is recovered 
as elemental S by a combination of Claus sulfur recov¬ 
ery units (SRUs) and Shell Claus off-gas treating 
(SCOT) processes. C0 2 is also removed by the Rectisol 
process and is currently vented, but could easily be 
captured for sequestration. A cold box, also designed 
by Linde AG, is used for cryogenic separation of CO 
from a portion of the syngas, and some of the syngas 
itself is used in downstream methanol plants based on 
technology from Lurgi and Air Products. Mercury 
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Coal to Liquids Technologies. Figure 7 

Flow diagram of Eastman's coal gasification plant (Figure courtesy of Eastman Chemical Company) 
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contaminant originally present in the coal is removed 
using a packed bed of sulfur-impregnated activated 
carbon. Originally built for production of chemicals 
from coal, the Eastman gasification process and asso¬ 
ciated learning curve with a variety of coals are repre¬ 
sentative of what would be used for future production 
of power and fuels from coal in processes such as 
integrated combined cycle gasification (IGCC) and 
industrial gasification. 

Disadvantages of entrained-flow gasification include 
a rather high oxygen requirement and a high waste heat 
recovery duty. However, the ability to feed coal that is 
dry reduces the oxygen requirement and increases effi¬ 
ciency compared to a single-stage entrained gasifier 
using slurry feed. 

Fischer-Tropsch The Fischer-Tropsch (F-T) reac¬ 
tion is a polymerization or oligomerization of carbon 
monoxide performed catalytically under reducing 
conditions. Chemically F-T is single reaction, with 
approximate stoichiometry for linear hydrocarbon 
production as shown in Eq. 1. The greater hydrogen 
content of methyl groups at the chain termini increase 
the stoichiometric ratio of H 2 :CO slightly beyond 2:1, 
with the actual F-T stoichiometry for paraffinic hydro¬ 
carbon production being about 2.1:1. 

nCO + (2n+ 1)H 2 <-► CH 3 -(C n _ 2 H 2n _ 4 )-CH 3 
T nH 2 0 

AH 298 « —38 kcal/molCO 

(i) 

Mechanistically, the F-T reaction consists of four 
steps: (1) formation of metal-bound C 3 species 
from feed CO (or possibly C0 2 if present); (2) chain 
initiation wherein metal-bound C 2 species form from 
intramolecular or intermolecular coupling of the 
metal-bound C 3 species; (3) chain propagation 
wherein the initial metal-bound C 2 species grows as it 
couples with newly formed metal-bound C 3 species; 
and (4) chain termination wherein metal-bound C n 
species undergo hydrogen transfer reactions that dis¬ 
able further propagation [49]. Mechanistic specifics 
remain in debate but it is well accepted that hydrogen 
transfer reactions play key mechanistic roles throughout 
the reaction, including metal-catalyzed hydrogenation 
of CO, (3-hydride elimination from the growing chain to 


give terminal olefins, and intramolecular reductive 
elimination of hydrocarbon (R) and hydrogen (H) frag¬ 
ments from the catalytic center to give products during 
chain termination. Hydrogenation is not always com¬ 
plete and oxygenation products such as lower alcohols 
and carbonyl compounds also form. Olefins form 
throughout the process via (3-hydride elimination, and 
these either re-hydrogenate during propagation or exit 
from the catalytic cycle during termination. 

Performance Variables and Parameters, and Product 
Distribution The reaction temperature and catalytic 
metal strongly influence the chemistry and product 
distribution. The reaction is run in either a high 
temperature (HTFT) or low temperature (FTFT) 
regime. The HTFT reaction is always performed with 
iron as catalyst, at ~340°C, and the FTFT reaction is 
run at ~220°C when using a Co-based catalyst. The 
Fe-catalyzed FTFT reaction can be run a little hotter, 
at 230-270°C. The HTFT reaction is traditionally used 
to make light syncrude for production of gasoline 
and diesel. The FTFT reaction produces primarily 
high molecular weight hydrocarbons, referred to 
collectively as F-T wax, that are typically hydrocracked 
to make diesel [50]. 

Much novel work on stable cobalt catalysts for 
slurry bed gas to liquids (GTE) applications has been 
patented in the past 15 years. Although some compa¬ 
nies, notably Syntroleum, apparently plan to use cobalt 
in CTF F-T operations, reports dedicated specifically to 
the use of Co in F-T CTF are essentially unavailable 
[51]. This is mainly because unlike iron, cobalt does 
not undergo the water gas shift reaction (WGSR) at the 
conditions needed to produce the supplemental hydro¬ 
gen needed to make saturated hydrocarbons from coal. 
It is also the case that, while essentially all examples of 
Co patents use GTF conditions, many of these Co 
patents state that various feed sources including syngas 
from coal may be used instead of natural gas [52]. 
Major research domains of emphasis in F-T CTF catal¬ 
ysis include understanding and fixing problems of cat¬ 
alyst instability and attrition at slurry FTFT conditions 
[53, 54]. 

The parameters used to assess and report perfor¬ 
mance of F-T catalysts are those typically used for 
catalytic reactions: conversion, selectivity, product dis¬ 
tribution, yield, reactor productivity, and catalyst fate 
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or lifetime. The selectivity of a catalyst with a given feed 
composition is the main determinant of measured 
product distribution (although product distribution 
is also influenced by process design). The yield of 
a given cut is the product of overall conversion and 
selectivity for that cut. Yield is usually expressed as 
a function of time and catalyst quantity in these con¬ 
tinuous processes. Catalyst fate relates to the mecha¬ 
nism and time frame of catalyst deactivation and 
therefore determines its useful lifetime. 

The conversion for a F-T process is generally 
reported as %CO converted to any product, although 
conversion of total feed (CO + H 2 ) is sometimes used. 
Coproduction of the undesired by-product methane 
typically rises directly with conversion. F-T synthesis 
with Co is typically run at about 60% conversion, and 
conversions in LTFT are typically in the range of 
50-70%. 

The reactor productivity is the quantity of product 
produced per quantity of catalyst per unit time, 
expressed by weights of product and catalyst (e.g., 
kgprod/kgcat/h) or by volumes (L prod /L cat /h). Productiv¬ 
ity may be reported for bulk product or for specific 
cuts, or even as the quantity of CO converted per 
quantity of catalyst per unit time. Productivity of the 
LTFT reaction performed in a slurry bed reactor is 
higher than for a fixed bed, primarily as a result of 
improved heat management in the slurry system. The 
continuous motion of the liquid-catalyst matrix allows 


sufficient heat transfer to achieve commercial slurry 
reactor productivities of about 200 volumes of CO 
converted/vol catalyst/h. 

The product distribution reflects the selectivity of 
the catalytic system at the reaction conditions, typically 
reported as a percentage of the total product or as 
a percentage of the CO converted (carbon basis). Selec¬ 
tivity data also may be supplied in absolute quantities, 
as reactor productivities for specific cuts. The weight 
fractions of various product cuts obtained within the 
F-T kinetic regime are shown in Fig. 8. The product 
slate is influenced by many reaction variables; particu¬ 
larly the feed composition, the reaction temperature, 
the catalytic metal and reaction promoters, and the 
support material if present. 

F-T product slates include gaseous and liquid frac¬ 
tions. The gaseous reaction product includes hydrocar¬ 
bons boiling below about 650°F (tail gases through 
middle distillates). The liquid reaction product (the 
condensate fraction) includes hydrocarbons boiling 
above about 650° F (vacuum gas oil through heavy 
paraffins). Commercially, the product boiling below 
650° F is typically separated into a tail gas fraction 
and a condensate fraction of about C 5 -C 2 o normal 
paraffins plus higher boiling, branched hydrocarbons. 
Branching may be advantageous to a number of end- 
uses, particularly when increased octane values and/or 
decreased pour points are desired for lube oil additives. 
The degree of isomerization is preferably greater than 1, 
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Product distribution plots for the Fischer-Tropsch reaction [55]. (a) General distribution, and (b) additional breakdown of 
the C 12 + cuts 
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and more preferably, greater than 3 mol of isoparaffin 
per mole of 77 -paraffin. If used in a diesel fuel compo¬ 
sition, the products preferably have a cetane number of 
at least 60. 

The F-T reaction produces a wide range of saturated 
and olefinic hydrocarbons. Figure 9 shows the percent¬ 
age proportions of olefins, alkanes, oxygenates, and 
aromatics in upgrades syncrude obtained from the 
F-T reaction at varied conditions of temperature 
regime and catalytic metal. The HTFT reaction gives 
aromatics, while the LTFT reaction does not give aro¬ 
matics with any catalytic metal. Cobalt is a better 
hydrogenation catalyst than iron, and therefore prod¬ 
ucts of the Co-catalyzed F-T reaction tend to have 
a lower olefin/paraffin ratio than those of iron. (Not 
shown in Fig. 9, the high temperature reaction also 
gives more branched hydrocarbons, and with a higher 
degree of branching, than the low temperature reaction 
using either Fe or Co.) The LTFT and HTFT both give 
oxygenates, typically 10-15% for the HTFT reaction 
and about half of that for the LTFT reaction with Fe 
(and much less with Co). Most oxygenates produced in 
the LTFT reaction are alcohols. The HTFT reaction 
gives alcohols and ketones, plus a lesser amount of 
carboxylic acids and aldehydes. The LTFT reaction 


gives more methanol than the HTFT reaction, and the 
two temperature regimes give comparable amounts 
of ethanol. 

Product slates for F-T catalysis are controlled by 
polymerization kinetics, with fairly constant chain 
growth probabilities that fix the ranges of possible 
product distributions [56]. The moles of species with 
carbon number n decrease exponentially with n. The 
weight fraction of product having carbon number n, 
W n , is given by the Anderson-Schultz-Flory (ASF) 
distribution (Eq. 2): 

W„ = (1 — a) 2 «a" -1 (2) 

The parameter “a” is the Schultz-Flory distribution 
factor and represents the ratio of the rate of chain 
propagation to the rate of chain propagation plus the 
rate of chain termination. The parameter “a” is the 
chain growth probability, and 1—a is the probability 
of chain termination. In using this equation, a and a are 
considered as independent of chain length. Either value 
can be obtained graphically using this equation in log 
form, with analytical data for W M as obtained, for 
example, by gas chromatography. In practice, the 
parameter a is used most often to characterize product 
distribution [45]. 
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F-T product slates dependence on temperature and catalytic metal (Figure courtesy of SRI Consulting) [57] 
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Values of a > 0.9 are representative of processes 
producing high amounts of wax, a values of about 0.8 
maximize diesel cuts, and a values of about 0.7 are 
characteristic of naphtha production. For the interme¬ 
diate molecular weight cuts of relevance to fuels pro¬ 
duction, diesel and naphtha, the ASF distribution 
reveals the limitations on selectivity shown in Fig. 8. 
Because selectivity for diesel cuts does not exceed about 
22% but wax selectivity can reach 90 + %, much indus¬ 
trial research has focused on wax production with 
downstream hydroprocessing of wax to diesel (or to 
lube oil base stocks). The a values for oxygenated 
products appear to be about the same as those for 
hydrocarbons, which suggests that the products all 
form by the same or similar chain propagation steps 
and with similar rates of chain termination, albeit with 
different chain termination chemistries for the oxygen¬ 
ates, olefins, and paraffins. 

In practice, there are several deviations from the 
ASF product distribution inherent to the F-T process. 
The amount of methane produced is often higher 
than would be predicted from this equation, because 
methane is formed by a chain termination mechanism 
that is separate from those forming the higher molec¬ 
ular weight products (methane is formed by addition 
of 2H atoms to a metal-bound Q). Catalysts that 
preferentially catalyze hydrogenation over chain ini¬ 
tiation, particularly nickel, increase the probability of 
termination to give methane. A second deviation from 
the ASF distribution is that the amount of C 2 species 
formed is often lower than expected based on the ASF 
equation, although the reason for this is not clear. It 
has been suggested that an adsorbed ethylene can 
react with a surface methylene from either side to 
propagate the chain; whereas higher alkyl adsorbed 
species would show a lesser rate of chain propagation 
due to steric constraints. 

Some approaches to catalyst design have succeeded 
in controlling the F-T product distribution by altering 
the value of alpha. One such approach is the inclusion 
of alkali metal elements, particularly potassium. Oxides 
of the alkali metals increase a and promote formation 
of wax in LTFT catalysis by iron. Use of K 2 0 helps keep 
the iron reduced, in the carbide form, which promotes 
growth of longer chains. 

With iron catalysts, alpha can exhibit two regimes 
of relatively constant value, sometimes referred to as 


the “two alpha” phenomenon. The value of a can 
increase with chain length rather than remain constant, 
and typically changes from about 0.60-0.65 to about 
0.90-0.93 at roughly C 20 [58]. A double a is most 
evident for low temperature catalyst systems with 
high wax productivity and is not evident for high 
temperature systems that make diesel or functionalized 
hydrocarbons. 

HTFT The HTFT reaction has been run in a fixed bed 
reactor (FBR) or fluidized bed reactor. Traditionally, 
Sasol used a circulating fluidized bed reactor (CFBR), 
in its Synthol process. The Synthol CFBR was replaced 
in 1989 with the Sasol Advanced Synthol (SAS) reactor, 
based on a conventional fluidized bed and providing 
superior RAMO (Reliability, cost-effective Availability, 
Maintainability, and efficient Operability). A block 
flow diagram of the F-T and downstream portion of 
Sasol’s Secunda plant using the SAS technology is 
shown in Fig. 10. 

The process is employed to produce a range of fuels 
and also olefins. The fluidized, iron-based catalyst gives 
primarily Ci~C 2 o raw product. The C 2 -rich stream is 
split into ethylene and ethane. Ethane is cracked in 
a high-temperature furnace to ethylene, which is then 
purified. Propylene from the light hydrocarbon gases is 
purified and used in the production of polypropylene. 
Large quantities of olefins in the C 5 -C n range are 
produced at these HTFT conditions and the alpha- 
olefins (straight chain olefins with an ene group in 
the terminal position) are recovered for use as 
chemicals (C 5 , C 6 , and C 8 ) are recovered or sent into 
the fuel pool (C 7 -C n ). Oxygenates in the aqueous 
stream are separated and purified in the chemical 
work-up plant to produce alcohols, acetic acid, and 
ketones including acetone, methyl ethyl ketone 
(MEK), and methyl isobutyl ketone (MIBK). 

LTFT The early German LTFT plants used fixed bed 
reactor (FBR) technology with Fe catalysts, through 
World War II. In 1955, Sasol began using a tubular 
fixed bed reactor (TFBR), consisting of a shell 
containing 2,050 tubes of 12 m length and 5 cm 
diameter packed with extruded precipitated iron- 
based catalyst. A typical catalyst system would be 
potassium-promoted precipitated iron (such as 
a Sasol-type catalyst consisting of K = 1.9 wt%, Cu = 
2.4 wt%, Si = 5.9 wt%, Fe = 47.3 wt%, 0 2 = ~42.5 wt.) 
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Secunda plant block flow diagram [59] 


This LTFT is referred to as the Arge process and is 
operated with a shell side temperature of about 220°C 
and reactor pressure of 25 bar (early Arge) or 45 bar 
(Arge commissioned in 1987), wherein the heat of 
reaction is dissipated by generation of steam on the 
shell side. Further scalability could be achieved by more 
than doubling the number of tubes, to, for example, 
5,000, but the reactor size eventually becomes 
unwieldy. Sasol found that a slurry phase system is 
more readily scaled, in part due to engineering design 
and in part because the Co-based catalysts used are 
more efficient than the Fe-based catalysts used for the 
LTFT reaction. Sasol commissioned its first slurry bed 
reactor (SBR) in 1993. 

Table 4 shows typical product slate selectivities for 
the LTFT, TFBR; the LTFT, SBR; and the HTFT FBR. 
Both fixed- and slurry-bed LTFT processes give 
much more paraffin, less olefin, no aromatics, and 
less oxygenates compared to the HTFT process. LTFT 
is particularly suited for a modern coal to diesel 
(CRD) plant. 

As indicated in Eq. 1, the F-T reaction is highly 
exothermic. The reaction heat must be removed rap¬ 
idly, to avoid temperature increases that would result in 


undesired formation of high levels of methane, light 
hydrocarbons, and, in the extreme, catalyst deactiva¬ 
tion due to coking, sintering, and disintegration. Depo¬ 
sition of carbon by the Boudouard disproportionation 
reaction, Eq. 3, is to be avoided, and the heat distribu¬ 
tion imparted by the slurry bed is advantageous. 

2CO > C T CO 2 

(3) 

AH 2 98 = —20.6 kcal/molCO 

Attrition and breakage of catalyst particles is a more 
difficult issue to overcome than coking, particularly for 
slurry bed catalyst systems [61]. Attrition refers to 
abrasive or erosive removal of surface material that is 
less than about 50 pm in particle diameter. Breakage 
refers to a decrease in median particle size caused by 
fracture of catalyst particles. As the concentration 
of smaller particles builds up, catalytic activity eventu¬ 
ally decreases and also the downstream separation of 
wax from solid catalyst becomes inefficient. Catalyst 
particles are subjected to hydrodynamically induced 
mechanical stress at slurry bubble column conditions 
that has severely limited the lifetime of LTFT iron 
catalysts. 































Coal to Liquids Technologies 


C 


2245 


Coal to Liquids Technologies. Table 4 Carbon selectivity of selected LTFT and HTFT processes (%) [60] 


Product 

TFBR 

(LTFT) 

SBR 

(LTFT) 


CH 4 (methane) 

4 

7 

C 2 -C 4 olefins 

4 

24 

C 2 -C 4 paraffins 

4 

6 

Gasoline 

18 

36 

Middle distillate 

19 

12 

Heavy oil, waxes 

48 

9 

Polar oxygenates 

3 

6 

TFBR 

(LTFT) 

SBR 

(LTFT) 

SASRFBR 
(HTFT) 
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Cs-C,2 

00 

u 

1 

PO 

u 

n-Paraffins 

95 

93 

96 

95 

55 

60 

Paraffins 

53 

65 

29 

44 

13 

15 

Olefins 

40 

28 

64 

50 

70 

60 

Aromatics 

0 

0 

0 

0 

5 

15 

Oxygenates 

7 

7 

7 

6 

12 

10 


Precipitated, unsupported iron catalysts are partic¬ 
ularly prone to disintegration into smaller particles. 
Supporting the iron imparts strength to the particle 
to decrease fracture, but at the expense of decreased 
volumetric productivity due to dilution of the active 
catalyst by the support material. Making spherical 
particles by spray drying [62] can help with fracture 
or with attrition, but not always with both [63, 64]. 

The pursuit of mechanically robust iron F-T cata¬ 
lysts remains an area of active research. Traditionally, Fe 
catalysts have been disposed of rather than regenerated, 
because iron is inexpensive. However, a few reports of 
potential use and even regeneration [65] of Fe slurry 
catalysts have recently appeared, notably from Rentech 
[66], presumably because management of even low 
hazard waste volumes is of increasing interest. It is 
also the case that using a fluidized slurry bed catalyst 
system allows longer runs, since deactivated catalyst 
can be removed and fresh catalyst added without taking 
the reactor off-line as is required for fixed bed reactors. 
On the other hand, an advantage of fixed bed reactors is 
that since the wax produced simply trickles down the 
bed, there is no problem separating the wax from the 


catalyst. In the slurry system, special internal and/or 
external devices are required to continuously remove 
wax from the finely divided catalyst in the slurry. 

Water-Gas Shift Reaction The water-gas shift reaction 
(WGSR), Eq. 4, is important for CTL because the 
H 2 /CO ratio produced by gasification of coal is lower 
than that produced by steam reforming of methane in 
GTL (gas to liquids), by about a factor of 3. Without the 
WGSR, other sources of hydrogen must be provided in 
CTL, if hydrocarbon fuels are the intended product 
slate as per Eq. 1, and additional sources of hydrogen 
in turn typically require additional use of power that 
itself generates C0 2 . 

co + h 2 o ^ co 2 + h 2 

(4) 

AH 298 = -9.8 kcal/molCO v ' 

The in situ WGSR occurs concurrently during the 
iron-catalyzed F-T reaction. Cobalt does not catalyze 
the WGSR under F-T conditions and therefore iron 
catalysts produce more C0 2 than do cobalt catalysts. 
In practice the WGSR is one of several factors that affect 
the overall efficiency of an F-T CTL process when using 
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Fe versus Co as catalyst, another being the greater 
efficiency of the smaller reactor used for Co compared 
to Fe, and each must be taken into account by process 
engineers when designing CTL plants. 

Another key aspect of the WGSR is that it can run 
in reverse. A potential use for some of the by-product 
heat typically available in hot outlet gases from high 
temperature gasifiers is to generate steam heat for an 
external water gas shift reaction, to be conducted 
downstream of the gasifier, wherein some H 2 (from 
recycle or tailgas) consumes the undesirable by¬ 
product co 2 . 

Downstream Product Upgrading Upgrading tech¬ 
nologies for F-T syncrude are modifications of those 
developed for crude oil refining. Refinery process 
options for producing fuels from F-T middle distillates 
include wax hydrocracking, distillate hydrotreating, 
catalytic reforming, naphtha hydrotreating, alkylation, 
and isomerization. The many technology licensors for 
hydroconversion processes include ChevronTexaco, 
UOP, IFP (Axens), Sud-Chemie Sasol Catalysts, and 
Haldor-Topsoe. Catalysts can be obtained from these 
companies as well as from AkzoNobel, Sud-Chemie, 
and Axens among others. 

Reforming and alkylation of gaseous products to 
produce liquid fuel-range products are appropriate 
when upgrading synthetic crude from the Fe-catalyzed 
HTFT reaction. Skeletal modification technologies 
most important to upgrading synthetic crude from 
HTFT include isomerization, mild hydrocracking, 
and even fluidized catalytic cracking. Catalytic oligo¬ 
merization of olefins is especially applicable to the 
product from an iron-catalyzed, HTFT process because 
the bulk of the crude product consists of these olefins, 
particularly C 3 -C 5 olefins (gases). The Sasol Synfuels 
CTL refinery in Secunda uses an alkene oligomeriza¬ 
tion process based on solid phosphoric acid (SPA) 
catalysis to convert most of the light alkene HTFT 
product into liquid fuels [67]. 

Currently, the production of a high quality diesel 
fuel is preferred to the production of gasoline in F-T 
fuels production. This is because the same factors that 
count against F-T gasoline, such as product linearity 
and low aromatic content, serve as very positive factors 
in favoring high quality, high cetane number in F-T 
diesel fuel. Straight run F-T diesel makes up about 20% 


of the total F-T product and has a cetane number of 
about 75 because it is predominantly linear. For max¬ 
imum production of high quality diesel, the slurry 
bed reaction operating in the high wax selectivity 
mode with cobalt-based catalyst, or the high diesel 
selectivity mode with iron-based catalyst, is the most 
efficient approach. Lighter and heavier fractions are 
usually less desirable due to limited markets and/or 
lower prices. 

The Co-catalyzed process is typically chosen for 
maximum wax production in LTFT gas to liquid 
(GTL) processes; however, the use of iron-based cata¬ 
lysts is generally preferred for CTL because coal- 
derived syngas is deficient in H 2 and cobalt does not 
catalyze the WGSR that supplements H 2 production 
during the iron-catalyzed F-T reaction. When operat¬ 
ing in the LTFT mode, iron catalyst can produce up to 
50-75% linear wax products. LTFT-derived feeds can 
be hydrocracked under much milder conditions than 
typical crude oil derived feeds, such as vacuum gas 
oils, which require pressures as high as 150 bar 
to prevent coking of the catalyst by aromatic com¬ 
pounds. This is not an issue with paraffinic feeds, and 
pressures between 35 and 70 bar are used to hydrocrack 
LTFT products using commercial hydrocracking 
catalysts. The relatively low levels of oxygenates present 
in F-T waxes, mainly alcohols and lesser amounts 
of acids or carbonyls, are easily and completely 
hydrodeoxygenated. 

After hydrotreatment, the straight run diesel fuel 
cut has a cetane number ranging from 70 to 75. If both 
hydrotreating and hydrocracking are used, they may be 
operated at the same pressure using a common hydro¬ 
gen loop. Alternatively, the hydrotreater can be oper¬ 
ated in a once-through mode, at lower pressure, and 
the hydrogen tail gas then subsequently pressurized 
and fed to the hydrocracker loop. For a 30,000 bpd 
equivalent product upgrading system involving both 
hydrocracking and hydrotreating, the size of the unit is 
comparable to those used in oil refineries. There are 
many process configurations possible depending on 
site-specific factors including plant capacity, available 
land, and possible integration with other facilities. 

When using the slurry phase LTFT reactor, water is 
removed from overhead gases in a three-phase separa¬ 
tor and then the remaining condensate is sent to a 
low temperature distillation column, in the heavy 
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ends recovery section, to recover condensate from tail 
gas in the reactor overheads. The overheads from the 
low temperature fractionation column are used as fuel 
gas. Recovered condensates from the column are sent to 
the product work up section. The waxy reactor product 
is withdrawn from the side of the F-T reactor and 
continuously separated from the catalyst. The waxy 
product is then sent to the first low temperature col¬ 
umn, which sends a heavy cut to the hydrocracker and 
a light cut to the hydrotreater. The wax hydrocracker 
produces the diesel product, while olefins in the con¬ 
densate are hydrogenated to paraffins in the 
hydrotreater. The hydrocracking step typically uses 
a silica-alumina supported Ni-Mo-based catalyst and 
can be adjusted to maximize the production of 
ultraclean diesel product. The products from the 
hydrotreater and hydrocracker are then combined and 
fractionated into diesel and naphtha products. Up to 
approximately 80% of the product is diesel and the 
remaining portion is naphtha. 

Product upgrading schemes center around three 
fuel products (diesel, LPG, and naphtha), but the 
upgrading plant is quite amenable to different config¬ 
urations if market demand so indicates. The ASF dis¬ 
tribution means that the F-T process and its associated 
refinery also produce transportation fuels and products 
other than diesel and gasoline. Other saleable products 
may include solvents, kerosene, jet fuel, illuminating 
paraffins (sold for use in lighting and appliances in 
South Africa), base oils (for lubricant products), and 
chemicals. Maximizing recovery of the higher boiling 
jet fuel while maintaining diesel production will tend to 
increase economic return. 

Major projects planned in F-T ICL include a US 
$3.3 billion plant in Ningdong with Shenhua Ningxia 
Coal Group and Sasol Ltd, scheduled to begin con¬ 
struction in 2010 [68]. 

Methanol and Dimethylether Supplementation of 
liquid fuel supplies with methanol made from abun¬ 
dant coal reserves has become increasingly attractive in 
countries that are oil and natural gas resource poor. 
Methanol has a broad range of applications including: 

• Fuel substitute for gasoline 

• Gasoline fuel blend to extend conventional fuel 

supplies 


• Energy storage medium in coproduction of power 
and fuel 

• Rapid start combustion turbines to meet peak 
electricity demand 

• Advanced methanol powered and plug-powered 
vehicles 

• Direct methanol fuel cells 

Methanol can be synthesized from syngas according 
to Eq. 5, where the thermodynamics assume gas phase 
components. The formation of methanol from syngas 
is highly exothermic. 

CO + 2H 2 <-► CH 3 OH 

AH 29 8 = -21.6 kcal/molCO U j 

The feasible operating temperature for methanol 
synthesis is strongly affected by the temperature sensi¬ 
tivity of the copper-based catalyst, which is prone to 
deactivation via sintering of copper or coking [69]. 
Methanol synthesis occurs at significant rate in the 
temperature range of 210-260°C, optimizing at about 
237°C [70]. Modern commercial methanol synthesis 
catalysts make use of the high reactivity of copper 
oxides combined with the dispersant function of zinc 
oxide, supported on alumina. This combination was 
pioneered by ICI [71] (Imperial Chemical Industries 
PLC, acquired by AkzoNobel in 2008) and allows meth¬ 
anol synthesis to be performed at low pressure with 
recycle (35-55 atm) and relatively low temperature 
(200-300°C, typically 250°C). The component con¬ 
tents are in the range of about 40-80% CuO and 
10-40% ZnO, with 5-10% A1 2 0 3 . For example, the 
unreduced material RPJ-19, from United Catalysts 
Inc., has been described as 55 wt% CuO, 36% ZnO, 
and 8% A1 2 0 3 , with 1% Si0 2 [70]. Magnesia may be 
added to control sintering and increase the available 
surface area of copper. 

The methanol synthesis catalysts are also sensitive 
to catalyst poisons. The contaminants sulfur and halide 
are typically removed upstream to <100 ppm. Because 
catalyst lifetime is significantly diminished by excessive 
temperature, reactor temperature control is very 
important. Currently, all commercial processes produce 
methanol from synthesis gas in a vapor phase reactor 
with a synthesis loop, using packed bed or tubular 
reactors, and temperature moderation is achieved 
by recycling large quantities of hydrogen-rich gas. 
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Typically, a gas phase process is limited to about 16% 
CO in the reactor inlet, as a means of avoiding excess 
heating by constraining the activity per pass. Cool, 
unreacted gas is injected at different stages in the catalyst 
bed or internal cooling surfaces are used to provide 
further temperature control. 

As a means to improve throughput per pass with 
temperature control, ChemSystems and Air Products 
developed a fundamentally different, liquid phase 
reactor system. In this system, referred to as the 
LPMeOH™ (Liquid Phase Methanol) process, synthe¬ 
sis gas is bubbled through a slurry, consisting of 
micrometer-sized catalyst particles and methanol in 
a suspension of mineral oil. The nonvolatile mineral 
oil acts as a temperature moderator, transferring the 
heat of reaction from the catalyst surface via the liquid 
slurry to boiling water in an internal tubular heat 
exchanger. As a result of maintaining a constant, highly 
uniform temperature through the entire length of the 
reactor, the slurry reactor can achieve a much higher 
per pass syngas conversion than its gas-phase counter¬ 
parts and also CO concentrations in excess of 50% have 
been maintained without adverse effect on catalyst 
activity. As with slurry F-T systems, the liquid phase 
methanol process benefits from the ability to withdraw 
spent catalyst slurry and add fresh catalyst on line 
periodically, without having to shut the system down. 

Typical commercial Cu/Zn/alumina methanol syn¬ 
thesis catalysts are also active for the low temperature 
water gas shift (LTWGS), which enables use of syngas 
from lower quality, low-hydrogen feedstocks such as 
coal to be used in making methanol. Reduction of CO 
or C0 2 to methanol with H 2 is exothermic, and heat 
dissipation in fixed bed reactors is inadequate to pro¬ 
tect the catalyst or to allow optimal product recovery. 
While less exothermic than the methanol synthesis 
reaction, the WGSR also adds heat to the system. 

ChemSystems performed pioneering development 
of the LPMeOH process in the 1970s [72]. The 
LPMeOH technology was based on early design and 
demonstration work funded by EPRI (Electric Power 
Research Institute) in the mid-1970s, wherein metha¬ 
nol is produced in one step using the high-CO 
content syngas from coal [73]. Air Products then 
undertook a significant research effort together with 
ChemSystems and DOE co-funding, to further develop 
the process in the 1980s and 1990s, as part of the DOE’s 


cogeneration program. While ChemSystems originally 
used an ebullating bed design with an aromatic 
(1,2,4-trimethylbenzene) or mineral oil liquid, Air 
Products used the more stable mineral oil in a slurry 
bubble reactor to further augment heat and mass 
transfer. 

The LPMeOH process was demonstrated at large 
pilot scale during 1997-2002, in a collaborative effort 
between Air Products (technology supplier), Eastman 
Chemical (syngas supplier, plant operator, and end user 
of methanol), and Arcadis (fuel and power testing). 
This facility in Kingsport, TN operated for nearly 
6 years, with continuous runs of up to 3 months. The 
nameplate capacity of 80,000 gpd was exceeded by 15%. 
Eastman syngas, derived from high-sulfur (3-5%) bitu¬ 
minous coal, was used. Of note is the ability to produce 
sulfur-free methanol containing less than 1% water, 
which makes this methanol directly suitable for trans¬ 
portation as well as chemical use. Used as a fuel addi¬ 
tive, methanol also showed promise for reducing NOx 
emissions from gas turbines and diesel engines. Air 
Products also applied the understanding gained from 
developing the LPMeOH process in developing the 
Liquid Phase Dimethyl Ether Process (LPDME™) pro¬ 
cess, which was demonstrated in the early 1990s. 

Sunggyu Lee and colleagues, at the University 
of Akron and later at the University of Missouri- 
Columbia, pioneered the mechanistic understanding 
of the one-step slurry methanol process and also 
worked on the MeOH/DME slurry process [74]. The 
LPMeOH process has three distinctive features and 
benefits compared to prior art using fixed beds. First, 
the commercial Cu/Zn/alumina methanol synthesis 
catalyst is slurried instead of fixed, which greatly 
improves heat transfer via the inert, liquid medium. 
Second, the slurried catalyst is a fine powder, which 
avoids mass transfer limitations involving pore 
diffusion. Third, the improved mass transfer in the 
well-agitated slurry allows much higher single-pass 
conversion of the syngas. 

The initial method devised to combine steps in 
liquid-phase DME production involved combining 
the methanol dehydration catalyst with the slurry 
phase methanol synthesis catalyst [75]. For DME, 
typically, the methanol synthesis catalyst is copper on 
alumina promoted by zinc oxide. The ratio of CuO/ 
Zn0/Al 2 0 3 is typically about 2:1:5 with a H 2 :CO ratio 
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of 2:1 but will vary with syngas composition. Alumina 
is the most common dehydration catalyst. Both CO 
conversion and selectivity for DME are improved 
by including 10%wt sulfate in the alumina. Many 
researchers have reported using zeolites, and recent 
research with zeolites has been quite active in Asia 
[76]. Likewise, silicoaluminophosphate (SAPO) 
molecular sieves have been reported recently for use 
as dehydration catalysts in direct synthesis of DME. 
A third catalyst functionality may be included to 
enhance the water gas shift reaction, although the 
methanol synthesis catalyst is itself also an effective 
water gas shift catalyst. 

Long-term stability of one-step DME synthesis 
catalyst systems was noted as a primary difficulty by 
Mobil in the early 1980s and continued as an issue 
in Air Products’ work. The temperature needs and 
catalyst stability are considered for three simultaneous 
reactions in direct synthesis of DME: water gas shift 
(Eq. 4), methanol synthesis (Eq. 5), and methanol 
dehydration (Eq. 6 ) [77]. The methanol synthesis via 
Eq. 5 can be considered in principle as the combination 
of water gas shift and the reduction of C0 2 by hydro¬ 
gen (Eq. 7). 

2CH 3 OH <-► CH 3 OCH 3 + H 2 0 

, (6) 

AH 298 = -5.9 kcal/molDME v ' 

C0 2 + 3H 2 <-► CH 3 OH + H 2 0 

7 

AH 298 = -11.7 kcal/molMeOH v J 

Even so, the mechanism actually in effect during 
direct synthesis of DME is in some debate and might 
not involve the explicit intermediacy of methanol as 
per Eq. 6 . Figure 11 shows the results of modeling done 
by JFE Holdings (formed in 2002 by the merger of NKK 
and Kawasaki Steel) based on their pilot plant results, 
which give best conversion at a ratio of 1:1 CO:H 2 . 
Specifically, DME may form directly from coal-derived 
syngas as per Eq. 8 . The stoichiometry of Eq. 8 being 
noted for coal-derived syngas, it is also the case that 
catalysts for direct DME synthesis have been based on 
methanol synthesis catalysis and most processes give 
some combination of both DME and methanol. 

3CO + 3H 2 CH 3 OCH 3 + C0 2 

(8) 

AH 298 = -19.6 kcal/molCO 
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Equilibrium conversion of CO and H 2 to DME (280°C, 
5 MPa) [81] 


The WGS reaction used for one-step DME/MeOH 
production is of the low temperature type (LTWGS) 
and operates best at about 180-240° C, which overlaps 
substantially with the methanol synthesis temperature 
range. The overall chemistry is more easily controlled if 
the WGS catalyst has little or no F-T activity for hydro¬ 
carbon production, which means that the LTWGS cat¬ 
alyst needs to be copper-based instead of iron-based. 
The typical catalyst used for LTWGS in DME or meth¬ 
anol synthesis has the general formula xCu/yZnO/ 
A1 2 0 3 and has no hydrocarbon-producing F-T activity. 
A typical low temperature shift catalyst contains 
30-70% CuO (prior to reduction), 20-50% ZnO, and 
up to about 40% alumina [78]. 

The optimal temperature for the methanol dehy¬ 
dration reaction is set not by catalyst stability but by the 
equilibrium dehydration reaction thermodynamics. 
The dehydration proceeds best at about 250°C and 
yield declines as the temperature is raised further. The 
catalyst function for dehydration of alcohol to ether is 
a solid acid. Various options are possible, including 
acidic aluminas, silica-aluminas, zeolites, and various 
solid or supported inorganic acids. In practice, 
y-alumina is preferred to date. The dehydration rate 
is directly related to the acid strength of the zeolite 
and the overall rate of DME synthesis can be controlled 
by the proportion of ion exchange in the dehydration 
catalyst [79]. Dehydration is relatively fast above 


2249 


c 






2250 


c 


Coal to Liquids Technologies 


about 240° C and only a few percent of dehydration 
catalyst is needed in the total catalyst formulation 
[70]. The LPDME process was demonstrated on 
a 10 t/day scale for 25 days at Air Products’ LaPorte 
facility using commercially available methanol synthe¬ 
sis and y -alumina dehydration catalyst in a ratio of 
95:5 wt%. The temperature and pressure were approx¬ 
imately 250°C and 50 bar, respectively. Deactivation 
occurred at a rate of 0.7%/day for both catalyst com¬ 
ponents [80]. 

The concern that LPG used heavily in Asia as 
a fuel and a power source will become increasingly 
unavailable has sparked significant research to develop 
DME as a replacement [81]. Properties of DME and 
LPG are similar in terms of fuel characteristics, low 
toxicity, and especially handling, distribution, and 
storage. Some use of DME as an LPG substitute has 
recently begun in China. To demonstrate the potential 
replacement of automotive fuel, fleet tests with DME 
have been performed (the fuel injection system requir¬ 
ing modification to accommodate the lower viscosity 
of DME compared to diesel, but without modification 
of the engine itself) [82]. Turbine, boiler, and fuel cell 
tests with DME have also been performed [83]. 

JFE in Japan has been a major driver for develop¬ 
ment and commercialization of one-step slurry DME 
production. NKK, Taiheiyo Coal Mining, and their 
partners constructed a 5 t/day direct syngas to DME 
pilot plant in 1999 at Taiheiyo’s site in Hokkaido 
and operated it intermittently for about 5 years [83]. 
In 2002, JFE began a 5-year development plan for 
a 100 t/day pilot plant demonstration in Kushiro, 
with METI and many collaborators that together 
formed the DME Development Corp. and the DME 
International Corp. Interestingly, the feed for syngas 
production actually used is natural gas; however, JFE 
opts to bring the 2:1 H 2 :CO ratio (produced by 
reforming natural gas) down to 1:1 (more like the 
ratio produced from coal) by adding C0 2 during 
reforming, and the work therefore also relates to 
advancement of the state of the art in producing 
DME and methanol by ICL of coal. 

JFE reported an impressive purity from their overall 
process: 99.5 + % DME and <100 ppm combined 
water and methanol [84]. In this process, shown in 
Fig. 12, coal is fed to the gasifier and then raw syngas 
is brought up to a H 2 :CO ratio of 1:1 via the WGSR 


using steam from the gasifier. Sulfur and C0 2 are 
removed prior to DME synthesis. Unreacted CO and 
H 2 are separated from liquid products and sent back to 
the DME synthesis reactor. The DME synthesis makes 
C0 2 as a by-product, plus some methanol and a slight 
amount of water. The separation of C0 2 , DME, meth¬ 
anol, and water involves a low temperature extraction 
(—39 °C) [85]. After downstream distillation, metha¬ 
nol is cycled back to the DME reactor. 

A different type of ICL process involving methanol 
is the gasification of coal followed by conversion to 
methanol and subsequent conversion of methanol to 
a marketable gasoline product. A $3 billion plant with 
a production capacity of approximately 700,000 t/year 
to be built by TransGas is planned in Mingo County, 
West Virginia. The plant will use Uhde’s PRENFLO 
direct quench gasifier technology to make syngas, 
which will be converted to methanol and subsequently 
to gasoline using ExxonMobil’s methanol-to-gasoline 
technology. Methanol conversion is nearly complete 
and the total liquids yield is higher than in F-T synthe¬ 
sis, including significantly less tail-gas production. The 
product of the MTG reaction does not require much 
further upgrading and refining compared to other coal 
liquefaction routes. A 100,000 t/year MTG process 
plant was started in China’s Jincheng, Shanxi province 
in 2009, as part of a CTL demonstration complex. 

Like DCL, ICL is complex and characterized by 
heavy investment, but CTL processes, in general, have 
been recently described as economically competitive at 
an oil price as low as $25/barrel [86] or, more typically, 
at an oil price of about $55/barrel [45], with the cost of 
coal being that incurred at the mine (i.e., without the 
cost of shipping and handling). 

Environmental Factors The overall conversion of 
coal to liquids not only produces fuel- or chemical- 
grade products but also consumes water and produces 
carbon dioxide, as shown in Eq. 9 [87]. 

Carbon Dioxide The high carbon content of coal 
combined with the energy use in ICL processes 
involving gasification means that carbon dioxide 
emissions on a well-to-wheel basis are about 1.8 times 
more for coal than for oil. In terms of fuel preparation 
alone, it has been estimated that an F-T CTL plant 
without capture would release about 0.78 t of carbon 
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Process flow diagram of DME direct synthesis from coal (or methane) 


dioxide per barrel of product, in comparison to 
a comparable oil refinery emitting about 0.1 t of 
carbon dioxide per barrel of product. For a typical 
bituminous coal, approximately 670 lb of carbon (or 
2,450 lb of carbon dioxide) would be emitted for every 
barrel of F-T liquids produced. This compares to about 
250 lb of carbon (or 900 lb of carbon dioxide) emitted 
for every barrel of petroleum fuels produced by 
refineries. When carbon sequestration is employed for 
both facilities (92% captured and stored at the F-T CTL 
plant and 40% at the refinery), the carbon dioxide 
emissions become approximately equal. 

(4 + y )CH + 2 (y - x)H 2 0 -► (4 + x)CH y 

+ (y-x)C0 2 {9) 

The large quantity of C0 2 produced currently poses 
a significant challenge to development of the industry. At 
the time of this writing, no requirements exist in the 
USA to manage carbon emissions from fossil fuel 
sources, but should carbon management be required, 
carbon dioxide produced during the conversion 
process could be captured for subsequent storage in 


deep saline aquifers or sold for use in enhanced oil 
recovery (EOR) operations. A range-finding study 
done in 2004 by Mitretek Systems (now Noblis) for the 
US DOE considered production of substitute natural gas 
(SNG) from coal and assumed that the value of C0 2 for 
EOR was $ 12/t, which would significantly improve the 
economics of a CTL plant [88]. At present, there are 
over 2,500 km of dedicated C0 2 pipelines within the 
US delivering to commercial EOR projects in areas 
such as the Permian Basin of West Texas, southeastern 
New Mexico, the Rocky Mountain Region of Utah, 
Wyoming and Colorado, Louisiana, and the Weyburn 
Fields in Saskatchewan. With the exception of C0 2 
from the Dakota Gasification Plant, all of this C0 2 is 
naturally occurring. 

Total production related emissions from a 2 million 
barrels/day F-T CTL industry could be as much as 
a 0.5 Gt/year. There is already a significant knowledge 
base regarding how to handle C0 2 and inject it into 
deep geological structures, because injection of C0 2 is 
already used commercially in the USA to augment 
natural gas production. The transport of large volumes 
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of C0 2 via pipeline is also a well-established practice, 
within segments of the oil and gas industry. Given 
adequately scaled infrastructure for carbon capture 
and storage in place, emissions from CTL plants and 
emissions from use of CTL fuels would be comparable 
to those associated with the production and consump¬ 
tion of petroleum-based fuels. If sequestration of car¬ 
bon dioxide becomes mandatory, an additional $4—$8/ 
barrel for the price of low-sulfur, light crude oil would 
be required for profitable operation, depending on 
specific site logistics [89]. 

The high sulfur content of most coals is addressed at 
multiple junctures in the F-T ICL process. Sulfur is 
removed when the raw synthesis gas is scrubbed to 
remove C0 2 and sulfur compounds (predominantly 
H 2 S) and again during purification of F-T product 
streams. The H 2 S containing C0 2 streams are treated 
in a Claus or similar process, where the H 2 S is oxidized 
to elemental sulfur. 

Fischer-Tropsch Reaction Product Water Water is a net 
reactant in CTL processes (Eq. 9) but is a product of the 
F-T portion of CTL processes (Eq. 1). On a mass basis, 
approximately 1.3 t of water is produced per ton of 
hydrocarbon product. After separation of the 
hydrocarbon product, the F-T reaction water typically 
contains numerous dissolved oxygenated compounds 
such as alcohols, aldehydes, acids, ketones, esters, and 
traces of other hydrocarbons. Purification of F-T 
reaction water is required in commercial processes, 
because all water is reused. Distillation removes most 
of the nonacid oxygenates to leave a bottom stream 
known as F-T acid water, rich in carboxylic acids. The 
recovered oxygenated compounds can be purified to 
chemical products or used as a fuel source. After 
extraction of the oxygenates, only low concentrations 
of organic acids remain and these are easily 
biodegraded by either aerobic or anaerobic digestion 
to produce water of very high quality, for use as coolant 
or even for drinking, from the original F-T acid water 
stream. High purity water suitable for use as boiler feed 
water can also be produced but requires a third set of 
treatments, after distillation and digestion, such as 
adsorption of trace suspended solids, dissolved salts, 
and organic material on activated carbon. 

In addition to distillation-based purification of 
F-T reaction water, membrane-based, liquid-liquid 


extraction, and biological operations have been devel¬ 
oped to remove and recover carboxylic acids from F-T 
acid water [90]. The resultant water stream from these 
processes is generally of a good quality and can be 
reused or disposed of to the environment. 

After physical separation of oil and water streams, 
plus steam stripping and oil extraction to remove vol¬ 
atiles and aromatics, respectively, sulfur recovery is 
performed by conversion to elemental sulfur. Tail gas 
streams are also cleaned for use in recycle or as fuel. 

The stream of water from the cooling system 
(referred to as the blowdown stream) must be cleaned 
also. This water contains inorganic salts and is treated 
with coal ash that chemically binds the salts, before 
the remaining water is collected by evaporation and 
condensation. 

In addition to fuels, chemicals, and purified water, 
F-T ICL processes generate mineral waste derived from 
the coal. This non-leachable slag is suitable for sale 
as aggregate. Other, low-volume solid wastes include 
spent catalysts, organic sludge from biological waste- 
water facilities, inorganic sludge from flue gas desul¬ 
furization or evaporation ponds, and by-product sulfur 
if not recovered. These wastes may require specialized 
treatment to avoid transmission of soluble components 
if present, such as traces of the elements arsenic, chro¬ 
mium, mercury, molybdenum, and selenium, to local 
water supplies. However, F-T CTL plants incorporating 
modern, state-of-the-art gasification plants should 
accomplish removal of these and other potential pol¬ 
lutants within the normal plant process operations. 

Utility Water The CTL process chemistry uses water 
as a source of hydrogen and oxygen, during gasification 
of coal, and also produces water during the F-T 
reaction. The overall chemistry consumes water, but 
the chemical transformations are not the only use of 
water during coal liquefaction. The other major uses of 
water are for cooling, boiler feed water, and in solids 
handling, including processing of fine coal or dust. 
Most of the water is recovered after purification, but 
in a DCL plant, for example, 3-5% of the water being 
circulated in a cooling tower is lost to evaporation, 
leaks, and blowdown [91]. Thus, for every 1,000 gal 
circulated, 30-50 gal of makeup water is needed. 

The amount of water needed varies with the process 
design, choice of gasifier, ambient temperature and 
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humidity, and type of coal. According to the DOE 
Idaho National Lab, approximately 12-14 barrels of 
water are used for every barrel of F-T liquid fuel pro¬ 
duced, [92] meaning that 40 million gallons of water/ 
day are needed for an 80,000 barrel/day CTL facility. 
That is enough water to meet the domestic needs of 
more than 200,000 people (or 1/5 the population of the 
state of Montana). 

The usage level of water in F-T CTL plants is con¬ 
sidered a significant logistic issue in geographical areas 
of low rainfall or otherwise limited water resources. For 
example, competing use of water for agriculture would 
be a problem in the Western USA, and competing use 
of water for thermal-electric plants would be an issue in 
the Eastern USA. The availability of water is already an 
issue in Montana and Wyoming, where most of the 
inexpensive supplies of coal in the USA are located. 
Where water is in short supply, this logistic require¬ 
ment becomes an environmental and/or economic 
issue and therefore has precluded construction of 
some planned CTL plants. Use of air, instead of 
water, as coolant may substantially reduce water usage 
requirements of the exothermic F-T reaction. 

Future Directions 

Current commercialization of CTL technology serves 
the global interest in sustainability of fuel supply. Any 
additional production of fuels from alternative 
resources reduces overall world demand for petroleum 
and benefits all consumers by easing pressure on crude 
oil prices. As the price of oil increases and its easy 
availability decreases, countries with large coal reserves 
and insufficient domestic petroleum to meet increasing 
transportation fuel demands - particularly the PRC 
and India - are potential candidates for using CTL 
technologies to supplement conventional petroleum- 
based fuel products. 

In the case of DCL, contribution to sustained fuel 
supply requires that the Shenhua project be successful 
and lead to commissioning of additional units, given 
estimates of proven coal Shenhua coal reserves in the 
Shanxi Province/Inner Mongolia Autonomous Region 
of about 200 billion tons [93]. In this context, it is 
important to note that parallel efforts are also under¬ 
way in the PRC and elsewhere to erect commercial scale 
indirect coal liquefaction plants. Presumably, these 


plants have lower technical risks because the designs 
are derived from ICL technology that has already been 
well proven at full commercial scale, by Sasol. On the 
other hand, the direct liquefaction project, while 
supported by a 6 t/day pilot plant constructed in 
Shanghai, has no precedent on a larger scale. The 
more than 1,000-fold scale up from pilot plant to 
commercial unit is an ambitious objective that carries 
more than the typical amount of technical risk. These 
plants will also incorporate technologies specific to 
regional coals and environmental regulations. 

If DCL technology is successful in the PRC, it will 
garner increased global interest, perhaps in combina¬ 
tion with indirect liquefaction in a hybrid systems 
configuration. In this scenario, the low value residue 
from the direct liquefaction process would be gasified 
to provide a low cost source for synthesis gas produc¬ 
tion, rather than incurring a cost penalty for disposal. 
A recent US National Coal Council (NCC) report sug¬ 
gests that this might be a good time and justification to 
revisit the direct liquefaction pathway, particularly if 
the scale-up in the PRC is successful. The NCC report 
also suggests the speculative possibility of hybrid direct 
and indirect plants. 

In the case of ICL, the F-T technology is already well 
commercialized but currently has a very limited world¬ 
wide aggregate production volume. Computational 
science is a rapidly expanding area of research that 
offers the potential to shortcut development time by 
providing a theoretical basis for subsequent R&D and 
technology design activities. It provides the opportu¬ 
nity to quickly explore novel processing strategies to 
guide and accelerate experimental research. The com¬ 
putational work focuses on the complex and critical 
chemical and physical aspects of converting coal to 
premium fuels: the fundamentals of catalyst activity/ 
selectivity; hydrodynamics; separation of small catalyst 
particles from the liquid product; optimum system 
integration to achieve high process efficiency, minimal 
pollutant emissions and C0 2 capture; and computa¬ 
tional frameworks to enable virtual demonstration of 
the entire fuel life cycle. 

Laboratory research and additional modeling, 
followed by larger-scale bench and pilot-scale testing, 
could experimentally verify the computational find¬ 
ings. However, systems engineering is an important 
component activity needed to evaluate the technical 
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and economic feasibility of: novel processes resulting 
from computational research; advanced integrated gas¬ 
ification and F-T processing concepts and cleanup 
technology; advanced reactor types needed to improve 
the process efficiency when utilizing specific regional 
coals; novel in situ reactions/processing; and modeling 
of the gas-solid-liquid physics of the F-T reactor to help 
achieve the highest throughput and liquid product 
quality. 

Life cycle analyses performed on a “mine-to- 
wheels” basis likely will be used to identify those parts 
of the overall system impacting health and the environ¬ 
ment now. Estimates of the impact of C0 2 on global 
warming, and estimates of the impacts of global 
warming itself, may also be attempted but these pro¬ 
jections appear at present as far too broad in range to 
aid in resolving issues of energy sustainability. The 
greatest current push for the future use of coal appears 
to be in developing sustainable concepts for capturing 
and then storing or using the C0 2 , because doing so 
obviates a significant portion of the large uncertainty as 
to potential effects of that C0 2 were it not captured. 
A related type of research involves combining coal use 
with chemistries or processes that generate less (or 
consume some amount of) C0 2 , to decrease the overall 
output of C0 2 per barrel of oil equivalent produced. 

In addition to the freestanding F-T CTL process 
configuration, the basic plant can be reconfigured to 
capture about 90% of the carbon that otherwise would 
be released into the atmosphere within the plant 
boundary. Also, another option, not involving C0 2 
recycle, would be to design a polygeneration plant 
that produces both fuels and power. 

More production from existing mines and the 
opening of new coalmines will be required to accom¬ 
modate growth in the CTL industries as well as growing 
global demand for electric power. These facilities will 
require land for construction and support operations 
such as roads, railroads, and storage facilities that cre¬ 
ate changes in land use, alter topography, and impact 
ecological systems. Mitigation and reclamation strate¬ 
gies would need to be implemented to offset some of 
these changes to land and ecological resources. Cur¬ 
rently, there are very strict reclamation regulations in 
place and they are being further enhanced to encourage 
a reforestation initiative to better restore the land, while 
concurrently improving water quality and providing 


a source for carbon sequestration. Even if the environ¬ 
mental risks are addressed, there is a very good possi¬ 
bility of public reluctance to accept the need for new 
facilities, particularly these coal-based plants. Measures 
would need to be taken to involve the general public 
and other state, local, and nongovernmental entities to 
assure them that these plants could effectively protect 
human health and safety and the environment. 

However, F-T CTL plants would be expected to 
have very low environmental criteria pollutant emis¬ 
sions. They could also be designed to accommodate 
carbon capture and pressurization for subsequent 
sequestration in saline aquifers or oil reservoirs for 
enhanced oil recovery (EOR). Therefore, these plants 
could remove a good deal of uncertainty associated 
with possible new environmental regulations. Early 
plants that would sell or demonstrate C0 2 use for 
EOR would be encouraged. 

Regarding methanol and DME, natural gas is cur¬ 
rently the preferred feedstock for methanol production 
but companies such as Eastman Chemical in particular 
have rendered the coal-based process commercial, and 
cost-effective technology for C0 2 mitigation could 
pave the way for significant increase in commercial 
production of methanol and DME from coal in Asia 
and elsewhere. 

Future CTL plants will use advanced clean coal 
gasification technology to produce transportation 
fuels and optionally electric power. Pollutant emissions 
will be minimal because coal-derived sulfur will be 
removed and converted into elemental sulfur. Nitrogen 
oxides will be minimized using low-NOx burners and 
selective catalytic reduction (SCR) in the flue gas 
stream and mercury will be removed, perhaps by 
some combination of pre- and post-combustion pro¬ 
cesses. Using air coolers where possible will minimize 
water use, and solids emissions will consist of non- 
leachable slag from the gasification process. Because 
of the sensitivity of the F-T catalyst to poisons, all 
contaminants must be removed to near-zero levels 
(ppb levels), thus ensuring that overall plant emissions 
would be close to zero. 

Certainly coal conversion to hydrocarbon liquids 
and electricity is now economically attractive from 
a cash cost viewpoint. However, large capital invest¬ 
ments are required and small to medium coal conver¬ 
sion plants will not become a reality unless there are 
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effective forms of state assistance with the initial capital 
investment. This would be motivated by the desire for 
independence from imported energy and the more 
efficient and cleaner utilization of the local energy 
resource. The Chinese government has accepted this 
approach, and China is rapidly becoming the pioneer 
for developing commercial CTL technology options. 
However, it is the duty of technology providers to 
ensure that technology is made available to allow coal 
use to be as efficient as possible. 

Site-specific early design studies would provide the 
ability to obtain information on environmental base¬ 
lines for CTL plants. These plants would be ready for 
C0 2 separation and capture and the information 
obtained would define resource requirements. Site- 
specific information would also address where the 
resources, such as coal and water, are coming from, 
how they are delivered, and how waste products are 
to be reused or disposed. Additionally, current R&D 
activities cosponsored by the DOE and industry are 
being pursued to improve C0 2 separation and capture 
and define C0 2 storage sinks. 

Making fuels from coal via either DCL or ICL 
processes has the advantage that fuel infrastructures 
already in place for petroleum crude oil products can 
be used largely unchanged when a shift is made to coal- 
derived fuels. The advantage of fuel infrastructure com¬ 
patibility offered by synthetic hydrocarbon fuels is not 
so great in the PCR at present, where a liquid hydro¬ 
carbon fuel infrastructure for transportation fuels is in 
a highly developmental state. The challenges are pri¬ 
marily economic, as prospective air pollutant regula¬ 
tory constraints worldwide give high value to clean 
synthetic fuels having emission characteristics superior 
to crude oil derived fuels, whereas coal-based synfuels 
processes incur significant cleanup costs. Much, 
although not all, of the contaminants needing to be 
cleaned up result from the biological origins of coal 
itself. In that regard, clean coal research might also 
provide insight on more efficient processing and 
cleanup in emerging biomass-to-liquids processes. 

Bibliography 

Primary Literature 

1. Appert 0 (2008) CTL could provide long-term, transport-fuel 

supply. Syngas Refiner, IV 9:14-18 


2. Gray D, White C, Tomlinson G, Ackiewicz M, Schmetz E, 
Winslow, J (2007) Increasing security and reducing carbon 
emissions of the US transportation sector: a transformational 
role for coal with biomass. DOE/NETL-2007/1298, p 61 

3. De Klerk A (2009) Overview of coal-to liquids technology. 
In: De Klerk (ed) Beyond Fischer-Tropsch. Elsevier, Amster¬ 
dam, p 1. Arno de Klerk comments, "Coal-to-liquids technol¬ 
ogy started as an ironic twist of fate seen against the present 
day drive to move towards renewable sources of energy, since 
coal replaced whales as feed material for the production of 
lamp oil (kerosene). Coal is not a renewable energy source, 
except when viewed on a geological time scale, but whales 
are, yet, the whaling industry came close to making whales 
a non-renewable resource by converting all whales into oil." 
The anecdote illustrates a growing understanding among 
energy experts, that even with conservation global demand 
for energy is growing and can best be met by careful use of 
both renewable and nonrenewable resources. On this point, 
see also Crane et al 

4. Crane H, Kinderman E, Malhotra R (2010) A cubic mile of oil. 
Oxford University Press, New York 

5. Thomson E (2002) The Chinese coal industry: an economic 
history. Routledge, London 

6. Lister G (1944) Chronological records of coal mining, transport 
etc. In: Northumberland and Durham from A.D. 1180 to 1839. 
Ramsden Williams, Consett 

7. Perry H (1974) Coal conversion technology. Chem Eng 
81 (15):88—102 

8. Murakushi N (1981) The transfer of coal-mining technology 
from Japan to Manchuria and manpower problems: focusing 
on the development of the Fushun coal mines. Japanese 
Experience of the UNU Fluman and Social Development 
Programme series, p 47 

9. NEDO (2006) Clean coal technology in Japan. 4A1. Coal 
liquefaction technology development in Japan, pp 57-58. 
www.nedo.go.jp/kankobutsu/pamphlets/sekitan/cct2006e. 
pdf. Accessed 10 April 2010 

10. Sugawara A, Kurosawa S, Hatori H, Saito K, Yamada, Y, 
Sugihara M, Wasaka S, Yoshida H, Seo T, Susuki T, Inoguchi M, 
Sohnai M (1998) Coal conversion technologies on the new 
sunshine program in Japan. Preprints of symposia - American 
Chemical Society, Division of Fuel Chemistry 43(2): 330-334 

11. NEDO (2006) Clean coal technology in Japan. 4A2. Bituminous 
coal liquefaction technology (NEDOL), pp 59-60. www.nedo. 
go.jp/kankobutsu/pamphlets/sekitan/cct2006e.pdf. Accessed 
10 April 2010 

12. Ghosh TK, Prelas MA (2009) Energy resources and systems: 
volume 1: Fundamentals and non-renewable resources. 
Springer Science + Business Media, Berlin 

13. Dry ME (2002) The Fischer-Tropsch process: 1950-2000. Catal 
Today 71:227-241 

14. Jager B, Espinoza R (1995) Advances in low temperature 
Fischer-Tropsch. Catal Today 23(1 ):17-28 

15. Radtke K, Heinritz-Adrian M, Marsico C (2006) New wave of 
coal-to-liquids. An opportunity to decrease dependency on oil 


C 




2256 


c 


Coal to Liquids Technologies 


and gas imports and an appropriate approach to a partial 
revival of domestic coal industries. VGB PowerTech 86(5):78-84 

16. Williams RH, Larson ED (2003) A comparison of direct and 
indirect liquefaction technologies for making fluid fuels from 
coal. Energy Sustain Dev Vll(4):103—129 

17. Grant-Huyser M, Maharaj S, Matheson L, Rowe L, Sones E 
(2004) Ethoxylations of detergent-range oxo alcohols derived 
from Fischer-Tropsch alpha-olefins. J Surfactants Deterg 
7(4):397-407 

18. Price JG (2004) Chemicals from synthesis gas. US 6740683 to 
Sasol Technology, 25 May 2004 

19. Degnan TF Jr, Chen NY, Somorjai GA (2009) Heinz 
Heinemann's legacy at ExxonMobil: an illustrious career in 
industrial catalysis. Catal Lett 133(1 —2):227—230 

20. Brown D, Bhatt B, Hsiung T, Lewnard J, Waller F (1991) Novel 
technology for the synthesis of dimethyl ether from syngas. 
Catal Today 8:279-304 

21. Sardesai A, Lee S (2005) Alternative source of propylene. 
Energy Sources 27(6):489-500 

22. Larson ED, Yang H (2004) Dimethyl ether (DME) from coal as 
a household cooking fuel in China. Energy Sustain Dev, VII 
3:115-126 

23. Jones G, Jr Holm-Larsen H, Romani D, Sillis R (2001) DME for 
power generation fuel: supplying India's southern region. 
PETROTECH-2001 January 08-12, New Delhi, India 

24. Bhatt B, Schaub E, Heydorn E (1993) Recent developments in 
slurry reactor technology at the LaPorte alternative fuels 
development unit. In: International technical conference on 
coal utilization & fuel systems, Clearwater, April 26-29, 
pp 197-208 

25. Shikada T, Ohno Y, Ogawa T, Ono M, Mizuguchi M, Tomura K, 
Fujimoto K (1998) Direct synthesis of dimethyl ether from 
synthesis gas. In: Parmaliana A, Sanfilippo D, Frusteri F, 
Vaccari A, Arena F (eds) Studies in surface science and catalysis 
(Natural Gas Conversion V) 119: 515-520 

26. Miller CL, Cicero D, Ackiewicz M, Anderson J, Schmetz E, 
Winslow J (2006) Coal conversion - a rising star? In: 23rd 
international Pittsburgh coal conference, Pittsburgh, 
September 25-28 

27. White LC, Frederick JP (1995) ENCOAL mild coal gasification 
project. In: Proceedings of the 12th annual international 
Pittsburgh coal conference, Pittsburgh, pp 151-156 

28. US Department of Energy, National Energy Technology Labo¬ 
ratory (2002) The ENCOAL® mild coal gasification project. 
A DOE assessment. DOE/NETL-2002/1171 

29. McMillen DF, Malhotra M (1989) The role of hydrogen transfer in 
bond-cleavage and bond-forming processes during coal con¬ 
version. In: Schindler HD (ed) Coal liquefaction. A research 
needs assessment DE-AC01-87ER30110 final report, v2, ch 4, 
US Department of Energy, pp 30-49 

30. Haenel MW (2008) Catalysis in direct coal liquefaction. In: 
Gerhard E (ed) Handbook of heterogeneous catalysis, 
vol 6, 2nd edn. Wiley-VCH Verlag, Weinheim, Germany, 
pp 3023-3036 


31. Miller L (2008) Coal conversion technology. Congressional 
noontime briefing. Rayburn House Office Building, 
Washington DC, April 24 

32. Eccles RM, DeVaux GR (1982) H-coal commercialization: 
current status. Energy Progress 2(2):111-115 

33. Cleaner Coal Technology Programme (1999) (PDF). Technol¬ 
ogy status report 010: coal liquefaction. Department of 
Trade and Industry, http://www.dti.gov.uk/files/file18326.pdf. 
Accessed 10 April 2010 

34. Styles GA (1982) The Wilsonville advanced coal liquefaction 
R&D facility - accomplishments and process evolution. Energy 
Progress 2(3):160-162 

35. Valente AM, Cronauer DC (2003) Progress in coal liquefaction 
including a discussion of Wilsonville. Preprints of symposia - 
American Chemical Society, Division of Fuel Chemistry 48(1): 
147-148 

36. Burke FP, Brandes SD, McCoy DC, Winschel RA, Gray D, 
Tomlinson G (2001) Summary report of the DOE direct lique¬ 
faction process development campaign of the late twentieth 
century: topical report. DE2002-794281/XAB; DOE/PC-93054- 
94 Department of Energy, Washington DC 

37. Nolan P, Shipman A, Rui H (2004) Coal liquefaction, Shenhua 
group, and China's energy security. Eur Manage J 22(2):150-164 

38. Stohl FV, Lott SE, Diegert KV, Goodnow DC (1996) Results of 
hydrotreating the kerosene fraction of HTI's first proof of 
concept run. US DOE, SAND-96-0990C; CONF-960807-2 

39. Friedrick F, Strobel BO (1983) The KOHLEOEL experimental 
plant at Bergbau-Forschung. In: Proceedings of the 3rd Euro¬ 
pean coal utilisation conference, Amsterdam, pp 227-235 

40. Strobel BO, Loering R (1992) IGOR - taking the short cut in coal 
hydrogenation. Preprints of papers - American Chemical 
Society, Division of Fuel Chemistry 37(1):448-455 

41. NEDO (2006) Clean coal technology in Japan. 4A3. Brown 
coal liquefaction technology (BCL), pp 61-62. www.nedo.go. 
jp/kankobutsu/pamphlets/sekitan/cct2006e.pdf. Accessed 
10 April 2010 

42. Ramdoss PK, Tarrer AR (1997) Modeling of two-stage coal 
coprocessing process. Energy Fuels 11(1 ):194-201 

43. Wender I, Tierney JW (1995) Effect of pretreating of host oil on 
coprocessing. Final report. DOE/PC/91054-T15, Contract 
Number AC22-91PC91054, Oct. 1 1995 Oct 01 

44. Miller RL, Giacomelli GF, McHugh KJ, Baldwin RM 
(1989) Coprocessing of coal and residuum under low-severity 
reaction conditions: effect of basic nitrogen promoters. 
Energy Fuels 3(2):127—131 

45. Smith R, Asaro M, Naqvi S (2008) Fuels of the future: technol¬ 
ogy intelligence for coal to liquids strategies. SRI Consulting, 
Menlo Park 

46. Koornneef J, Junginger M, Faaij A (2007) Development of 
fluidized bed combustion - an overview of trends, perfor¬ 
mance and cost. Prog Energy Combust Sci 33(1 ):19-55 

47. Senapati PK, Das D, Nayak A, Mishra PM (2008) Studies on 
preparation of coal water slurry using a natural additive. 
Energy Sources A 30(19):1788-1796 



Coal to Liquids Technologies 


C 


2257 


48. SRI Consulting (2006) Coal gasification. Process Economics 
Program report 154A, ch 4, SRI Consulting, Menlo Park, 
pp 4-46 

49. Davis BH (2001) Fischer-Tropsch synthesis: current mechanism 
and futuristic needs. Fuel Process Technol 71 (1 —3):157—166 

50. Leckel D (2005) Hydrocracking of iron-catalyzed Fischer- 
Tropsch waxes. Energy Fuels 19:1795-180 

51. Davis BH (2007) Fischer-Tropsch synthesis: comparison of 
performances of iron and cobalt catalysts. Ind Eng Chem Res 
46(26):8938-8945 

52. Smith R, Asaro M (2005) Fuels of the future: technology intel¬ 
ligence for gas to liquids strategies. SRI Consulting, Menlo Park 

53. Bukur DB (2005) Attrition studies with catalysts and supports 
for slurry phase Fischer-Tropsch synthesis. Catal Today 
106(1-4):275-281 

54. Bukur DB, Carreto-Vazquez VD, Pham HN, Datye AK 
(2004) Attrition properties of precipitated iron Fischer- 
Tropsch catalysts. Appl Catal, A 266(1 ):41-48 

55. Dry M (2004) Chemical concepts used for engineering pur¬ 
poses. In: Steynberg A, Dry M (eds) Studies in surface science 
and catalysis 152: Fischer-Tropsch technology. Elsevier, 
Amsterdam, p 215 

56. Donnelly TJ, Yates 1C, Satterfield CN (1988) Analysis and 
prediction of product distributions of the Fischer-Tropsch 
synthesis. Energy Fuels 2:734-739 

57. Smith R (2009) SRI Consulting Coal to gasoline. Process Eco¬ 
nomics Program report 271, SRI Consulting, Menlo Park. Data 
for the figure were obtained from reference 58 

58. Donnelly TJ, Satterfield CN (1989) Product distributions of 
the Fischer-Tropsch synthesis on precipitated iron catalysts. 
Appl Catal 52(1 ):93-114 

59. Dry M (2004) Chemical concepts used for engineering pur¬ 
poses. In: Steynberg A, Dry M (eds) Studies in surface science 
and catalysis 152: Fischer-Tropsch technology. Elsevier, 
Amsterdam, pp 196-257 

60. Jager B, Kelfkens RC, Steynberg AP (1994) A slurry bed reactor 
for low temperature Fischer-Tropsch. In: Curry-Hyde HE, 
Howe RF (eds) Natural gas conversion II. Elsevier, Amsterdam, 
pp 419-425 

61. Adeyiga AA, Bukur DB, Carreto-Vazquez V, Ma W, Nowicki L 
(2004) Attrition resistance and catalytic performance of spray- 
dried iron Fischer-Tropsch catalysts in a stirred-tank slurry 
reactor. Ind Eng Chem Res 43(6):1359-1365 

62. Bai L, Chang J, Hao Q, Li Y, Liu F, Wang H, Xiang H, Xu B, Yi F, 
Zhang C (2007) Effect of reduction temperature on a spray- 
dried iron-based catalyst for slurry Fischer-Tropsch synthesis. 
J Mol Catal A: Chem 261 (1 ):104-111 

63. Bartholomew C, Bukur DB, Datye AK, Nowicki L, Pham HN, 
Xu J (2003) Attrition resistance of supports for iron Fischer- 
Tropsch catalysts. Ind Eng Chem Res 42(17):4001-4008 

64. Zhao R, Goodwin JG, Jothimurugesan K, Gangwal K, Spivey JJ 
(2001) Spray-dried iron Fischer-Tropsch catalysts.1. Effect of 
structure on the attrition resistance of the catalysts in the 
calcined state. Ind Eng Chem Res 40(4):1065-1075 


65. Demirel B, Bohn MS, Benham CB, Siebarth JE, Ibsen MD 
(2005) Method and apparatus for regenerating an iron- 
based Fischer-Tropsch catalyst. US 6838487 to Rentech, 
4 January 2005 

66. Benham CB, Bohn MS, Yakobson DL (1996) Process for 
the production of hydrocarbons. US 5504118 to Rentech, 
2 April 1996 

67. van der Merwe W (2010) Conversion of spent solid phosphoric 
acid catalyst to environmentally friendly fertilizer. Environ Sci 
Technol 44(5):1806-1812 

68. AsiaPulse News (2009) SHENHUA NINGXIA COAL, SASOL'S 
COAL LIQUEFACTION PJT BEGINS 2010. Yinchuan, 22 June 
2009 

69. Peng X, Toseland B, Underwood T (1997) A novel mechanism 
of catalyst deactivation in liquid phase synthesis gas-to-DME 
reactions. In: Bartholomew C, Fuentes GH (eds) Catalyst 
deactivation. Elsevier, Amsterdam 

70. Gogate M, Lee S (1991) A single- stage, liquid-phase dimethyl 
ether synthesis process from syngas. ! Dual catalytic activity 
and process feasibility. Fuel Sci Technol Int 9(6):653-679 

71. Cornthwaite D (1972) British patent 1296212 to Imperial 
Chemicals Inc. (ICI) 

72. Espino RL, Plezke TS (1975) Methanol production. US 3888896 
to ChemSystems, 10 June 1975 

73. Sherwin M, Blum D (1979) Liquid-phase methanol. Final 
report. Electric Power Research Institute, EPRI-AF-1291 

74. Lee S, Sardesai A (2005) Liquid phase methanol and dimethyl 
ether synthesis from syngas. Top Catal 32(3-4):197-207 

75. Air Products Liquid Phase Conversion Company (2004) 
Commercial-scale demonstration of the liquid-phase methanol 
(LPMEOH) process. DOE/FE-0470, US Department of Energy 

76. Joo O, Jung K, Han S (2002) Modification of H-ZSM-5 and 
gamma-alumina with formaldehyde and its application to 
the synthesis of dimethyl ether from syn-gas. Bull Korean 
Chem Soc 23:1103-1105 

77. Lee S, Parameswaran VR, Wender I, Kulik CJ (1989) Roles of 
carbon dioxide in methanol synthesis. Fuel Sci Technol Int 
7(8):1021 -1058 

78. Cai Y, Davies S, Wagner J (2003) Water gas shift catalyst. 
US 6627572 to Sud-Chemie, 30 September 2003 

79. Kim J, Park MJ, Kim SJ, Joo O, Jung K (2004) DME synthesis 
from synthesis gas on the admixed catalysts of Cu/Zn0/Al 2 0 3 
and ZSM-5. Appl Catal, A 264(1 ):37-41 

80. Tijm PJ (2003) Development of alternative fuels and chemicals 
from synthesis gas. DOE contract number FC22-95PC93052, 
final report 

81. NEDO (2006) Clean coal technology in Japan. 4A4. Dimethyl 
ether production technology (DME), pp 63-64. www.nedo. 
go.jp/kankobutsu/pamphlets/sekitan/cct2006e.pdf. Accessed 
10 April 2010 

82. Hayashi H, Todoroki A, Yasuto A, Ohno Y (2002) NKK DME 
diesel vehicle development and fleet test in Japan. In: Inter¬ 
national symposium on alcohol fuels, Sao-Paulo, November 
12-15 


C 




2258 


Coastal Ecosystems, Sustainable Management 



83. Ohno Y, Tanishima S, Aoki S (2005) Coal conversion into 
dimethyl ether as an innovative clean fuel. In: International 
conference on coal science and technology, ICCST, Okinawa, 
Japan, October 09-14 

84. Mogi Y, Ohno Y, Ogawa T, Inoue N, Shikada T (2000) Develop¬ 
ment of slurry phase dimethyl ether synthesis technology. 
In: Pittsburgh coal conference, Pittsburgh, September 11-14, 
pp 398-408 

85. Ohno Y, Yagi H, Inoue N, Okuyama K, Aoki S (2007) 
Slurry phase DME direct synthesis technology - 100 tons/day 
demonstration plant operation and scale up study. In: 
Noronha FB, Schmal M, Sousa-Aguiar EF (eds) Studies in sur¬ 
face science and catalysis: natural gas conversion VII. Elsevier, 
Amsterdam 

86. Liu Z (2005) Clean coal technology: direct and indirect coal- 
to-liquid technologies. InterAcademy Council. http://www. 
interacademycouncil.net/Object.File/Master/10/335/CleanCoal 
technology_ coal liquefaction.pdf. Accessed 24 April 2010 

87. Probstein RF, Flicks RE (2006) Synthetic fuels. Dover, Mineola 

88. Gray D, Salerno S, Tomlinson G, Marano JJ (2004) 
Polygeneration of SNG, hydrogen, power, and carbon dioxide 
from Texas lignite. Mitretek technical report for the DOE, MTR- 
04-2004-18 

89. Camm F, Bartis JT, Bushman CJ (2008) Federal financial incen¬ 
tives to induce early experience producing unconventional 
liquid fuels. Report prepared for the United States Air Force 
and the National Energy Technology Laboratory of the DOE. 
http://www.rand.org/pubs/technical_reports/TR586/, RAND 
corporation. Accessed 10 April 2010 

90. Kohler LPFD, Du Plessis GH, Du Toit FJ, Koper EL, Phillips TD, 
Van Der Walt J (2006) Method of purifying Fischer-Tropsch 
derived water. US 7153432 to Sasol Technology, 26 December 
2006 

91. US Department of Energy (2006) Emerging issues for fossil 
energy and water. DOE/NETL-2006/1233 

92. Boardman R (2007) Gasification and water nexus. Presented 
March 14, 2007 at the GTC workshop on gasification technol¬ 
ogies, Denver 

93. Wei W (2003) Current issues of China's coal industry: the case 
of Shanxi. In: Coate B, Brooks R, Fraser I, Xu L (eds) Proceedings 
of the 15th annual conference of the association for Chinese 
economics studies Australia (ACESA), RMIT (Royal Melbourne 
Institute of Technology) Business Research Development Unit, 
Melbourne 

Books and Reviews 

Dry ME (2001) High quality diesel via the Fischer-Tropsch process - 
a review. J Chem Technol Biotechnol 77(1):43-50 

Egan CJ (1976) Method of power generation via coal gasification 
and liquid hydrocarbon synthesis. US 3986349 to Chevron 
Research Company, 19 October 1976 

Furstner AE (1995) Active metals: preparation characterization 
applications. In: Furstner A (ed) Supported metals. Wiley, 
New York 


Iglesia E, Reyes S, Madon RS (1993) Selectivity control and catalyst 
design in the Fischer-Tropsch synthesis: sites, pellets, and 
reactors. Adv Catal 39:221-302 

Iglesia E, Reyes S, Soled S (1993) Reaction-transport selectivity 
models and the design of Fischer-Tropsch catalysts. In: Becker 
ER, Pereira CJ (eds) Computer-aided design of catalysts and 
reactors. Marcel Dekker, New York, p 640 

Jin Y, Wang J, Wang T (2007) Slurry reactors for gas-to- 
liquid processes: a review. Ind Eng Chem Res 46(18): 
5824-5847 

O'Brien R, Xu L, Bao S, Raje A, Davis B (2000) Activity, selectivity and 
attrition characteristics of supported iron Fischer-Tropsch cat¬ 
alysts. Appl Catal, A 196(2):173-178 

Rahmin II (2008) GTL, CTL finding roles in global energy supply. 
Oil Gas J 24:22-31 

Sloane D (2008) Clean coal technologies accelerating commercial 
and policy drivers for deployment. International Energy 
Agency/Coal Industry Advisory Board, p 70 

Tijm PA, Waller F, Brown D (2001) Methanol technology devel¬ 
opments for the new millennium. Appl Catal, A 22(1-2): 
275-282 

Williams A, Pourkashanian M, Jones JM (2001) Combustion of 
pulverized coal and biomass. Prog Energy Combust Sci 
27:587-610 


Coastal Ecosystems, Sustainable 
Management 

Paul A. Sandifer 1 , Tracy K. Collier 2 , Juli M. Trtanj 2 
National Oceanic and Atmospheric Administration, 
National Ocean Service, c/o Hollings Marine 
Laboratory, Charleston, SC, USA 
2 National Oceanic and Atmospheric Administration, 
National Ocean Service, Oceans and Human Health 
Initiative, Silver Spring, MD, USA 

Article Outline 

Glossary 

Definition of the Subject and Its Importance 
Introduction 

History and Evolution of Oceans and Human Health 
Programs 

The “One Health” Paradigm 

Ocean Health Early Warning Systems (OHEWS) 

Response to Emergency Situations 

Future Directions 

Bibliography 




Coastal Ecosystems, Sustainable Management 


C 


2259 


Glossary 

Antibiotic resistance Situation where microorgan¬ 
isms, particularly bacteria, are able to survive expo¬ 
sure to antibiotics. Of major concern when resistant 
microorganisms are the causative agents of signifi¬ 
cant human or animal diseases. 

Coastal and marine spatial planning “A comprehen¬ 
sive, adaptive, integrated, ecosystem-based, and 
transparent spatial planning process, based on 
sound science, for analyzing current and anticipated 
uses of ocean, coastal, and Great Lakes areas” 
(Interagency Ocean Policy Task Force (IOPFT) 
(2010) Final recommendations of the interagency 
ocean policy task force. The White House Council 
on Environmental Quality, Washington, DC, 96 pp). 
Ecological forecasting Combining environmental 
and biological information to predict effects that 
biological, chemical, physical, and human-induced 
changes will have on ecosystems in general or more 
typically specific ecological components (e.g., 
harmful algae, fishery populations, jellyfish). 
Epidemiological Pertaining to the study of the inci¬ 
dence, distribution, causes, and prevention of 
diseases. 

Harmful algal blooms Harmful algal blooms (HABs) 
are proliferations (blooms) of microscopic algae 
that produce toxins that may cause serious illness 
or death in both animals and humans. HABs are 
caused by numerous species of algae and may occur 
in salt, brackish, and fresh waters. 

One health The paradigm that links environmental, 
human, and animal health. 

Pathogens Microorganisms that cause infectious 
disease. 

Zoonotic Referring to diseases of animals that may be 
transmitted to and cause illness in humans. 

Definition of the Subject and Its Importance 

Oceans and Human Health is a “meta-discipline” com¬ 
prising elements of oceanography, marine biology and 
ecology, biomedical science, environmental health 
science, medicine, and public health. It focuses princi¬ 
pally on water- and food-borne causes of human and 
animal illness associated with exposure to ocean, 
coastal and/or Great Lakes waters, habitats, and organ¬ 
isms. It seeks to integrate information across many 


disciplines to achieve a fuller understanding of health 
risks and benefits associated with these environments 
and how they can be managed to protect or enhance 
public health. 

Introduction 

That the health of the oceans and humans are tightly 
and inextricably connected has been recognized by 
some scientists for many years [1-3]. However, only 
in the last few decades have these connections - and the 
implications of human health effects of degraded 
coastal and ocean environments - come clearly into 
focus as an important emerging arena for broad 
scientific investigation and policy development. The 
developing “meta-discipline” known as Oceans and 
Human Health (OHH) is a direct manifestation of 
that clearer focus. 

Degradation of ocean and coastal ecosystems 
and the crucial ecosystem services they provide to 
humans have negatively impacted tourism; treasured 
aesthetic and cultural attributes, coastal properties 
and their values; and have resulted in increasingly 
evident and serious health threats to humans from 
harmful algal blooms (HABs) and their toxins, chem¬ 
ical pollution, and infectious diseases [2, 4, 5]. Dealing 
with these complex and interwoven issues to support 
resilient and sustainable coastal ecosystems and com¬ 
munities requires dedicated, integrated, interdisciplin¬ 
ary, and multi-institutional approaches. OHH brings 
together not only oceanographers and marine biolo¬ 
gists but also physicians, public health professionals, 
social scientists, coastal managers, planners, and others 
to work collaboratively on the human health benefits 
and threats that emanate from ocean, coastal, and 
Great Lakes environments [2]. 

History and Evolution of Oceans and Human 
Health Programs 

Named OHH programs have focused predominantly 
on the deleterious environmental and human health 
effects of harmful algal blooms and their toxins; 
sewage- and contamination-mediated, naturally occur¬ 
ring, and zoonotic infectious diseases; and chemical 
contamination in ocean, coastal, and Great Lakes 
waters. These programs also target beneficial products 
such as new drugs to combat various diseases and pain, 
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high-quality seafood with its cardiovascular protective 
and other healthful qualities, and use of marine organ¬ 
isms and habitats as tools to study disease processes or 
as sentinels of likely human health risks. In addition to 
direct focus on the root causes, effects, and ecological 
processes involved with each of these, OHH programs 
are utilizing a variety of tools such as warning systems, 
sensors, cutting edge “omics” and other scientific 
methods, and sentinel species and habitats to address 
ocean-related health risks and benefits. As a result of 
the activities of OHH programs, a new interdisciplin¬ 
ary community of science and practice is developing to 
concentrate on potential ocean health risks and benefits 
to humans. 

It is fairly easy to identify when named programs in 
OHH began (e.g., see [6-8]), but it is much more 
difficult to unravel the evolutionary history and deter¬ 
mine when the collaborations that eventually culmi¬ 
nated in these programs really got underway (e.g., 
see [9]). Nonetheless, it is clear that a variety of 
programs played pivotal roles in this evolution. These 
include but are not limited to such activities as the 
National Institute of Environmental Health Sciences’ 
(NIEHS) Marine and Freshwater Biomedical Sciences 
Centers program initiated in 1978 that focused on 
seafood safety, harmful algal blooms, and toxicology 
among other things; the National Institute of 
Health’s (NIH) support of work to grow aquatic organ¬ 
isms for use in research [10]; the National Oceanic 
and Atmospheric Administration’s (NOAA) studies in 
seafood safety including effects of HAB toxins; the 
passage of the initial Harmful Algal Bloom Research 
and Control Act (HABHRCA) in 1998 and resulting 
further development of HAB programs in NOAA 
and elsewhere; and the pioneering work by Rita 
Colwell [11] and subsequently by her, her associates, 
and others on links between cholera outbreaks 
and climate. 

In addition to the nascent efforts at the federal level, 
beginning in the late 1980s and through the 1990s, 
a team initially composed of researchers from NOAA 
and the South Carolina Department of Natural 
Resources and later expanded to include other partners 
(the College of Charleston, Medical University of 
South Carolina, and the National Institute of Stan¬ 
dards and Technology) developed plans for a new 
partnership laboratory, initially known as the Marine 


Environmental Health Research Laboratory, to be 
located in Charleston, SC. By 1994, the planning team 
had zeroed in on an initial mission statement for the 
new facility, today known as NOAA’s Hollings Marine 
Laboratory (HML): “[it] will be directed specifically 
toward achieving a better understanding of the health 
of marine organisms, populations, and habitats and 
their relationship to human health” [12, 13]. Today, 
the HML, also designated as NOAA’s Coastal Center for 
Human Health Risk, conducts research to understand 
and forecast relationships between coastal ocean eco¬ 
systems and human health and communicates findings 
to ocean, coastal, and public health officials and inter¬ 
ested parties. 

At about the same time as formal planning got 
underway for the HML, the first National Harmful 
Algal Bloom research plan was prepared in response 
to growing concerns about the occurrence and impacts 
of such blooms [ 14] . This plan guided US HAB-related 
research for well over a decade and led to establishment 
of an informal national network of HAB researchers 
in the USA. Then, in 1998, the US Congress passed 
the HABHRCA, which authorized an interagency 
task force, required HAB and hypoxia assessments, 
authorized funding for research in these areas, and 
established programs that became Ecology and Ocean¬ 
ography of Harmful Algal Blooms (ECOHAB), Moni¬ 
toring and Event Response for Harmful Algal Blooms 
(MERHAB), and the Coastal Hypoxia Research Pro¬ 
gram (CHRP). 

More recently, HARNESS (the Harmful Algae 
Research and Response National Environmental Sci¬ 
ence Strategy, 2005-2015) [15] was completed to con¬ 
tinue to provide broad guidance for HAB-related 
research in the USA. This major report focused on 
four broad thematic areas of inquiry: bloom ecology 
and dynamics, toxins and their effects, food webs 
and fisheries, and public health and socioeconomic 
impacts. A part of the HARNESS effort included for¬ 
mal establishment of a National HAB Committee 
designed to facilitate implementation of the plan and 
coordination among the numerous interested agencies, 
academic institutions, other entities, and individuals, 
along with international cooperation. Because of the 
increasing frequency, duration, intensity, geographical 
extent, and number of species and toxins involved [2, 
16-18] , HAB research has remained one of the primary 
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foci of oceans and human health programs, both 
nationally and internationally. 

Broader recognition of the multiple roles the ocean 
(and its coastal environments) may play in human 
health and well-being followed US sponsorship of an 
OHH-themed pavilion at the 1998 World’s Fair in 
Lisbon, Portugal, and, as noted above, 1998 also saw 
passage of the HABHRCA. Shortly thereafter, the 
National Research Council of the National Academies 
of Science published the highly influential report 
“From Monsoons to Microbes: Understanding the 
Ocean’s Role in Human Health” [19] which focused 
substantial attention on the myriad relationships 
between ocean and coastal environments and human 
health and well-being. This report was quickly followed 
by other NRC reports that further helped shape the 
OHH landscape, including “Under the Weather: Cli¬ 
mate, Ecology, and Infectious Disease” [20] and 
“Marine Biotechnology in the Twenty-First Century: 
Problems, Promise, and Products” [21]. 

Following quickly on the heels of these seminal 
publications, the National Science Foundation (NSF) 
and NIEHS joined forces to develop a collaborative 
OHH program. Their combined effort was initiated 
through a scientific community workshop held in 
December 2001, which was followed by a joint Request 
for Proposals in November 2002 for creation of aca¬ 
demic centers of excellence in OHH [6, 8]. NOAA’s 
named OHH program got underway shortly thereafter, 
with initial Congressionally directed funding appropri¬ 
ated in 2003 for the agency to establish an Ocean and 
Human Health Initiative (OHHI) to “coordinate 
and focus agency activities on critical areas of concern 
and identify critical gaps in coverage, and ... to be used 
for critical research and projects aimed at closing iden¬ 
tified gaps.” The Congressional language directed 
NOAA to work with NSF and other federal agencies 
and establish NOAA Centers of Excellence in OHH, 
a national advisory panel, and competitive external 
grant, distinguished scholar, and traineeship programs. 
Like NSF and NIEHS, NOAA utilized a scientific work¬ 
shop, held in October 2003, as a mechanism to solicit 
external as well as internal input for development of its 
OHH Initiative [7]. These actions led to creation of 
a national network of competitively funded OHH 
Centers of Excellence, with four awarded in the 
academic sector in May 2004 by NSF and NIEHS 


(at the Woods Hole Oceanographic Institution, Uni¬ 
versity of Miami, University of Washington, University 
of Hawaii) and three selected in NOAA a few months 
later (at the Great Lakes Environmental Research 
Laboratory, the Hollings Marine Laboratory, and the 
Northwest Fisheries Science Center). 

In September of 2004, the US Commission on 
Ocean Policy released its final report [22] which 
included a full chapter on OHH and recommended 
the establishment of a national research effort in this 
area. This report had a major and positive impact on 
the continued development of OHH programs in the 
USA. Shortly after its release, the Congress passed the 
Oceans and Human Health Act (OHHA) of 2004 to 
“.. .improve understanding of the role of the oceans in 
human health.” The OHHA established the national, 
interagency OHH program in law and formally autho¬ 
rized the Secretary of Commerce to create the OHHI 
within NOAA, with its component parts including 
internal centers of excellence, expert advisory panel, 
support for grants, distinguished scholars, traineeships 
in the external scientific community, and requirements 
to collaborate with other agencies. The USCOP report 
also resulted in action by the George W. Bush 
Administration to establish a cabinet-level National 
Ocean Committee, with subordinate committees and 
interagency working groups [23] . One of the high-level 
interagency bodies was the Joint Subcommittee on 
Ocean Science and Technology (JSOST) which was 
established by the National Science and Technology 
Council’s Committee on the Environment and Natural 
Resources and Interagency Committee on Ocean Sci¬ 
ence and Resource Management Integration. The 
JSOST produced the first interagency ocean research 
plan, “Charting the Course for Ocean Science in the 
United States for the Next Decade: An Ocean Research 
Priorities Plan and Implementation Strategy” [24], 
which was organized around six primary themes, 
including one on “Enhancing Human Health.” This 
was a major step forward for OHH programs because 
it was the first time that all the federal agencies involved 
in any kinds of ocean-related research agreed on the 
importance of including human as well as animal 
health-oriented studies in a comprehensive ocean 
research portfolio. 

The JSOST also formed a number of interagency 
working groups under its umbrella. One of these, the 
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Interagency Working Group on Harmful Algal Blooms, 
Hypoxia, and Human Health (IWG-4H), was chartered 
in fall 2005 with a principal focus on responding to 
the Congressional requirements of the HABHRCA 
Amendments of 2004 and the OHHA of 2004. Working 
under the general oversight of the JSOST, the IWG-4H 
produced a series of major reports related to oceans 
and human health, including the first ever Interagency 
Oceans and Human Health Research Implementation 
Plan [2] which expanded upon the “Enhancing 
Human Health” theme of the JSOST’s [24] overarching 
ocean research plan. The purpose of this OHH 
Research Plan was to define “the goals and priorities 
for Federal research which most effectively advance 
scientific understanding of the connections between 
the oceans and human health, provide usable informa¬ 
tion for the prediction of marine-related public health 
problems, and use the biological potential of the oceans 
for development of new treatments of human diseases 
and a greater understanding of human biology” [2]. In 
addition to the OHH plan, the IWG-4H produced the 
National Assessment of Efforts to Predict and Respond to 
Harmful Algal Blooms in U.S. Waters [25], Scientific 
Assessment of Freshwater Harmful Algal Blooms [26], 
Harmful Algal Bloom Management and Response: 
Assessment and Plan [27], Scientific Assessment of 
Marine Harmful Algal Blooms [28], and Scientific 
Assessment of Hypoxia in U.S. Coastal Waters [29], all 
of which are important resources for OHH research 
activities. The IWG-4H is continuing to focus on 
interagency coordination and collaboration among 
a variety of agencies involved in one or more aspects 
of OHH work, including the Centers for Disease Con¬ 
trol and Prevention (CDC), Environmental Protection 
Agency (EPA), Food and Drug Administration (FDA), 
Marine Mammal Commission (MMC), National 
Aeronautics and Space Administration (NASA), US 
Department of Agriculture (USDA), and US Geological 
Survey (USGS), in addition to the NIEHS, NSF, 
and NOAA. 

The years 2007-2008 marked other important 
developments for OHH as a recognized meta¬ 
discipline. NOAA inaugurated its OHH Traineeship 
Program, with initial awards to five academic consortia 
specifically focused on providing support for the inter¬ 
disciplinary education and training of doctoral and 
post-doctoral scholars in OHH, as part of a long-term 


strategy to build a new cadre of OHH scientists. Most if 
not all of the NSF-NIEHS-funded OHH research pro¬ 
jects and many of those supported by NOAA, whether 
internally or externally, also supported students, but 
the NOAA traineeships have the education and devel¬ 
opment of the next generation of OHH scientists as the 
principal focus. Also in 2008, the first textbook devoted 
specifically to support teaching in the interdisciplinary 
oceans and human health arena was published [9]. 
Several academic institutions had already begun offer¬ 
ing formal OHH courses, but for the first time there 
was a strong textbook to support them, leading to 
further development, refinement, and breadth of 
OHH-related course offerings. In addition, the first 
Gordon Research Conference (GRC) and Graduate 
Research Seminar (GRS) on OHH was held in 2008, 
and these have followed every 2 years since, with the 
next GRC/GRS scheduled for summer 2012. Accep¬ 
tance of OHH as a recurring theme for Gordon 
Research Conferences marked a broad level of recogni¬ 
tion of OHH as a scientific area of ongoing significance, 
major community interest, and likely important 
discoveries. 

During 2010-2011, the JSOST (now renamed Sub¬ 
committee on Ocean Science and Technology or SOST) 
initiated an update of the “Charting the Course” Ocean 
Research Priorities Plan [24]. The new version, 
expected to be released in 2011 and entitled “Science 
for an Ocean Nation: An Update of the Ocean Research 
Priorities Plan,” [30] is expected to include updates of 
the previous plans OHH-oriented theme, “Enhancing 
Human Health,” demonstrating the continuing signif¬ 
icance placed on this arena of interdisciplinary and 
interagency ocean research by US government agencies. 

Following a workshop on climate change sponsored 
by the Institute of Medicine s (National Academies of 
Science) Roundtable on Environmental Health Sci¬ 
ences, Research and, Medicine in January 2009, senior 
representatives from four federal agencies (NIEHS, 
EPA, NOAA, and CDC) formed an ad hoc Interagency 
Working Group on Climate Change and Health 
(IWGCCH) to prepare a white paper to “to identify 
research needs for all aspects of the research-to- 
decision making pathway that will help us understand 
and mitigate the health effects of climate change, as 
well as ensure that we choose the healthiest and most 
efficient approaches to climate change adaptation.” 
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This effort resulted in the first comprehensive evalua¬ 
tion of research needs related to likely human health 
effects of climate change, including ocean-associated 
health risks and benefits [31]. An additional outcome 
of the work of the ad hoc IWGCCH was the establish¬ 
ment by the US Global Climate Change Research Pro¬ 
gram of a formal Interagency Working Group on 
Climate Change and Human Health (IWGCCHH), 
which includes strong representation of OHH pro¬ 
grams and issues. 

On June 12, 2009, US President Obama issued 
a memorandum establishing an Interagency Ocean 
Policy Task Force (IOPTF) led by the Chair of the 
White House Council on Environmental Quality 
(CEQ) and comprised of senior officials representing 
24 executive departments, agencies, and other offices 
across the Federal government. President Obama 
charged the IOPTF with responsibility for developing 
recommendations to maintain healthy and resilient 
ocean, coastal, and Great Lakes ecosystems, resources, 
and uses. On July 19, 2010, President Obama issued 
Executive Order 13547 [32] that established the US 
National Policy for the Stewardship of the Ocean, 
Our Coasts, and the Great Lakes (National Ocean 
Policy) including the Final Recommendations of the 
Interagency Ocean Policy Task Force [33]. The Final 
Recommendations include 17 mentions of “human 
health” and specifically reference issues dealing with 
harmful algal blooms, pathogens , disease, and chemical 
pollution, all primary OHH issues. Most significantly, 
in its first paragraph the President’s Executive Order 
includes human health among the central foci of the 
National Ocean Policy, stating that “America’s steward¬ 
ship of the ocean, our coasts, and the Great Lakes is 
intrinsically linked to environmental sustainability, 
human health and well-being [emphasis added], 
national prosperity, adaptation to climate and other 
environmental changes, social justice, international 
diplomacy, and national and homeland security.” 

The "One Health" Paradigm 

There is a growing worldwide effort to link scientists 
and practitioners working on human, animal, and 
environmental health into a single, health-oriented 
movement aptly named “One Health” [34]. This para¬ 
digm recognizes the myriad and important linkages 


between the condition of the environment and the 
health of humans and animals and the numerous 
shared diseases and disease processes in humans and 
animals. Approximately 61% of known human patho¬ 
gens are zoonotic, meaning they were originally trans¬ 
mitted from animals to humans, and approximately 
75% of emerging infectious diseases are zoonotics 
[35]. In addition, many diseases are exacerbated by 
stressful environmental conditions. Thus, the case for 
connecting human, animal, and environmental health 
is clear. Co-equal collaborations among environmental 
and biomedical scientists, medical doctors, veterinar¬ 
ians, and public health workers are expected to lead 
to rapidly expanding knowledge, new health care 
approaches and treatments, and synergisms not yet 
even contemplated. 

Considering that the oceans harbor an incredible 
range of life forms, including representatives of 34 of 
the 36 phyla of animal life found on Earth compared to 
17 occurring on land, there is a high likelihood for 
increasing interactions leading to transfer of both 
known and new diseases from marine organisms to 
humans and from human and domesticated animals 
to marine organisms. There are numerous instances of 
zoonotic diseases being detected in marine mammals 
and seabirds, including avian influenza, and several 
required reportable pathogens ( Brucella , Salmonella, 
Leptospirosis ) (e.g., see [36]). Broad occurrence of anti¬ 
biotic-resistant bacteria is also being observed in 
marine organisms, raising additional health risks 
related to ocean exposures [36-38]. At the same time, 
because of the high but relatively poorly known biodi¬ 
versity of much of the ocean [39], there is significant 
likelihood of discovery of many new drugs and other 
natural products to treat human diseases [40-43]. 
There is also a growing body of evidence that connects 
biodiversity and human health and that implicates 
declining biodiversity as a factor exacerbating health 
threats, although many uncertainties remain [44, 45]. 
Some marine organisms have been used for many years 
to study human diseases, and this area is likely to 
increase substantially in the future [2]. And as pointed 
out in a later chapter, numerous marine organisms and 
habitats make excellent sentinels or indicators of 
potential health threats to humans (e.g., [46-49]). 

Degradation of ocean and coastal environments is 
known to lead to increasing health threats to humans, 


c 




2264 


Coastal Ecosystems, Sustainable Management 



including those from infectious diseases as well as from 
chemical contaminants and algal toxins. The OHH 
meta-discipline is a clear example of a health arena 
where the One Health approach is particularly appli¬ 
cable (Fig. 1), and application of this interdisciplinary 
approach will become even more important in 
unraveling and addressing the expected effects of cli¬ 
mate change and variability on humans and marine 
organisms [50-54]. In the ocean, it is anticipated that 
climate change may result in increased prevalence, dis¬ 
tribution, and virulence of both known as well as 
emerging zoonotic, vector-, water-, and seafood- 
borne pathogens; increased distribution, frequency, 
duration, and intensity of harmful algal blooms 
[54]; and increased and/or changed mobilization, 
bioaccumulation, and transfer of chemical contami¬ 
nants. As the Ocean Heath Working Group of 
NOAA’s Science Advisory Board recently stated: “The 
concept of the combined health of people and wildlife - 
one health - when combined with the concept of 
the ocean as an essential modulator/caregiver of all 
life on Earth, inexorably leads to the prime tenet 
that the health of the £ one ocean is essential to the 
£ one health’ of all life on earth, including that of 
humans” [55]. 



Coastal Ecosystems, Sustainable Management. Figure 1 

"One Health" paradigm illustrating areas of intersection 
among environmental health, veterinary, and medical 
sciences 


Ocean Health Early Warning Systems (OHEWS) 

A resilient and secure coastal community depends on 
a vibrant economy, healthy coastal ecosystems, and 
healthy people. The security and resilience of ocean 
and coastal communities are increasingly threatened 
by health consequences related to natural hazards, 
changes in climate and land use, anthropogenic con¬ 
taminants, and naturally occurring and introduced 
pathogens. In general, ocean health threats to humans 
and marine animals appear to be increasing [1, 2], 
resulting in economic losses, illnesses, and a loss of 
public confidence in the healthful condition and safety 
of coastal environments and resources, including sea¬ 
food. For example, news about a contaminated prod¬ 
uct, such as oysters, can shut down a fishery that will 
affect local commerce, but such an event may also affect 
consumers’ behavior toward the resource and the use of 
the environment. The information is rapidly picked up 
by news media, with the result that consumers - even 
those far away from the original incident - may be 
reluctant to eat seafood or go to the beach. An excellent 
example is the recent resistance to consumption of Gulf 
of Mexico seafood even after it has been deemed safe by 
comprehensive government testing for months and 
over a year after the Deep Water Horizon oil spill (see 
for example [56]). In addition, researchers are finding 
growing evidence of antibiotic resistance in marine 
organisms and the presence of MRSA (methicillin- 
resistant Staphylococcus aureus) and other human 
diseases in coastal animals and waters [36, 38, 57]. 
Understanding interactions among these environmen¬ 
tal stressors is crucial to ensure a safe supply of food 
from the sea and to protect coastal-associated com¬ 
merce (e.g., fisheries, aquaculture, tourism) (Fig. 2). 

One important concept under development to 
improve coastal security and resilience and to protect 
the public at larger from increasing exposures to ocean- 
related health risks is an “Ocean Health Early Warning 
System” or OHEWS. Similar early warning systems or 
forecasts are routinely used for hazardous natural 
events such as hurricanes, tsunamis, floods, and vol¬ 
cano eruptions. The objective of an OHEWS is to 
reduce or mitigate threats to marine animal and 
human health from the oceans through development 
of a comprehensive oceans and health assessment 
and forecasting capability that combines new and 
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Coastal Ecosystems, Sustainable Management. Figure 2 

Schematic diagram of major components of, and information flows in, a hypothetical ocean health early warning system 
(modified and used with permission from Trtanj JM, Davis JR (2002) Climate variability information as an adaptation 
tool: the case of public health. Presentation at Energy Modeling Forum Workshop on Adapting to climate change and 
variability. Snowmass Conference Series: Climate Impacts and Integrated Assessment (CCI-IA) (Stanford University), 
Snowmass Colorado) 


developing diagnostic and detection capabilities, sur¬ 
veillance and monitoring data, sentinel species and 
habitats, and targeted outreach to public health offi¬ 
cials and the public. While this effort is relatively new, it 
is building on extensive research that connects envi¬ 
ronmental signatures with a variety of health concerns 
(e.g., [11, 58-65]) and a growing understanding of the 
value of sentinel species and habitats as indicators of 
potential health threats to humans [1,46-49] . Based on 
a broad base of previous research, development is well 
underway for forecasting capability for harmful algal 
bloom events [66, 67] . A few examples include NOAA’s 
Harmful Algal Bloom Operational Forecast System 
[68], the Harmful Algae websites maintained by the 
Woods Hole Oceanographic Institution [18] and the 
Great Lakes Environmental Research Laboratory [69], 
the Vibrio Remote Sensing Report [70], and Mapping 
Pathogens in the Chesapeake Bay [71]. The goal of 
these nascent warning systems is to protect human 
life, health, and property and provide decision¬ 
makers with as much advance notice as possible 
about the likelihood of a threatening event in a partic¬ 
ular location, thus widening the range of feasible 
response options. 


Ongoing surveillance using meaningful indicators 
of ecosystem health, such as the examples mentioned 
above, and development of other ecological forecasting 
tools, followed by assimilation of the indicators into an 
index or suite of indices that could be integrated into an 
overall OHEWS could provide early warning of envi¬ 
ronmental changes likely to have measurable impacts 
on human health and well-being with sufficient lead 
time to take corrective actions. In addition to providing 
warnings in time to treat symptoms, an OHEWS may 
suggest testable hypotheses that, in turn, may lead to 
a much deeper understanding of cause and effect rela¬ 
tionships and opportunities to mitigate, minimize, 
control, or prevent environmental degradation and 
the related threats to human, animal, and environmen¬ 
tal health. 

Response to Emergency Situations 

Federal, state, and local agencies regularly respond 
to natural and human-caused environmental emergen¬ 
cies and disasters, such as hurricanes, tornadoes, 
floods, tsunamis, oil and chemical spills, etc. Such inci¬ 
dents may involve increased likelihoods of water- or 
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food-borne health threats, including infectious dis¬ 
ease and chemical contamination. On occasion OHH 
researchers can enhance the capacities of emergency 
responders by providing additional scientific expertise 
and laboratory capabilities to address the large volumes 
of samples that require processing and data that needs 
analysis and interpretation following environmental 
emergencies [2], leading to better awareness and com¬ 
munication of factual information. For example, fol¬ 
lowing Hurricane Katrina in late summer of 2005 and 
the tragic flooding of New Orleans, the dewatering 
of New Orleans created an unprecedented situation 
regarding public safety, including the perception of 
unsafe seafood. The floodwaters that were pumped 
into nearshore areas of the Gulf of Mexico (GoM) 
were known to contain oils, metals, and a wide range 
of pathogens and enteric bacteria, and there was also 
concern about synthetic chemical contaminants from 
flooded businesses and homes. OHH scientists were 
asked to mount an immediate response, both to collect 
samples of fish and shrimp from the potentially 
affected area as well as provide a wide range of chemical 
and microbiological analyses. Scientists from govern¬ 
ment and academia moved quickly into action. NOAA 
OHH scientists deployed within 1 week after the flood- 
water pumping began, and issued the first public report 
on seafood safety 3 weeks later [72]. Their experience 
and recommendations for ways to further improve 
emergency responses involving oceanic conditions 
were subsequently published as part of a series of 
papers on mitigating the effects of natural disasters on 
fishery ecosystems [73]. Similarly, rapid action by aca¬ 
demic OHH scientists led to significant improvements 
in our understanding of risks associated with infectious 
microbes following hurricane events [74, 75]. 

More recently, the Deepwater Horizon oil spill led 
to grave public concern regarding the safety of seafood 
from the GoM, and again lessons learned in the OHH 
science community were applied to a large multi¬ 
agency effort to assure seafood safety and protect 
public health. Several thousand seafood samples were 
analyzed for injurious components of oil, and after 
developing appropriate methods, for dispersants as 
well. The results of this massive effort showed that 
seafood in areas re-opened to harvest after oil was no 
longer apparent was indeed safe for human consump¬ 
tion [76, 77], although as noted previously, public 


confidence in the safety of GoM seafood was not 
restored as readily. 

Despite many significant spills over the past few 
decades, surprisingly little is known about the direct 
effects of oil spills on human health, such as on those 
people working in oil spill response and cleanup activ¬ 
ities [78-80] . Following the DWH release, the NIEHS is 
planning a study that will be the largest ever investiga¬ 
tion of effects of an oil spill on human health. The 
NIEHS intends to enroll approximately 55,000 
response and cleanup workers in a 10-year-long longi¬ 
tudinal study, with the goal of evaluating both physio¬ 
logical and psychological health that might be affected 
by the spill [81]. OHH science will assist in this study, 
for example, by assessing the health of marine mam¬ 
mals in the Gulf of Mexico, partly as sentinels for 
human health, as well as providing other information 
about likely human exposures. Finally, as the great 
Japanese earthquake and tsunami of March 11, 2011, 
have shown, unanticipated effects of disasters, such as 
the damages to the Fukushima nuclear reactors that 
resulted in discharges of radionuclides to coastal waters 
(e.g., see [82, 83]), will continue to bring OHH-related 
concerns forward into public consciousness as well as 
raise questions for policymakers and public health 
communities around the globe. Ongoing and sustained 
progress in OHH science should allow us to better 
address such concerns and better safeguard public 
health following future emergencies involving our 
oceans, coasts, and Great Lakes. 

Future Directions 

Principally because of resource limitations, many if not 
most of the opportunities for future research and 
development identified by Refs. [2, 24, 31, 84] remain 
incompletely addressed, though still valid. Some of the 
more pressing issues, that we believe are most likely to 
lead to significant advances in science and in protection 
of human and animal health, include the following: 

1. Continue fundamental research to address known 
and emerging human and animal pathogens, 
harmful algal blooms and their toxins, and 
impacts of chemical pollution in ocean, coastal, 
and Great Lakes environments, including occur¬ 
rence, source, transport, fate, physical factors 
and ecological processes affecting occurrence, 



Coastal Ecosystems, Sustainable Management 


C 


2267 


abundance, and virulence, and the development 
and use of models to provide reliable forecasts of 
likely exposures and risk of contracting disease. 

2. Determine how the prevalence and virulence of 
pathogens, HABs, and chemical contaminants 
may be affected by seasonal factors and climate 
change, including ocean acidification, tempera¬ 
ture, sea level rise, and other factors, and develop 
models and predictive capacity to forecast risks 
and effects of alternative management and adap¬ 
tation scenarios. 

3. Further develop robust capacity to monitor con¬ 
dition and health of sentinel species, habitats, and 
sites and to integrate the data derived from such 
monitoring with other environmental and human 
health information to provide additional founda¬ 
tion for risk characterization and assessment and 
development of early warning systems. 

4. Establish robust monitoring and investigative 
capacity for regular evaluation of seafood safety 
and to identify and characterize ongoing and 
emerging risks to humans and animals in time to 
take corrective or preventive action before seafood 
products reach markets and are consumed. 

5. Develop and implement sustained epidemiological 
investigations to characterize and quantify human 
health risks associated with pathogens, toxins, and 
chemical contaminants from ocean, coastal, and 
Great Lakes waters, and ensure that ocean observ¬ 
ing systems acquire related environmental data at 
the appropriate temporal and spatial scales for 
combination with epidemiological information 
to present a more complete picture of factors 
affecting human health. 

6. Conduct continuing research on the potential of 
ocean, coastal, and Great Lakes environments to 
provide new drugs and other beneficial products 
derived from marine biota and establish the nec¬ 
essary long-term mechanisms to support ocean 
product development over the long period 
between initial discovery and laboratory trials to 
commercialization. 

7. Develop, test, and deploy new and improved sen¬ 
sors capable of detecting and quantifying occur¬ 
rence of specific ocean-derived health threats 
(pathogens, HABs and their toxins, chemical con¬ 
taminants) and incorporate into ocean observing, 


seafood safety monitoring, and human and 
marine wildlife disease surveillance. 

8. Locus a sustained research effort on understand¬ 
ing the linkages between biodiversity and either 
reduced or increased human health risks in coastal 
and ocean environments, with particular focus on 
increasing knowledge of the health-related values 
of conservation of natural biodiversity. 

9. Acquire and integrate social, economic, and cul¬ 
tural information related to human health risks, 
impacts, and benefits associated with the sea and 
incorporate into OHH analyses, models, and 
forecasts. 

10. Develop and implement a variety of ocean health 
early warning systems, incorporating data from 
remote sensing; ocean observing systems; new 
and improved sensors designed specifically to 
detect marine ocean health threats, sentinel spe¬ 
cies, sites, and habitats; and socioeconomic and 
epidemiological studies with physical and ecolog¬ 
ical models to provide forecasts and evaluate man¬ 
agement alternatives. 

11. Integrate OHH results into Coastal and Marine 
Spatial Planning (CMSP) as described in the US 
National Ocean Policy [32, 33] to identify those 
geographic areas that should not be considered for 
public and recreational uses due to the likelihood 
of exposure to health threats and to monitor 
ongoing conditions in areas heavily used by the 
public to ensure public safety. 
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Glossary 

Ash Residue obtained after combustion of a fuel. 
Biomass From a scientific and technical point of view, 
biomass is defined as material of biological origin, 
excluding material embedded in geological forma¬ 
tions and/or transformed to fossil. 

Bioenergy Bioenergy is defined as energy from 
biomass. 

Co-combustion Simultaneous combustion of bio¬ 
mass and fossil fuels within the same combustion 
device. 

Combustion Full oxidation of a fuel. 

Solid Biofuel Solid fuel produced directly or indirectly 
from biomass. 

Thermochemical conversion Heat-induced conver¬ 
sion of biofuels into energy carrier and/or energy. 

Definition of the Subject 

Due to the introduction of the C0 2 emissions trading, 
the co-combustion of wood in coal-fired large-scale 
power plants gains more and more importance within 
the energy industry. Co-combustion of biomass in 
coal-fired power plants means the utilization of solid 
biofuels in existing hard coal- or lignite-fired power 
plants or combined heat and power (CHP) plants for 
the replacement of fossil fuel energy. (Parts of this paper 
are retrieved from [1].) For detailed citation see [1]. 

This option is particularly characterized by the 
following benefits: 

1. Highly efficient electricity provision from biomass 
compared to a electricity production in small-scale 
units; this allows conserving natural resources, 
which can even be improved by additional extrac¬ 
tion of heat in combined heat and power generation 
plants, 

2. High greenhouse gas (GHG) reduction efficiency 
by direct replacement of coal as the fossil energy 
source with the highest specific greenhouse gas 
emission, 

3. Relatively low investments for the prevalent co¬ 
combustion rates of 3-10% of the boiler capacity. 


Therefore, the objective of the following explana¬ 
tions is it to provide a comprehensive analysis of the 
technical effects on the power plant processes by dif¬ 
ferent co-fired solid biofuels. Subsequently, a special 
focus is put on the experiences with co-combustion of 
natural wood in three large-scale coal-fired power 
plants in Berlin/Germany. Finally, appropriate conclu¬ 
sions are drawn. 

Introduction 

Of all renewable sources of energy, biomass contributes 
by far most to cover the demand of energy in Germany 
and globally. In Germany, for instance, approximately 
70% of the total primary energy from renewable sources 
of energy is derived from biomass; for the generation of 
heat, biomass (mainly wood) takes a share of almost 
94%, for the biofuels this is 100%, and for electricity, it 
ranks second (28%) behind wind (40%). 

Approaches of a combined use of fossil and renew¬ 
able sources of energy particularly apply for biomass, 
for example, adding a mixture of solid biofuels to fossil 
fuels or feeding biogas into natural gas distribution 
systems as well as co-combustion of biomass in large- 
scale power plants. This combined utilization with 
a limited share of renewable energies is mainly used 
for technical reasons so that the high efficiency of the 
technical installations achieved with fossil standard and 
regular fuels as well as their life cycle will not be 
compromised. 

The latter particularly applies to biomass co- 
combustion in large-scale combined heat and power 
(CHP) plants, where a high efficiency for electricity 
generation exists due to advanced steam parameters. 
In addition, hard coal or lignite is directly replaced by 
the C0 2 -neutral biomass. Therefore, co-combustion of 
solid biofuels in coal-fired power plants is particularly 
efficient from a climate protection point of view due to 
direct and immediate replacement of fossil fuel energy 
with high specific C0 2 emissions (i.e., coal). In contrast 
to that, pure biomass-fired power plants primarily dis¬ 
place electricity from the local electricity mix and thus 
electricity provided with a high percentage of low-C0 2 
electricity from, e.g., windmills or PV systems. The 
moderate additional investments and short transport 
distances for the manageable quantities account for 
other benefits of biomass co-combustion in a limited 
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scope of approximately 3-5% of the boiler capacity. 
Such co-combustion of biomass in conventional coal- 
fired power plants is influenced by a series of ancillary 
conditions, which will be briefly discussed below. 

The share of solid biofuels in the total thermal 
performance of a conventional power plant fired with 
hard coal or lignite is limited by the amount of bio¬ 
mass, which can be used in the combustion system 
without significant disadvantages to be expected (e.g., 
corrosion, emissions, flow conditions). For instance, 
usually a share of 10% of biomass energy referring to 
the amount of fossil fuel energy resp. the installed 
thermal capacity should not be exceeded to keep mea¬ 
sures of adjusting the existing power plant to such 
changed conditions as low as possible. However, higher 
percentages are possible for lignite-fired systems for 
firing systems converted from lignite to hard coal 
with flue gas recirculation and for fluidized bed sys¬ 
tems. Additionally, the biomass flow, which can be used 
for co-combustion, is also limited by the quantity of 
biomass fuels, which can be provided at a certain power 
plant site for economic reasons. For example, the upper 
performance limit of biomass provision under Central 
European conditions would be approximately 50-100 
MW of biomass fuel capacity. For higher capacities, the 
size of the required catchment area will increase so that 
transport effort, which is mostly handled by road due 
to infrastructural constraints, will increase too much 
and thus the procurement costs for biomass will 
become too high. 

Against this background, the coal-fired power 
plants coming into consideration for co-combustion 
of solid biomass are on the one hand comparably 
“small” combined heat and power (CHP) generation 
plants normally operated by public utility companies in 
a capacity range below approximately 100 MW of elec¬ 
tric power. On the other hand, also the larger block- 
unit power stations with installed electrical capacities 
of 500-1,000 MW come into consideration. Facing the 
comparably large installed electric (and thus also ther¬ 
mal) power in the scope of large-scale power stations, 
even percentages of co-firing of 5-10% mean truly 
significant quantities of solid biomass, so that for the 
large-scale biomass co-firing, there is no need to be 
worried about higher shares of biomass in the firing 
heat capacity. 


The utilization of biomass in existing large-scale 
coal-fired power plants for the generation of electricity 
has a number of advantages compared to the genera¬ 
tion of electric energy in smaller plants exclusively fired 
with biomass. 

• The significant power plant capacity available 
in Central Europe for generating electricity 
from coal results in a high potential for the utili¬ 
zation of biomass. From a technical point of 
view, this potential is quickly available; limita¬ 
tions only exist due to the fact that principally 
not all locations of large-scale power plants are 
qualified for co-combustion of biomass (e.g., due 
to logistic limitations). Even with a comparably 
small share of biomass referring to the overall 
installed thermal capacity resp. to the overall used 
fossil fuel energy, a huge amount of solid biomass 
can be used. 

• Compared to smaller plants exclusively fired 
with biomass, the efficiency of electricity generation 
in large-scale power plants normally is significantly 
higher due to a higher technical effort possible in 
large-scale plants due to economic reasons. 

• In case of seasonal nonavailability or in case of 
supply bottlenecks for solid biofuels due to weather 
conditions, electricity generation can be ensured by 
the basic fuel of coal (i.e., high security of supply 
with high fuel flexibility). 

• The additional investment required for the realiza¬ 
tion of biomass co-combustion is relatively low 
because only the additional equipment is counted for. 

The substitution of the fossil fuel of coal by 
solid biofuels makes the acquisition of an according 
quantity of emission permits dispensable, thus creat¬ 
ing the financial scope for the co-combustion of 
biomass. 

Principally, besides solid biofuels (like wood) also, 
other biomass fractions available in a corresponding 
quantity and homogeneity (e.g., sewage sludge) can 
(and will) be co-fired in existing coal-fired power 
plants along with solid biomass such as wood and 
straw. Thus co-combustion is also suitable for types 
of biomass which from a combustion point of view 
are rather problematic and for which revenues for 
disposal is paid to some extent. 
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Biomass Processing 

The required processing of the solid biofuel for co¬ 
combustion in existing power plants depends on the 
type of biomass and on the firing system technology 
(see [1-31]). 

The solid biofuel co-fired in large-scale power 
plants should principally be free of foreign substances 
(e.g., rocks, metal parts) so that downstream conveying 
equipment, processing units (e.g., shredders for coarse 
matter), or dosing units will not be affected. 

Drying of the biomass may possibly be required. 
While this, from a combustion point of view, is nor¬ 
mally not required for wood and straw, it could how¬ 
ever be advantageous from an energetic point of view. 
This particularly applies if waste heat can be utilized for 
this purpose, which before that had not been used. 

Due to some impurities within some solid biofuels 
(e.g., chlorine in straw) problems can be expected 
during the combustion within the power plant (e.g., 
corrosion). For example, one measure to prevented 
such problems is the “washing” the fuel if the impuri¬ 
ties are water-soluble (which is the case for the chlorine 
within the straw). In this process, some impurities (e.g., 
chlorine, alkalines) are washed out of the solid biofuel 
with water. Because of this, the biofuel can be used 
within the firing system of the power plant with prob¬ 
ably less problems. However, the negative aspect is the 
high-energy input for the practical realization of such 
considerations. 

The particularities for dust firing and fluidized bed 
systems with respect to processing of the biofuels to be 
co-combusted are discussed below. 

Processing for Dust-Fired Systems 

Dust firing requires extensive milling of the biomass 
used (e.g., wood-like woodchips, herbaceous biomass 
like straw bales) to achieve a complete burnout. Cutting 
mills, hammer mills can be used for this purpose. 

The wood particles milled with a cutting mill have 
a rather cubic shape, while particles from herbaceous 
biomass (e.g., straw, hay) have the shape of oblong, 
rectangular platelets. The demand of electric energy 
needed for such milling is increasing with declining 
particle size. In average, the energy demand is between 
approx. 0.8 and 2% of the calorific value of the 


biomass, when cutting mills are used with screens of 
a mesh size between 2 and 6 mm. Using a hammer mill, 
the energy demand can be reduced to 0.5-1% of the 
calorific value. Increased water content of the fuel prin¬ 
cipally leads to higher energy consumption; the energy 
demand is especially increasing with water contents of 
more than 10-20%. For straw, for example, more than 
8% of the calorific value is needed for milling in 
a cutting mill with a mesh width of 2 mm and a water 
content of 30%. This energy demand can be reduced by 
previous torrefaction. 

Processing for Fluidized Bed Systems 

Wood can be used in fluidized bed systems as 
woodchips without further milling. For example, max¬ 
imum piece sizes for wood in circulating fluidized bed 
systems are around 60 mm and in stationary fluidized 
bed systems around 100 mm. For herbaceous biomass, 
the bales have to be opened to get loose stalks and the 
stalks need to be cut to lengths of around 10-30 cm. 
Thus, there are virtually no differences to exclusive 
biomass combustion with regard to fuel dimensions. 

Fluidized bed systems therefore require only little 
processing of the fuel to be fired. In addition, they are 
resistant to increased water content of fuels and/or low 
calorific values, as well as fluctuations in water content. 

Co-combustion in Dust-Fired Systems 

The resulting mass and volume flows and their changes 
compared to exclusive firing of coal, as design fuel, is of 
fundamental importance for the possibility of co¬ 
combustion of biomass in existing dust fired systems 
(see [1-31]). Also, the existing facilities for transport 
and processing of the fuel must be suitable with respect 
to the changed fuel volume flow. Beside this, they 
necessarily have the required capacity. In addition, the 
change of the flue gas volume flow has to be evaluated 
as this is influenced by the different water content in 
the solid biofuel resp. the coal and by the different 
quantities of reaction water from the oxidation of the 
hydrogen contained in the fuel. This will significantly 
affect the behavior of heat transfer and retention time 
inside the steam generator and the function of the 
downstream flue gas cleaning equipment. Figure 1 
outlines the effects of the co-combustion to the 
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1 Mill: Capacity, attrition 

2 Combustion chamber: Slagging 

3 Super heater: Corrosion 

4 Heat exchanger: Dirtiness, 
erosion 

5 Flue gas denitrification system: Deactivation, 
capacity, erosion 


6 Electrostatic precipitator: Capacity 

7 Ash: Utilization 

8 Flue gas desulphurisation plant: Capacity 

9 Products of the flue gas desulphurisation 
plant: Utilization 

10 Flue gas: Emissions 


Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 1 

Possible effects and affected plant components with co-combustion (according to [1]) 


components of a dust-fired power plant based on hard 
coal. They are discussed below for the co-combustion 
of wood and herbaceous biomass. In addition, sewage 
sludge will be considered as an example because, until 
now, this fuel of biogenic origin has gained the largest 
importance in practical realization. 

Fuel and Flue Gas Volume Flow 

From fuel supply to fuel input, the fuel volume flow is 
strongly increasing by the addition of wood and her¬ 
baceous biomass to the coal (Fig. 2) (see [1-31]). This 
is due to the significantly lower heating value of the 
solid biofuels compared to coal and, therefore, rela¬ 
tively larger biomass flow (also compared to coal) to 
achieve the same energy input. For example, a capacity 
share of 10% of co-combustion of straw in hard coal 
dust-fired systems will lead to doubling of the total fuel 
volume flow. Therefore, both milling of the biomass 
and conveying of the solid biofuel to the combustion 
chamber should be carried out separately. In this pro¬ 
cess, in most cases, the coal-grinding mills cannot be 
used for the biomass to be co-fired due to the different 
structure of the solid biofuel and the coal. Therefore, 
separate grinding mills are usually required for this. 

The co-combustion of biomass in coal-fired sys¬ 
tems will change the gas flow inside the boiler only to 


a small extend. Both, the fed combustion air and the 
flue gas volume remain constant largely. If, for exam¬ 
ple, dry wood or straw is used in a hard coal-fired 
system, the moist flue gas volume flow will change by 
maximum 1% with a share of 10% biomass of the 
installed thermal capacity, depending on the biomass 
and its humidity (Fig. 2). Even with very moist biomass 
(e.g., bark, fresh-cut grass), the increase of flue gas 
caused by this will remain in the area of a few percent. 

By co-combustion of biomass with hard coal, the 
water content of the sewage sludge will influence the 
fuel-mass flow as well as the flue gas volume flow. 
While for a thermally dried sewage sludge with 95% 
dry matter (TM) with a 10% share of sewage sludge 
referring to the overall installed thermal capacity, the 
fuel mass flow will increase by 18%, the increase with 
mechanically well dehydrated sludge with 45% dry 
matter (TM) is approx. 70% already with the same 
percentage of thermal capacity. 

This results in a high demand of adaptation for the 
conveyor technology, the mills and the feeding system 
for the fuel into the boiler, when disproportionately 
large quantities of water are entered together with the 
sewage sludge into the system. A similar situation is 
also given to the flue gas treatment system. The 
increased flow speed of the flue gas due to the increased 
volumes leads to a change of the heat transfer as well as 
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Share of biomass referring to the thermal capacity 


Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 2 

Change of the moist flue gas volume flow with biomass co-combustion [1] 
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an increase of the pressure losses; furthermore, it also 
affects the cleaning efficiency of the flue gas-treatment 
system. 

If however the main and the additional fuel have 
nearly the same content of water, the required changes 
are low. Therefore, co-combustion of moist biomass 
(e.g., humid wood, dehydrated sewage sludge) with raw 
lignite (water content approximately 50%) is a suitable 
and very promising option. The humid flue gas volume 
is slightly decreasing in this process. Dry biomass or 
dried sewage sludge can be fired with hard coal with 
a water content of approximately 7%. Moist wood 
would not lead to an acceptable increase of volume 
flow with higher shares referred to the thermal capa¬ 
city. For the usually realized low percentages of co¬ 
combusted referred to the thermal capacity, however, 
also mechanically dehydrated sewage sludge can be 
used in hard coal-fired plants usually without any 
major problems. 

Combustion Process 

The biofuel needs to be appropriately milled to ensure 
complete combustion within dust-fired systems (see 
[1-31]). The maximum particle size for safe ignition 


and complete burn-off- for example, in a 0.5 MW pilot 
plant - is approximately 6 mm for straw and approx. 
4 mm for Miscanthus (mesh diameter of mill) due to its 
woody structure. Grinding of wood is required between 
2 and 4 mm for co-combustion in coal dust flames. In 
plants with a higher thermal capacity of up to several 
1,000 MW, also coarser biomass particles can princi¬ 
pally be co-fired due to the longer retention times. In 
a large-scale power plant, for example, straw with stalk 
lengths up to 10 cm was completely burned up. 

In addition, significantly coarser fuel particles can 
be used in dust-fired systems than for coal due to the 
high volatile content of the solid biofuels. In this pro¬ 
cess, herbaceous biomass is more reactive than woody 
biomass and thus requires less fine grinding. So, there is 
no need to dry the solid biofuels for water contents of 
up to 50%. 

The significantly coarser grinding of the biomass 
particles affects the combustion process. This is exem- 
plarily shown in Fig. 3 in a coal dust flame and coal 
dust/biomass flame, respectively. The delayed ignition 
of the coarse straw particles can be seen for the run of 
the medium oxygen concentration along the combus¬ 
tion path. After ignition, the oxygen is used up faster 
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Distance from the burner 

Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 3 

Combustion process of a mixed biomass/coal flame (according to [1]) 


and the combustion after ignition until burn-off is 
faster than for coal. 

Slagging and Fouling 

Due to the lower melting points of the biomass ash 
particles (especially for straw) compared to coal ash 
particles, there is a higher risk of slagging in the area of 
the combustion chamber (see [1-31]). The effects of 
low melting temperatures are different for dry deashed 
and molten deashed firing equipment. While low melt¬ 
ing temperatures can be absolutely desirable for slag- 
tap furnaces, a lower melting temperature in dry 
furnaces can lead to slagging in the combustion cham¬ 
ber and especially in the area of the burner, which will 
affect operation and does not allow a continuous and 
safe operation any more. 

Fouling and slagging on the convective heating 
surfaces resulting from the solid biofuel increases with 
growingly lower melting points of the biofuel used. 
Examinations at a 0.5 MW research plant however 
show that the fouling rate for the co-combustion of 
straw was only insignificantly higher than for firing of 
coal which is rather less prone to pollution. Further¬ 
more, the dust layer resulting from the combustion can 
be easily cleaned. The properties of the coal ash there¬ 
fore dominate the ash behavior of the ash mixture if the 
share of biomass within the fuel mixture is not too 
high. If however larger straw particles will deposit, 
which are incompletely burned off, they lead to 


corresponding slagging due to their low melting point 
of straw ash. 

Corrosion and Erosion 

Herbaceous biomass (e.g., straw) has a significantly 
higher chlorine content compared to coal and wood 
(see [1-31]). This can lead to increased high- 
temperature corrosion with corresponding abrasion 
of the surfaces of the heat exchanger. The heating 
surfaces of the superheater with their high steam and 
flue gas temperatures are affected first and foremost 
(Fig. 4). All in all, however, the measured corrosion 
rates at the heating surfaces of the coal dust-fired 
equipment seem to be acceptable compared to exclu¬ 
sive firing of coal if the percentage of biomass in the 
total firing heat capacity is limited to small quantities, 
as shown. 

Due to the low share of ash referring to the fuel 
mass in most biomasses discussed for co-combustion, 
there will be almost no increase of erosion (i.e., mate¬ 
rial removal by abrasive flue gas components) to equip¬ 
ment parts, where the dust-loaded flue gas volume flow 
is passing through. 

The chlorine content of sewage sludge is in the 
range of the chlorine contents of coal. Therefore, 
increased high-temperature corrosion cannot be 
expected in case of co-combustion of coal with sewage 
sludge. However, increased pollution and erosion may 
result in the area of the convective heating surfaces. 
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Material temperature 


Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 4 

Corrosion rates for co-combustion of straw in a dust-firing system with an electric capacity of 130 MW, exemplary for the 
material X20CrMoV121 (according to [1]) 
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Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 5 

Nitrogen oxide (NO x ) emissions with different excess air coefficients in the primary zone for different types of biomass with 
fuel mixture of 25% biomass and 75% coal, each time related to the thermal fuel capacity (according to [1]) 


Emissions 

The co-combustion of wood and herbaceous biomass 
in coal-fired power plants leads to a reduction of the 
main harmful substances in the raw flue gas prior to 
flue gas cleaning (see [1-31]). Sulfur dioxide (S0 2 ) is 
reduced, on the one hand, by the integration of sulfur 
into the ash and, on the other hand, by dilution, 
because the biomass fuels shows in average significant 
lower sulfur content than coal. Also, nitrogen oxide 
(NO x ) can reach lower values due to biomass-specific 
advantages in combustion kinetics even if there is not 
much change for the total nitrogen content in the fuel. 
Beside this, the carbon monoxide (CO) emissions do 
not increase when the biomass is sufficiently milled. 

Due to the high content of volatile matter, biomass 
is particularly suitable for the application of nitrogen 


oxide (NO x ) reduction processes such as air and fuel 
grading. Although for straw, the fuel nitrogen related to 
the calorific value approximately corresponds to the 
value of coal. Thus, the higher release of products of 
pyrolytic degradation and volatile nitrogen com¬ 
pounds produce less nitrogen oxide in the process of 
thermo chemical reaction. Figure 5 therefore shows as 
an example the NO x emissions identified in a pilot 
plant for a fuel mixture of 25% of particular biomass 
and 75% of hard coal (each time related to the total 
installed thermal capacity). Thus, approximately the 
same NO x emissions result independently of the nitro¬ 
gen content in the biomass. 

Therefore, the co-combustion of biomass in dust- 
fired systems does not place any requirements on the 
nitrogen content of the biomass. Also, higher nitrogen 
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concentrations in the biofuel can be managed by mea¬ 
sures of firing technology. 

The co-combustion of sewage sludge however leads 
to an increase of the concentration in the raw flue gases 
for specific substances such as sulfur dioxide, nitrogen 
oxides, or volatile heavy metals, which may require 
additional downstream flue gas cleaning stages. The 
pollutant concentrations within the flue gas depends 
on the percentage of sewage sludge in the total fuel 
throughput, the percentage of sulfur or nitrogen relat¬ 
ing to the calorific value and the conversion rate. In 
case of co-combustion of thermally dried sewage sludge 
with hard coal, for example, the nitrogen input of the 
sewage sludge relating to the calorific value is approx¬ 
imately 7 times larger than that of coal. For fuel sulfur, 
the relation is about 3.3. 

The conversion rate of sulfur to sulfur oxide (S0 2 ) 
is independent of the percentage of sewage sludge and 
is approximately 90%; this results in an increase of the 
S0 2 emissions proportional to the percentage of the 
sewage sludge. The high content of calcium oxide 
(CaO) of the sewage sludge ash does not lead to 
a reduction of the S0 2 emissions. This can be explained 
by an inactivation of the calcium oxide by the surface 
fusion with the high temperatures in the dust firing. 

With corresponding reduction measures of air and 
fuel grading in the combustion chamber, the nitrogen 
oxide emissions of the sewage sludge co-combustion 
are comparable to those with exclusive firing of coal. 
The high nitrogen percentage in the fuel, however, 
should be considered for the burner design and the 
air distribution in the particular combustion chamber. 

The chlorine contents related to the calorific value 
for hard coal and municipal sewage sludge is approxi¬ 
mately in the same range; a significant increase is pos¬ 
sible only for industrial sludge. Generally, no increased 
values are measured also for the PCDD/PCDF emis¬ 
sions, which can be formed in the fuel in the absence 
of chlorine. 

Flue Gas Denitrification 

By analyzing the effects of biomass co-combustion to 
the denitrification in DENOX plants, it must be differ¬ 
entiated between high-dust and low-dust configuration 
(see [1-31]). 


Altogether for a low-dust configuration, the possi¬ 
ble effects (e.g., catalyst deactivation) to the operation 
of a DENOX plant are less compared to the high-dust 
configuration due to the previously realized flue gas 
cleaning with the electrostatic precipitator and the flue 
gas desulfurization system (REA); therefore, this con¬ 
figuration offers advantages for the co-combustion 
compared to the high-dust configuration. 

In particular, for a configuration of the DENOX 
system in the high-dust area as usually realized in 
hard coal dry-firing systems, the catalyst is endangered 
by possible deactivation during co-combustion; this 
particularly applies if straw is used. Different mecha¬ 
nisms can be involved in this deactivation process. For 
example, the catalyst can react with potassium and 
sodium. Therefore, limit values of the alkaline percent¬ 
age are specified by manufacturers of DENOX-catalysts 
(K 2 0 and Na 2 0 below 4% by weight of ash), which 
however - depending on the coal used - can already be 
reached with small fractions of straw. Additionally, 
a blockage of the pores of the active catalyst cells may 
result due to alkaline and alkaline earths. In addition, 
phosphor may contribute to a toxification of the cata¬ 
lyst. Such damage can be prevented by arranging the 
catalyst behind the flue gas desulfurization system 
(REA) (i.e., low-dust configuration). This problem 
does not occur in lignite firing systems, where primary 
measures are used for denitrification. In general, sig¬ 
nificantly less problems can also be expected for 
wood due to the considerably lower content of prob¬ 
lematic substances. 

For the high-dust configuration, the relatively high 
ash content of sewage sludge can lead to pollution and 
to erosion on the DENOX-catalyst. Due to the high 
nitrogen fraction, the NO x -concentrations in the raw 
flue gas can increase. Due to this, they need to be 
subsequently reduced in the DENOX system. Specific 
substances (e.g., arsenic, phosphor) may lead to 
a deactivation of the catalyst here. 

Flue Gas Desulfurization 

The low content of sulfur of most solid biofuels leads 
to a relief of the flue gas desulfurization plant (REA) 
(see [1-31]). The input of other flue gas substances 
however can possibly affect the function of the flue 
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gas desulfurization plant or additionally burden it; this 
also may result in a limitation of the biomass share. 
This particularly applies to the chlorine input into the 
flue gas desulfurization plant. 

Next to sulfur, also a number of other flue gas 
components are deposited in the flue gas desulfuriza¬ 
tion plant (REA); these are, e.g., the volatile ash com¬ 
ponents in the flue gas such as mercury, arsenic, lead, 
and other heavy metals. They can be found in the REA 
residuals (e.g., REA gypsum); therefore, their quality 
has to be inspected prior to marketing. For wood and 
herbaceous biomass considered here, the concentra¬ 
tions of such substances can be disregarded compared 
to coal. 

The sulfur contained in the sewage sludge however 
significantly exceeds the sulfur content of coal in most 
cases. Therefore, the desulfurization plant has to have 
a sufficient capacity to safely adhere to the legal emis¬ 
sion limits. For example, for a 25% co-firing of sewage 
sludge within a coal-fired power plant, the S0 2 amount 
to be bonded within the flue gas desulfurization plant 
will increase by the factor 1.6. 

Ash Formation and Utilization 

When considering the possible effects of co¬ 
combustion of biomass in coal-fired power plants, 
also the results for ash formation and ash utilization 
has to be taken into consideration (see [1-31]). 

Ash formation. The small ash fractions of the soli 
biofuels wood and straw relieve the facilities for dust 
separation. Altogether, less ash accrues in case of co- 
combustion of wood and herbaceous bioenergy sources. 

In contrast, the high ash content of sewage sludge 
can lead to overcharge of the electrostatic precipitator. 
When 5-25% thermally dried sewage sludge is co- fired 
with hard coal (relating to the fuel capacity), an ash 
quantity multiplied by the factor 1.7-4.7 must be antic¬ 
ipated. For a sewage sludge proportion of 5 resp. 25% 
for example, already 44 resp. 84% of the ash comes 
from the sewage sludge. This results in corresponding 
changes of the ash properties. 

Ash utilization. The composition of the fly ash and 
the bottom ash of coal dust-fired systems determine 
the possibilities for its utilization. For example, the 
concentration of alkaline, sulfates, chlorides, and 


unburned carbon is essential for using the fly ash in 
the cement and concrete industry. 

The co-combustion of wood is least critical due to 
the very low ash content. For straw, the increase of the 
alkaline content and of the unburned substances can 
limit the utilization of ash as soon as a straw fraction of 
10% referring to the thermal capacity is exceeded. In 
addition, an increased amount of unburned straw 
nodes can remain, which will make the utilization of 
the fly ash difficult. For lignite-fired systems, however, 
the ash is mostly used for recultivation in open-hole 
mining so that appropriate measures shall ensure that 
specific ash substances should not leach out and enter, 
e.g., the ground water. 

A decrease of the carbon content was observed in 
the fly ash with an increasing proportion of sewage 
sludge. Insofar, the higher ash fraction in the sewage 
sludge improved the fly ash product to be marketed. 

Primarily, the other ingredients of the sewage 
sludge ash need to be considered for further assessment 
of the ash properties compared to the coal ash. Next to 
the main ingredients, first of all the substances with 
toxic behavior and/or accumulation behavior in the 
biosphere are of special interest. Despite of the given 
spread, however, it turns out that only very few sludges 
are more contaminated than this is permitted for agri¬ 
cultural use. 

Apart from substances like mercury, selenium, and 
arsenic, which as elements or in their compounds emit 
significant fractions via the flue gas flow due to low 
boiling points, most microelements from sewage 
sludge can be found in the solid residues of firing or 
flue gas cleaning. Figure 6 shows a direct comparison 
between sewage sludge ash and typical hard coal ash. 
Accordingly, the concentration of microelements in 
consideration of the different ash contents becomes 
similar. The co-combustion of sewage sludge with 
hard coal thus does not lead to a significant change of 
the pollutant concentration in the ash. 

For heavy metals leaving the boiler, where the com¬ 
bustion takes place partly via the flue gas flow, the 
removal from the flue gas has to be examined in the 
following cleaning stages. In the wet flue gas desulfur¬ 
ization systems (REA) usually integrated within power 
plants, heavy metals are only partly removed. The 
remaining part is emitted via the flue gas flow into 
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Co-combustion of Wood in Coal-Fired Large-Scale Power Plants. Figure 6 

Comparison of trace elements in dry fuels {left) and ashes {right) {TM dry matter; according to [1]) 


the atmosphere. For mercury, for example, a removal of 
approximately 50% is achieved; the rest is emitted into 
the atmosphere. This may require improvement of the 
flue gas desulfurization systems (REA) customary in 
power plants, for example, by specifically adapted pre¬ 
cipitating agents or an additional downstream filter 
stage (e.g., activated charcoal filter). It has to be 
noted in this respect that different regulations apply 
according to the legal emission limits for coal and for 
sewage sludge. The latter is basically categorized as 
waste; in Germany, for example, the requirements of 
the 17. BlmSchV (17th Federal Control of Pollution 
Act) has to be met, which are stricter than those of 
the 13th BlmSchV, which apply for exclusive firing of 
coal in conventional power plants. The European stan¬ 
dard EN 450, amended in 2005, which regulates the 
utilization of fly ash as a concrete additive, now also 
includes fly ash from the co-combustion of fuels such 
as wood, straw, sewage sludge, or also paper sludge. 
This is valid as long as the mass fraction of the co-fired 
fuels does not exceed 20%, and the ash formation 
from the additional fuel is not more than 10%. Next 
to the mentioned quality requirements, further char¬ 
acteristics are specified in the EN 450. Thus, by the 
amendment of the standard, an essential obstacle was 
overcome for the propagation of co-combustion. 

Co-combustion in Fluidized Bed Systems 

Aspects of co-combustion in fluidized bed systems are 
discussed below (see [1-31]). 


Combustion Process 

In contrast to coal, biomass with its higher content of 
volatiles tends to post-combustion in the free space of 
the fluidized bed firing system (see [1-31]). This is in 
particular true for stationary fluidized bed systems. 
Furthermore, smaller, light particles (e.g., straw) can 
be carried out of the fluidized bed. These light particles 
combust within the free space and increase due to this 
post-combustion the temperature inside the free space. 
A good mixing within a circulating fluidized bed sys¬ 
tem will evenly distribute the fire-room temperature. 
However, a temperature shift upward can result also 
here, if biomass fuels are co-incinerated. 

Since fluidized bed systems are particularly suitable 
for fuels containing ballast, co-firing of mechanically 
dehydrated sewage sludge makes sense. However, this 
mainly serves for disposal and less for energy gain due 
to the low calorific value and the revenues which can be 
achieved. While the higher content of water of the 
sewage sludge increases the volume flow of the flue 
gas compared to hard coal, this effect is low for the 
co-combustion with lignite. Depending on the calorific 
value and on the added amount of sewage sludge, 
however, the thermal capacity and thus the steam pro¬ 
duction will correspondingly decrease. 

On the other hand, synergy effects can result due to 
the co-combustion of biomass increasing the efficiency 
of power plant operation. For example, it has been 
proven in an industrial fluidized bed power plant that 
by the co-combustion of wood pellets and the thus 


















































































Co-combustion of Wood in Coal-Fired Large-Scale Power Plants 



resulting fine wood ash, the balance of particle size of 
the fluidized bed system shifted toward finer particle 
sizes. This significantly reduced the solid matter inven¬ 
tory inside the combustion chamber, leading to 
a reduction of the electricity demand of the power 
plant (i.e., the own consumption has been reduced). 

Slagging and Fouling 

While no additional operation problems are expected 
for the co-combustion of wood, solid biofuels based on 
herbaceous biomass can lead to increased slagging and 
fouling compared to coal (see [1-31]). This is particu¬ 
larly due to the potassium chloride contained in her¬ 
baceous biofuels. 

In a circulating fluidized bed system (88 MW ther¬ 
mal capacity), for example, firing of coal with a sulfur 
content of 3% together with straw (proportion in the 
thermal capacity each 50%) resulted in heavy slagging 
within the combustion chamber in the cyclone and in 
the area of the superheater. After short operations 
time, the operating parameters could no longer be 
met. Afterward, only coal was used with sulfur content 
below 1%. The increasing melting temperatures of the 
ash deposits with the use of such low-sulfur fuel 
resulted in less slagging; slagging only occurred in the 
area of the superheater, where it was facilitated by 
the design of the superheater with narrow channels 
(here: 37 mm). However, this problem could be 
avoided to a large extent by use of suspended super¬ 
heaters with a channel width of 50 mm (for the pre¬ 
vention of bridging) as well as lowering the flue gas 
temperature below the melting point of potassium 
chloride (770°C). 

Compared to an exclusive coal firing, the fouling 
rate on plant components subject to flue gas contam¬ 
ination increases, e.g., for a straw proportion of 50% of 
the installed thermal capacity, the fouling rate reaches 
five times the value of exclusive coal firing. However, 
such pollution is easy to remove. 

Corrosion and Erosion 

Corrosion problems are not expected when wood is co¬ 
combusted in fluidized bed systems (see [1-31]). This 
however does not apply for biofuels produced from 
herbaceous biomass, particularly due to the potassium 
chloride contained in it. 


Corrosion examinations of different materials 
within a circulating fluidized bed system showed sig¬ 
nificant corrosion to the convective superheater 
heating surfaces with the co-combustion of straw, 
although the test results with short test periods of 
500-1,000 h are associated with a certain degree of 
uncertainty. When martensitic steel (X20CrMoV121) 
was used, the corrosion rates, for example, for co¬ 
combustion of straw were approximately 10 times 
higher than with exclusive firing of coal (Fig. 7). They 
are thus significantly higher than within a dust-firing 
system with the same proportion of straw. It is assumed 
for the reason for the high corrosion rates that the in 
situ desulfurization in the fluidized bed facilitates the 
formation of potassium chloride. This potassium chlo¬ 
ride will precipitate on the superheater pipes and 
releases the corrosion-causing chlorine when potas¬ 
sium sulfate is formed. Also, the selection of different 
high-alloy steels did not result in a significant decrease 
of the corrosion tendency. 

Different measures are taken to reduce the corro¬ 
sion of the convective heating surfaces. The most 
important measure turned out to be the reduction of 
the bed temperature below 860°C. By this and other 
modifications, it has been achieved that after 7 years, 
the superheater still was in operation. 

Newer concepts therefore aim not to arrange the 
superheater in the flue gas tract, but as immersion 
heating surface within a fluidized bed cooler, and thus 



Temperature 

Co-combustion of Wood in Coal-Fired Large-Scale Power 
Plants. Figure 7 

Corrosion rate for co-combustion of 50% straw in 
a circulating fluidized bed system versus steam 
temperature (according to [1]) 
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to utilize the characteristic of a circulating fluidized bed 
system. Since there may be unconsummated straw 
components in the cyclone return of the circulating 
fluidized bed, forming potassium chloride during 
their combustion, comparably high abrasion rates 
were determined with corrosion sensors arranged 
inside the return. If corrosion is to be prevented, it is 
therefore required to connect a non-cooled fluidized 
bed upstream of the fluidized bed cooler with super¬ 
heater in order to assure burnout prior to reaching the 
superheater. This design has proven to be successful; 
however, the immersion heating surfaces eroded after 
several years. 

Both the corrosion and emission problems as well 
as the mentioned slagging and fouling problems can be 
practically prevented best by input of correspondingly 
less fractions of biomass relating to the firing heat 
capacity (below 10%). 

Emissions 

The co-combustion of biomass in coal-fired fluidized 
bed systems mainly has the effect that emissions are 
reduced, except for the hydrogen chloride (HC1) emis¬ 
sions when straw is used (see [1-31]). 

The sulfur dioxide (S0 2 ) emissions decrease with 
increasing biomass proportion in relation to the ther¬ 
mal capacity. This, on the one hand, is due to the lower 
sulfur content of the biomass compared to coal and, on 
the other hand, to the encapsulation of the S0 2 in the 
biomass ash. 

For the nitrogen oxide (NO x ) emissions, the possi¬ 
ble effects are not uniform. Tests showed that there 
were only small changes of the NO x emissions with 
small proportions of biomass to be co-combusted. 
Other experiences even revealed a reduction of NO x 
with an increasing biomass proportion; this particu¬ 
larly applies to co-combustion of wood. Tests have also 
shown that the N 2 0 emissions significantly decrease 
with the co-combustion of biomass. 

Similar as for dust-fired systems, the co¬ 
combustion of straw causes an increase of hydrogen 
chlorine (HC1) emissions. In a fluidized bed system, for 
example, with a straw proportion of 60% relating to the 
installed thermal capacity, the chlorine input into the 
firing system was about 20 times higher than in case of 
exclusive firing of coal. The entered amount of chlorine 


can be found almost completely in the flue gas. The co¬ 
combustion of wood however leads to a reduction of 
the HC1 emissions. 

For the co-combustion of sewage sludge within 
a circulating fluidized bed system, the emissions of 
S0 2 , NO x , CO, and dust are within the legal range of 
fluctuation. For example, in Germany, the limit values 
for the application of the proportional regulation 
according to 17th BImSchV are therefore undercut. 
In order to safely meet the limit value for heavy 
metals (e.g., mercury) it may be required to add an 
additional flue gas cleaning system (e.g., fly stream 
absorber with lignite coke) downstream of the electro¬ 
static precipitator. 

Ash Formation and Utilization 

Correspondingly, similar conclusions as for dust-fired 
systems apply to the accruing ash and its possibilities 
for utilization (see [1-31]). 

As, for example, the ash proportion in the intro¬ 
duced sewage sludge is approximately 50% relating to 
the dry matter (TM), the ash quantity accruing in the 
fluidized bed system will significantly increase. In this 
process, the toxic substances contained in the sewage 
sludge are encapsulated inert within the ash. The only 
exception is mercury that is released from the boiler 
with the flue gas. The ash can therefore be used, e.g., for 
purposes of recultivation in open-top lignite mines, 
because, for example, in Germany the requirements of 
the dump category 1 of the TA (Technical Instruction) 
for municipal waste are met. 

Berlin Case Study 

A detailed analysis of the options and limits of co- 
combustion of solid biomass in existing coal-fired 
power plants was carried out for Berlin/Germany. The 
prepared technical analysis as well as the existing 
approvals for the power plants resulted in a clear deci¬ 
sion for a co-combustion of virgin wood. On this 
basis, both the availability of resources and the tech¬ 
nical consequences potentially resulting from co¬ 
combustion to the existing plants are discussed below. 

Fuel Procurement 

Germany is a densely wooded country. Approximately 
11.1 million hectares, almost one third of the total area 
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of Germany, is covered with forests. Due to the favor¬ 
able climate, the average amount of wood produced 
per hectare is 3 times the amount produced, e.g., in 
Sweden. The annual felling in Germany steadily 
remains between 60 and 80 million solid cubic meters. 
Acting on the assumption of a bulk density of 0.6 t/m 3 
and an average calorific value of 3 MWh/t, this is 
126 million MW of heat energy for 70 million solid 
cubic meters. Thus, the heat energy of the total German 
annual felling would be sufficient for the electricity 
generation in five 1600 MW state-of-the-art large 
hard coal-fired power plants (57 million MWh el ). 
This is approximately one tenth of the demand for 
electric energy in Germany. 

This context on the one hand makes clear the lim¬ 
ited quantities of wood as a resource. On the other 
hand, it also required maximum efficiency of wood 
utilization. In addition, the use of wood as a raw mate¬ 
rial should be favored because of considerations of 
climate politics since the carbon will then be bonded 
in the wood products after the growth phase of the 
plant and is not immediately released back to the 
atmosphere in the form of C0 2 . 

Therefore, mainly rather low-grade wood (includ¬ 
ing short stubs, rotten knots) is suitable. This will be 
initially shredded for further processing. Thus, the co¬ 
combustion primarily focuses on wood from landscape 
conservation, scrap wood, rolled lumber, wood affected 
by bark beetles and thinning material from short rota¬ 
tional plantations, and similar types of wood. Wood, 
which can be used as a raw material, is out of question 
often also for reasons of costs. 

In addition, it should be observed for the men¬ 
tioned low-grade types of wood basically suitable for 
co-combustion that brushwood, leaves, needles, and 
roots should remain in the forest for the formation 
of humus, in terms of sustainable forestry. However, 
there are special cases (e.g., leaves from parks or 
streets), where energetic utilization may be considered. 
Subject to the ancillary conditions mentioned here, an 
approval for the coal-fired power plants in Germany 
should be assumed according to the legal frame condi¬ 
tions for waste and similar combustible substances. 

Due to the relatively low calorific value and partly 
high water content of the firewood, which basically 
would be suitable for co-combustion, only short trans¬ 
port distances can be economically reasonably realized. 


Even if the economic efficiency of firewood for co- 
combustion is supported by saving of C0 2 certificates, 
this advantage is in contrast with the high specific 
transport costs per ton of fuel for overland transporta¬ 
tion. One individual lorry with two 38 m 3 containers 
with approximately 18-20 t of wood cannot compete 
against the specific costs of large-scale technology, for 
example of lorries, with 7,000 t of oil or huge conveyor 
lines from the nearby open-top mining with almost 
1,000 t/h of lignite. Long transport distances therefore 
are generally economically conceivable for energy 
sources with a high-energy density and good-handling 
properties (e.g., oil, hard coal). This however also 
applies to wood pellets. 

Technical Solutions 

The coal boilers available in Berlin/Germany are 
designed as dust-firing systems. A fast and fully com¬ 
bustion of the coal dust particles is achieved with 
the coal grinded to dust. The wood co-combusted 
within this boiler needs to be of appropriately small 
size, which places increased requirements to wood 
processing compared to grate firing. 

In order to meet this requirement with respect to 
the size of the woodchips, a two-stage strategy is pur¬ 
sued for the three coal-fired combined heat and power 
plants used and/or designed for co-combustion. The 
wood is initially shredded to approximately 40 mm 
delivery size, and then there is another grinding process 
to lengths of several millimeters within the particular 
hammer mills of the power plants. 

For this technology of joint milling of the coal- 
woodchips mixture, the grinding and feeding into the 
boiler is the main problem of the co-combustion 
within these power plants. The reason lies in the differ¬ 
ent fuel properties of hard coal and wood. 

• On the one hand, wood has a significantly larger 
volume relating to the calorific value compared to 
hard coal. The result is that the amount of wood is 
limited which maximally can be conveyed through 
the coal mills. For a proportion of 5% of the ther¬ 
mal boiler capacity, the total fuel volume increases 
by up to 30-40%. 

• On the other hand, the proportion of moisture and 
volatile substances in wood is larger than in hard 
coal. The larger content of moisture requires more 
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intense drying of the mixture and leads to higher 
flue gas losses. In addition, for the co-combustion 
of wood, there is an increased risk that the wood 
components will already ignite before they enter 
into the mill due to the higher content of volatile 
matters. There also is an increased risk of slagging in 
the combustion chamber due to the lower melting 
point of the wood ash, which at least in the slag-tap 
furnace of one of the Berlin power plant is beneficial 
for the slag removal from the boiler. 

These issues shown here were tested in the Reuter C 
hard-coal power plant in several test series with an elec¬ 
tric capacity rating of 132 MW, and technical solutions 
were found for the co-combustion of wood (e. g., by a 
reduction of the sifter temperatures in front of the mills). 

An addition, the effects of co-combustion of wood 
of up to 5% of the fuel heat for the efficiency of the 
boiler in Reuter C has been examined. A maximum 
loss of efficiency of 0.25% has been measured for wood 
with a moisture content of 50%. This however can be 
assessed as very low. 

Due to the longer holding times of the fuel in 
fluidized bed systems, higher co-combustion rates 
than 5% of wood are possible there. A direct supply of 
the wood into the boiler, i.e., no longer via the hammer 
mills, can here open up new dimensions for the 
co-combustion of wood. 

Economic Efficiency 

The economic efficiency of the co-combustion of wood 
is mainly influenced by the price of wood as a fuel, the 
price of the coal replaced by the wood and the costs for 
the C0 2 certificates as well as the additional investment 
costs and operating costs for the co-combustion of 
wood. The main cost factor of the co-combustion in 
this process is the costs for the fuel itself. These costs 
roughly are broken down into the actual price of wood 
as well as the shredder and transport costs. For prefer¬ 
ential use of wood from landscape conservation from 
the close vicinity, the prices of wood and transport 
accordingly are moderate, and the highest costs result 
from the mobile shredder equipment on location. 
These shredder costs however can be reduced to 
approximately two thirds with stationary electric 
shredders; for economic reasons, however, a fixed loca¬ 
tion and larger quantities of wood is required. 


The other important economic parameters such as 
the market prices for electricity, heat, and the efficiency 
of the power plant play the important part in the 
resource planning of the power plant; it is decided 
here whether it is economically worthwhile to operate 
of shut off a power plant in a specific market situation. 
The co-combustion of wood to the amount of 5% of 
the thermal capacity has only small influence to this. 
But the economic efficiency of the co-combustion of 
wood is adversely affected by longer standstill periods 
of the combined heat and power plants, in particular 
outside the heating period, because then less wood can 
be co-combusted. 

At the end of 2009, for example, the following 
circumstance has been observed. The price of hard 
coal declined to approximately half the price compared 
to fall 2008. At the same time, the price for C0 2 certif¬ 
icates declined by approxiamtely one quarter compared 
to the end of 2008. In contrast, the price of wood 
declined only to a small extent. The forest owners 
practise retentiveness with felling when the prices for 
wood are going down and wait for a higher price level. 
For wood from landscape conservation, however, the 
price remained stable with slightly reduced shredder 
and lorry transport costs depending on the oil 
price. Therefore, the economic situation of the co- 
combustion of wood in hard-coal fired power plants 
in Germany changed to the worse in 2009 compared to 
2008, but slightly recovered again in the year 2010. 

Conclusion 

The co-combustion of biomass is an option for a 
relatively efficient production of electricity out of 
large quantities of biomass. Against this background, 
the goal of this paper is it to analyze these options and 
to examine them by means of a case study. The issues 
explained can be briefly summarized as follows. 

• Basically, a multitude of different types of biomass 
can be used in existing coal-fired power plants. 
Recently, quite different technical solutions were 
realized. 

• There should be no basic technical problems as long 
as the proportion of the biomass used in relation to 
the coal is small, and this does not significantly 
change the properties of the coal with regard to 
combustion. 
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• Nevertheless, co-combustion of wood in coal- 
fired power plants requires individual technical 
solutions depending on the unloading situation, 
on the type of mill and boiler, as well as on the 
location of the power plant. Generally, appropriate 
solutions can be found, as indicated by the 
development in some European countries in the 
past years. 

All in all, the co-combustion is a technically prom¬ 
ising option for the large-scale highly efficient produc¬ 
tion of electricity from biomass by the substitution 
of an energy source with relatively high specific 
greenhouse gas emissions. This efficiency can even be 
improved by co-combustion in combined heat and 
power plants. 
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Glossary 

Cartagena protocol on biosafety (CPB) The Carta¬ 
gena Protocol on Biosafety to the Convention on 
Biological Diversity is an international agreement 
which aims to ensure the safe handling, transport, 
and use of living modified organisms resulting from 
modern biotechnology that may have adverse 
effects on biological diversity, taking also into 
account risks to human health. It was adopted 
on 29 January 2000 and entered into force on 
11 September 2003 (for full text, see http://bch. 
cbd.int/protocol/text/). 

Convention on biological diversity (CBD) The objec¬ 
tives of this Convention are the conservation of 
biological diversity, the sustainable use of its com¬ 
ponents, and the fair and equitable sharing of the 
benefits arising out of the utilization of genetic 
resources, including by appropriate access to 
genetic resources and by appropriate transfer of 
relevant technologies, taking into account all rights 
over those resources and to technologies, and by 
appropriate funding (for full text, see http://www. 
cbd.int / convention/text / ). 

Convention/protocol/treaty A treaty is an agreement 
in written form between nation-states (or interna¬ 
tional agencies, such as the United Nations, that 
have been given treaty-making capacity by the 
states that created them) that is intended to estab¬ 
lish a relationship governed by International Law. It 
may be contained in a single instrument or in two 
or more related instruments such as an exchange of 
diplomatic notes. Various terms have been used for 
such an agreement, including treaty, convention, 
protocol, declaration, charter, covenant, pact, act, 
statute, exchange of notes, agreement, modus 
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vivendi (“manner of living” or practical compro¬ 
mise), and understanding. The particular designa¬ 
tion does not affect the agreement’s legal character. 
Genetically modified/genetically engineered/trans¬ 
genic organisms Organisms, such as plants, ani¬ 
mals, and microorganisms (with the exception of 
human beings), in which the genetic material 
(DNA) has been altered in such a way that does 
not occur naturally by mating and/or natural 
recombination (The terms “genetically modified” 
(GM), “transgenic,” “genetically engineered” (GE), 
and “living modified” (LM) are used in different 
legal instruments around the world. It is useful 
(and deliberate) in this document to essentially 
use them interchangeably). 

Living modified organism (LMO) Any living organ¬ 
ism that possesses a novel combination of genetic 
material obtained through the use of modern bio¬ 
technology (according to the CPB). 

Modern biotechnology The application of [ 1 ] in vitro 
nucleic acid techniques, including recombinant 
deoxyribonucleic acid (DNA) and direct introduc¬ 
tion of nucleic acid into cells or organelles, or [2] 
fusion of cells beyond the taxonomic family, that 
overcome natural physiological reproductive or 
recombination barriers and that are not tech¬ 
niques used in traditional breeding and selection 
(according to the CPB). 

Definition of the Subject 

This contribution describes and compares the regula¬ 
tion of GMOs and the underpinning legislative frame¬ 
works in selected countries from around the world. It 
also includes a description of the relevant international 
agreements related to biosafety and a description of the 
main characteristics and attributes of a modern bio¬ 
safety regulatory framework in this area. 

Introduction 

The rapid development and deployment of modern 
biotechnology in the last decades have made biosafety 
a critical issue. Although modern biotechnology has 
the potential of benefiting agricultural interests in 
developing countries as well as overall human wel¬ 
fare, living modified organisms (LMOs) resulting 
from modern biotechnology or genetically modified 


organisms (GMOs) remain a source of concern with 
regard to the conservation and sustainable use of bio¬ 
diversity, as well as to human health. The perceived 
risks, which relate to the release of GMOs into the 
environment as well as the placement of GMOs onto 
the market, have as much to do with social values as 
scientific concerns. For example, social concerns may 
require the labeling of genetically modified (GM) food 
and feed, the mitigation of socioeconomic impacts, and 
the demonstrated potential for the co-existence of 
organic, conventional, and GM farming [1]. 

Global use of GM crops is growing rapidly, increas¬ 
ing approximately 87-fold from 1996 to 2010 to 
148 million hectares under cultivation by 15.4 million 
farmers in 29 different countries [2]. Furthermore, the 
estimated value of the global GM crop market in 
2010 grew to US$11.2 billion, while the value of 
harvested products was estimated at US$150 billion 
[2]. Although the majority (52%) of GM crops are 
still grown in industrialized countries, developing 
countries are rapidly approaching parity and, due to 
their high rates of adoption, are soon expected to grow 
the majority of GM crops [2]. By 2009, the major 
biotech crops had achieved high levels of market pen¬ 
etration: 77% of soybean, 49% of cotton, 26% of maize, 
and 21% of oilseed rape grown globally in 2009 were 
GM varieties [3]. 

Despite expected high benefit-cost ratios from 
biotechnology, only a few developing countries, such 
as Brazil and Argentina, have had high uptake rates 
(over 20 million hectares) [2] of GM crops, with uptake 
typically concentrated in crops that are exported to 
developed country markets. Few others (mainly 
India and China) have started exploring their own 
national research capability in biotechnology. In the 
vast majority of developing countries, both investment 
in biotechnology research and development and 
the transfer to farmers of transgenic crops already 
being marketed have been generally low. This in part 
reflects a lack of transparent regulatory capacity 
necessary in dealing with risks associated with bio¬ 
technology as well as in addressing the issues of prop¬ 
erty rights development and protection that are 
essential to promoting innovative research. This is 
particularly important because of the high cost of 
undertaking the initial research and development in 
biotechnology [4]. 


c 




2288 


c 


Commercialisation of GM Crops: Comparison of Regulatory Frameworks 


As GM technologies are very recent and fast devel¬ 
oping, most governments are trying to keep pace by 
developing regulatory policies that reflect consumer 
demands and preferences affecting GM agricultural 
products. Almost all developed countries require prod¬ 
ucts derived from GM sources to be assessed both for 
their safety as foods and for their environmental 
impacts. However, there are considerable differences 
in the approaches taken by different countries. In the 
United States, analysis and approval mechanisms for 
GM foods have been subsumed into existing regula¬ 
tions governing the release of new foods, plants, and 
pesticides, whereas in the European Union, regulation 
of GM products requires considerable separate scrutiny 
(see chapter “► Commercialisation of GM Crops: 
Comparison of Regulatory Frameworks” by Devos 
et al.). Countries worldwide are in different stages of 
policy development, with the majority of the develop¬ 
ing countries still in the infant stage [4]. In setting up 
domestic legislation, developing countries seem to be 
paying increasing attention to international trade 
concerns [5]. 

India along with Argentina, South Africa, and 
others constitute a group of developing countries that 
aspire to develop domestic biotechnology through 
national public R&D and/or by creating incentives for 
the participation of multinationals as sources of tech¬ 
nology. They each have relatively liberal regulations as 
well as more explicit regulatory institutional arrange¬ 
ments [6]. Those developing countries with well- 
developed public agricultural research and extension 
systems (such as India) are well placed to benefit 
promptly from the new biotechnology by working in 
partnership or in parallel with private biotechnology 
and seed companies. Approving investments in those 
activities by the private sector - and the overall invest¬ 
ment climate - will allow the process of adaptation and 
adoption to move forward. The experiences in India, 
China, and South Africa all indicate that rapid and 
widespread adoption is then possible, including by 
small farmers [7]. 

The biosafety frameworks of the described coun¬ 
tries were selected based, primarily, on considering the 
following criteria: level of production and commercial¬ 
ization of GMOs, investments made and political 
commitment with the research and development par¬ 
ticularly in the field of GMOs, regional leadership in 


the adoption of GMOs and in the elaboration and 
implementation of biosafety legal frameworks, and the 
existence of a functional biosafety framework in place. 

Argentina 

Argentina is a major producer of agricultural products 
and the third largest producer of soybeans. Initially, the 
harvested area of soybean was 36,000 ha (59,000 mt) in 
1970, increasing to 5.98 million hectares in 1995/1996 
(12.43 mmt). The country has historically been the 
earliest and most aggressive adopter of GM crops in 
Latin America, first planting glyphosate-tolerant soy¬ 
beans in 1996, which sparked a further expansion of soy 
production and which is now in excess of 14 million 
hectares, of which at least 98% is GM [8]. This rate of 
adoption is far higher, and much faster, than that in the 
USA, which was the first country to introduce this 
technology [9]. In addition, Argentina also grows sig¬ 
nificant quantities of GM corn (Bt and glufosinate 
ammonium tolerant) and Bt cotton, comprising 40% 
and 20% of overall production in 2009 for these two 
crops, respectively [3, 10]. Available estimates place 
accumulated benefits (extra income which would 
have not been generated in the absence of the technol¬ 
ogy) until the year 2001/2002 at approximately US$5.2 
billion in the case of soybeans, about US$400 million 
for Bt maize, and approximately US$40 million for Bt 
cotton [7]. Argentina is now third only to the United 
States and Brazil in terms of the area planted with 
transgenic crops (22.9 million hectares) [2] and is 
thus a very important player in the international 
arena. Notably, Argentina signed, but has yet to ratify, 
the Cartagena Protocol on Biosafety (CPB, hereinafter 
referred to as “the Protocol”) of the Convention of 
Biological Diversity in May 2000. Argentina is currently 
undergoing a consultation process, analyzing and 
debating with all the involved sectors the position the 
country will take in this respect [8]. 

Regulatory Oversight in Argentina 

Argentina was one of the first countries to establish 
a system of regulatory oversight for GMOs [11]. It 
has instituted regulatory measures for the safe devel¬ 
opment and application of biotechnology in general 
and GMOs in particular. It chose to develop the 
policy on biosafety within the context of trade-related 
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issues [12] and as such has policies, procedures, and 
institutional arrangements to regulate the develop¬ 
ment, importation, and export of GMO products. 
The Argentine biosafety system is based on guidelines, 
not on legislation. Argentina’s legislative framework for 
regulating GMOs is based on the existing agricultural 
regulatory system (e.g., for plant protection chemicals) 
supplemented with GM crop-specific regulations 
established to specify conditions for environmental 
release or to assess food safety. The non-statutory 
guidelines include standards for facilities and practices 
designed to prevent the unintended release of a GMO, 
conditions of isolation, monitoring field trials, and 
standards for risk assessment for conducting the 
environmental release. This approach gives the system 
flexibility and allows for changes needed to keep up 
with scientific advances. One disadvantage, however, is 
that compliance with guidelines is not legally enforce¬ 
able; there is no way to prosecute offenders in the rare 
cases of non-compliance that have occurred [13]. 

Similar to the USA and Canada (see below), Argen¬ 
tine biosafety regulation follows a product-based 
approach which results in several agencies, all within 
the Agriculture Directorate of the Secretariat of Agri¬ 
culture, Livestock, Fisheries and Food (SAGPyA), man¬ 
dated to regulate GM crops and products. The National 
Advisory Commission on Agricultural Biotechnology 
(CONABIA) is the lead agency in charge of regulating 
GM crops and was established in 1991 by Resolution 
124/91 (later expanded by Resolution 669/93) of the 
SAGPyA to provide advice and oversee the implemen¬ 
tation of biosafety regulations [14]. CONABIA’s juris¬ 
diction and procedures were established in Resolutions 
656/92, 837/93, and 289/97 (later replaced with 39/03) 
[13]. Resolution 39/03 is part of the general regulatory 
system governing the existing agricultural regulations 
in Argentina related to plant protection (Decree-Law of 
Agricultural Production Health Defense 6704/66 and 
its amendments), seed and phytogenetic creations 
(Seed and Phytogenetic Creations Law 20.247/73 and 
its regulatory decree), and animal health (Law of 
Veterinarian Products, and Supervision of Creation 
and Commercialisation 13.636/49). 

CONABIA is a multidisciplinary and inter- 
institutional organization with advisory duties and 
comprises representatives from the public sector, aca¬ 
demia, and private sector organizations related to 


agricultural biotechnology. Its main responsibility is 
to assess the potential environmental impact of the 
introduction of GMOs in Argentine agriculture [8]. 
The Commission handles applications for laboratory 
and greenhouse testing, field trials, and governs the 
“flexibility status” of release conditions (unconfined 
release, usually large-scale, for regulatory purposes or 
off-season seed multiplication) of GM plants [13]. 
Resolution 60/2007 provides a differentiated treatment 
for the authorization of the breeding of parental 
material which contains transgenic events already 
approved for commercialization. Furthermore, it 
advises SAGPyA on the issuance of necessary licenses 
and authorizations for experimentation and/or envi¬ 
ronmental release of GM microorganisms, as well as 
GMO-derived or GMO-containing products (although 
the final decision is made by SAGPyA) [6]. In order to 
obtain the appropriate marketing license, varieties 
must also comply with requirements stipulated by the 
National Service of Health and Agrofood Quality 
(SENASA) [15]. SENASA’s jurisdiction concerning the 
oversight of GMO-derived food was established in 
Resolution 289/87, while Resolution 511/98 including 
Annexes established the food-safety review criteria 
[13]. The latter was based on FAO and WHO docu¬ 
ments, as well as on relevant regulations from Australia, 
Canada, the EU, Japan, and the US, but has since been 
replaced by Resolution 412/2002. 

A key part of the GMO regulatory process consists 
of verifying that the commercial approval will not have 
a negative impact on Argentina’s foreign trade. This 
specific assessment is carried out under Resolution 
39/03 by the National Bureau of Agrifood Markets 
(NBMA), and it includes an analysis of the current 
status of regulatory systems and public acceptance in 
the importing countries. The National Seed Institute 
(ex-INASE) is responsible for ensuring that all the 
necessary requirements for registration in the National 
Registry of Cultivars have been established. Ex-INASE 
plays a further role in the biosafety system by receiving 
and logging applications for GMO field trials. Applica¬ 
tions containing confidential business information are 
kept secure at INASE’s offices. Agency personnel per¬ 
form field test site inspections, checking for compliance 
with the biosafety requirements set by CONABIA [13]. 

During 2001, the SAGPyA actively cooperated 
with members of the Argentine Congress in drafting 
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a biosafety law. This draft represented a major 
improvement on the current situation, since it clearly 
set forth a conceptual framework, as well as issues and 
instances to be considered as participants in risk 
analysis procedures. But due to the institutional and 
economic crisis that broke out on December 2001, the 
draft was never discussed in Congress, and there is no 
evidence that it will be discussed in the near future [8]. 

Commercialization of GM Crops: Argentine 
Approval Process 

When an organization intends to obtain an authoriza¬ 
tion for commercialization of a GM crop in Argentina, 
it has to pass reviews by the three regulatory agencies. 
Briefly, CONABIA should determine that the environ¬ 
mental impact of the large-scale release of the GM crop 
will not significantly differ from that of its non- 
modified counterpart; SENASA’s Technical Advisory 
Committee on the Use of GMOs should determine 
that the derived foods are safe for human and animal 
consumption, and the DNMA should determine that 
the release will not have an undesired impact on the 
country’s international trade [16]. 

The prerequisite for entering the commercial eval¬ 
uation process is that authorizations for experimenta¬ 
tion and/or release into the environment of the specific 
GM crop have been previously granted [16]. After at 
least one release into the environment has been 
approved and the safety of the GM crop has been 
demonstrated, the applicant can apply for a 
“flexibilization” permit which allows future releases 
by simply providing notification of the location, area, 
sowing date, and intended harvest date [11]. Flexibility 
status conditions are granted for the following 
purposes [13]: 

• For providing testing material 

• For export 

• For off-season seed multiplication (not for use in 
Argentina) 

• For tests, which need to be presented at later stage 
(e.g., variety registration) 

• For pre-commercial seed multiplication for 
a pending variety registration 

The deregulation of field testing conditions is 
dependent on the results of the biosafety assessment 


conducted by CONABIA with regard to the criteria laid 
down in Resolution 131/98. These include the charac¬ 
terization of the GMO (recipient organism, genetic 
modification, insert, donor organisms, phenotypic 
characterization, potential environmental interactions 
of GMO) and the impacts expected from the produc¬ 
tion of the GM crop at commercial scale (environ¬ 
mental effects and impact on human health) [16]. If 
SAGPyA, on the recommendation of CONABIA, 
authorizes “flexibility status” release conditions for 
the GM crop in question, the applicant only needs to 
submit information on the area to be sown, the date of 
sowing, the site of release and the harvest date [ 16] . The 
flexibility status of a GM crop allows large-scale plant¬ 
ing, but not planting for commercial purpose. 

The second step to commercialization is the evalu¬ 
ation of the safety of the GM crop for human consump¬ 
tion and feed. This evaluation is carried out by 
SENASA under Resolution 412/2002 [17]. In the third 
step of the commercialization process, the NBMA 
assesses the impact of the GM crop in question on 
export market security. It does this by analyzing the 
status (if any) of the specific event in the destination 
markets and, as a result, whether the addition of this 
event to Argentina’s export supply might represent 
a potential barrier to the access to these markets. 

After completion of all of the steps mentioned 
above, CONABIA’s Office of Technical Coordination 
compiles all pertinent information and prepares 
a “Project of Resolution” on the basis of its own, 
SENASA’s, and DNMA’s assessments and submits it to 
the SAGPyA, which takes the final decision on approval 
or denial of the commercialization request [ 13] . Should 
the GM crop be authorized for open cultivation, it 
must also be registered in the National Registry of 
Cultivars, a process overseen by ex-INASE. For those 
GM crops expressing either herbicide tolerance or 
insect resistance, they require a pesticide approval 
from SENASA prior to their commercial use [13]. 

Canada 

Regulatory Oversight in Canada 

In line with a similar approach adopted in the United 
States (see below), the regulatory framework 
established in Canada is based on the extension of the 
existing regulations to GMOs [18]. However, in 
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contrast to all other countries, Canada relies on the 
concept of novelty to trigger regulatory oversight, 
thereby enabling the regulation of a wider array of 
novel seeds or food [19], and includes those produced 
by conventional breeding, mutagenesis, or rDNA tech¬ 
niques. Directive 94-08, first published in 1994, defines 
these as “plants containing traits not present in plants 
of the same species already existing as a stable popula¬ 
tion” [20]. 

The Canadian Environmental Protection Act 
(CEPA) of 1988 formally recognizes biotechnology as 
a manufacturing process for products potentially pos¬ 
ing environmental risks, and therefore, the act requires 
environmental assessments. CEPA, however, embraces 
a product-based approach to biotechnology, an 
approach explicitly defined in the 1993 Regulatory 
Framework for Biotechnology [21]. In accordance 
with this framework, policy-makers proceeded to the 
amendment of a series of regulations (below) contigu¬ 
ous to the laws governing the products of biotechnol¬ 
ogy. These amendments were mostly aimed at 
inscribing a trigger (the novelty of the trait) launching 
the risk evaluation process for the products of 
biotechnology. 

The Canadian approach is based on an agreement 
between the Canadian Federal agencies in 1993 that was 
renewed in 1998. The responsibility for regulating 
plants with novel traits (PNTs), including GM plants, 
is shared between the Canadian Food Inspection 
Agency (CFIA) and Health Canada [18]. The CFIA 
operates under the authority of the Seed Act, the 
Plant Protection Act, the Feeds Act, the Fertilizer Act, 
and the Health of Animals Act. It also shares some 
responsibilities with Environment Canada under the 
Canadian Environmental Protection Act, and with 
Health Canada under the Pest Control Products Act, 
and the Food and Drugs Act. The Canadian Environ¬ 
mental Protection Act is an umbrella legislation 
intended to serve as a regulatory “safety net” for any 
biotechnological products not currently regulated by 
another federal act. The Department of Fisheries 
and Oceans regulates aquatic organisms under the 
Fisheries Act. 

In 1997, the CFIA took over the risk management of 
novel seeds and feeds from Agriculture and Agri-food 
Canada. The agency regulates novel plants following 
assessment criteria provided by Directive 94-08. In 


particular, the CFIA is responsible for the regulations 
and guidelines dealing with cultivating PNTs, assessing 
their impact on the environment and biodiversity. 
Canadian authorities state that “all plants derived 
through genetic engineering have been considered 
novel, and as such have undergone a full, comprehen¬ 
sive, and rigorous safety assessment prior to release 
into the environment” [22]. In addition, the agency is 
also in charge of ensuring livestock feed safety, along 
with the responsibility for the regulation of seeds, 
veterinary biologies, and fertilizers. Furthermore, the 
CFIA develops standards related to the packaging, 
labeling, and advertising of foods and handles all 
inspection and enforcement duties [23]. 

Many GM crops are destined, in whole or in specific 
parts, for the human food supply system. For this 
reason, they must not only obtain CFIA approval but 
must also be assessed by Health Canada. It is within the 
jurisdiction of Health Canada to regulate GM foods 
according to the Food and Drugs Act under Division 28 
of Part B of Food and Drug Regulations (Novel Food). 
As with seeds, the trigger for pre-market safety assess¬ 
ments is novelty, and as such, Health Canada treats as 
novel food all those derived from GMOs “whether it 
is a micro-organism, a plant or an animal, such that 
it exhibits characteristics that were not previously 
observed, no longer fall within the anticipated range 
or no longer exhibits characteristics that were 
previously observed, for that plant, animal or micro¬ 
organism” [23]. 

Environment Canada is only responsible for the 
environmental assessment of GMOs used in industrial 
processes. Although the Canadian Environmental Act 
requires environmental risk assessments for GMOs, the 
responsibility for conducting assessments relevant to 
novel plants, feeds, and food rests with the CFIA, 
PMRA, and Health Canada. Notably, CFIA, as the 
agency most involved in the environmental risk assess¬ 
ment of GMOs, is the responsibility of the agriculture 
minister whose mandate is to promote agricultural 
development. 

Commercialization of GM Crops: Canadian 
Approval Process 

Before crops with novel traits may be authorized for 
unconfined release, they must be fully assessed for 
environmental safety by the CFIA. In meeting the 
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extensive information requirements for these applica¬ 
tions to the CFIA, applicants will have conducted 
experiments at the earlier confined release stage. 
These experiments are expected to contribute scientif¬ 
ically robust data to address the key criteria of environ¬ 
mental safety assessments. The applicant is required to 
provide the Plant Biosafety Office (PBO) at the CFIA 
with extensive high-quality, statistically sound data 
and/or valid scientific rationale to demonstrate the 
environmental safety of the PNT. This information 
initiates a review and decision for authorization of the 
release. CFIA officials also use pertinent information 
generated from the Agency’s own research, either 
conducted in-house or contracted out, on specific key 
environmental areas [18]. 

The unconfined release assessment by the PBO 
focuses on real or potential interactions of the PNTs 
with the wider agricultural and ecological environ¬ 
ment, using “substantial equivalence” as the basis for 
these assessments. Evaluations consider the unique 
combination of species and traits, using standard 
descriptions of each species known as biology docu¬ 
ments as a baseline for comparison. If the PBO con¬ 
cludes that there is minimal potential for significant 
negative environmental impact of the PNT relative to 
its unmodified counterpart, an unconfined environ¬ 
mental release may be authorized. In some cases, the 
PBO may authorize an unconfined release with condi¬ 
tions, such as a requirement that the applicant ensures 
that users of an insect-resistant PNT deploy methods 
to delay development of resistance among insect 
populations. Note that for species that may be used 
for food or feed, developers of PNTs must also seek 
approvals from Health Canada for human food use and 
from the CFIA Feed Section for livestock feed use [ 19] . 
The starting point for the safety assessment of novel 
foods is also based on “substantial equivalence,” where 
the novel food is evaluated relative to conventional 
counterparts that have a history of safe use [23] . Health 
Canada has 45 days to decide whether the product is 
safe or to request additional information to pursue the 
risk analysis, even to the extent of involving experi¬ 
ments [23]. 

All imported PNTs (or products derived from 
them) require a prior import permit, being subject to 
the CFIA regulatory review under the Plant Protection 
Act and Regulations. Pest risk assessments (PRAs) are 


conducted by the Plant Health Division in order to 
evaluate the potential capability of PNTs to pose 
a pest risk to the agricultural and forestry environment. 
Those commodities determined not to pose a plant 
pest risk are now no longer required to have an import 
permit. Additional exempted commodities include: 
PNTs with prior approval; PNTs (or products derived 
from them) that are incapable of sexual or asexual 
propagation, i.e., have been processed in some way to 
render them non-viable, such as by grinding or freez¬ 
ing; and plants further developed from exempted 
PNTs, or considered substantially equivalent to them 
provided that the intended use is similar, and that the 
plants do not display any additional novel traits, do not 
contain novel genetic elements, and have not been 
subject to inter-specific breeding [24]. 

China 

China has become one of the Asian leaders in biotech¬ 
nology and has dedicated substantial economic, scien¬ 
tific, and technological resources to R&D. Since the 
1980s, ministries and relevant government agencies in 
China have been investing significantly in agro- 
biotechnology research and have established more 
than 150 laboratories, resulting in the largest plant 
biotechnology capacity outside of North America 
[25]. The government has allocated research budgets 
to biosafety and management, and nearly all biotech¬ 
nology research programs have expanded their scope 
into biosafety issues [5]. The commitment to sustain 
biosafety after project closure is demonstrated by its 
growing budget to support agricultural research in 
biosafety over the last few years. From an initial budget 
of slightly over US$ 80,000 in 1999, China now spends 
about US$ 3 million annually on agricultural biosafety- 
related activities [10]. 

A wide variety of crops and traits has passed 
through China’s biosafety system and are now planted 
commercially, while many others remain at the field 
trial stage, including many varieties with adaptation- 
related traits developed by Chinese institutes and com¬ 
panies [26] . In contrast to the other countries described 
in this chapter, most of these crops have been devel¬ 
oped predominantly by public sector laboratories in 
China. The biosafety regulatory system in China has 
also reviewed a large number of applications since it 
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was formally set up in the late 1990s. The government 
received 1,044 applications for field trials or commer¬ 
cial release, and 777 of these were approved [27, 28]. 
These applications predominantly covered 60 crops, as 
well as several animals and a large number of microor¬ 
ganisms [28]. Varieties of cotton, tomatoes, phytase 
maize, insect-resistant rice, and sweet and chili peppers 
have all been approved for commercial planting [3]. 

Regulatory Oversight in China 

China has adopted a policy that promotes research and 
development of biotechnology, while at the same time, 
retaining control over research in genetic engineering. 
In the early 1990s, China had already implemented 
a very pragmatic approach to GM crop regulation. 
Regulations were basically product-based with special 
attention given to the economic interest of a given 
application. By 1993, China had already established 
its first biosafety regulation, namely the “Safety 
Administration Regulation on Genetic Engineering” 
issued by the Ministry of Science and Technology 
(MOST). This instrument required relevant ministries 
to draft and issue corresponding biosafety regulations 
on biological engineering (i.e., the Ministry of Agricul¬ 
ture (MOA) for agriculture and the Ministry of 
Public Health for food safety) and established general 
principles, safety categories, risk assessment and risk 
management procedures, application and approval 
mechanisms, and legal responsibilities [29]. It was 
followed in 1996 with the “Safety Administration 
Implementation Regulation on Agricultural Biological 
Genetic Engineering” by the MOA [12]. This was an 
explicit regulatory regime for the risk assessment and 
management of agricultural products of genetic engi¬ 
neering. Labeling was not part of this regulation, nor 
was any restriction imposed on imports or exports of 
GM products. The regulation did control GMOs for 
research and commercial production, as well as 
establishing the National Agricultural GMO Biosafety 
Committee (Biosafety Committee) to provide the 
MOA with expert advice on biosafety regulations. 

Criticism of GM crops on environmental, food- 
safety, and ethical grounds, however, led to some 
significant changes in the Chinese legal framework on 
agro-biotechnology [5]. In 2001, the State Council 
decreed a new and general rule on biosafety, with the 


aim of protecting human, animal, and plant health and 
the environment. This new “Regulations on Safety of 
Agricultural Genetically Modified Organisms” replaced 
the 1993 Regulation issued by MOST. The 2001 regu¬ 
lations provide the MOA with overall national author¬ 
ity to oversee the use of GM crops, whereas the 31 
provincial biosafety management offices are responsi¬ 
ble for the supervision and administration of biosafety 
in their respective areas [30]. The 2001 regulations 
meet the generally accepted risk assessment procedures 
outlined in the relevant international instruments 
and also stipulate a comparative risk assessment 
approach, in which a GM crop is compared with the 
corresponding non-transgenic crop for environmental/ 
ecological safety and food safety. 

To implement this Regulation, the MOA issued 
three implementation regulations including Imple¬ 
mentation Regulations on Safety Assessment of Agri¬ 
cultural Genetically Modified Organisms, which 
provided the legal basis and technical guidelines in 
GM crops risk assessment in China [31]. These new 
regulations primarily concerned Biosafety Evaluation, 
Import Safety, and Labeling and included several 
important changes to existing procedures and details 
of regulatory responsibilities after commercialization. 
The changes included an extra pre-production trial 
stage prior to commercial approval, new processing 
regulations for GM products, labeling requirements 
for marketing, new export and import regulations for 
GMOs and GMO products, and local and provincial- 
level GMO monitoring guidelines [29]. Specifically, the 
Regulation on Biosafety Evaluation establishes proce¬ 
dures for handling applications for GM cultivation and 
sets up an advisory body, the National Biosafety Com¬ 
mittee (NBC), and a decision-making body, the Office 
of Agricultural Genetic Engineering Biosafety Admin¬ 
istration (OGEBA), under the MOA to handle applica¬ 
tions. Applicants must provide information on risk 
assessment, and GMOs are classified into four classes 
depending on their potential danger to human and 
animal health and to the environment. The Regulation 
establishes the requirements that should be met to 
obtain authorization to import GMOs and will vary 
according to the intended purposes of the imports, 
i.e., research, release into the environment, or processing. 
In response to representations from GM-producing 
countries, China agreed to allow trade to continue as 
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normal until the new Regulation on Safety of Imports 
entered into force on 20 April 2004 [32, 34]. The 
Ministry of Public Health (MPH) is responsible for 
food-safety management of biotechnology products 
(processed products based on GMOs) and promul¬ 
gated its first regulation on GMO food safety in April 
2002, to take effect after July 2002. 

China’s policy on GM regulation is now under the 
responsibility of an agency which was established by the 
State Council, the name of which has been variously 
reported as either the Joint Monitoring and Manage¬ 
ment Commission [12] or the Allied Ministerial 
Meeting [29]. It has a multi-stakeholder membership 
comprising the highest representatives from ministries 
like Agriculture, Health, Commerce, Science and Tech¬ 
nology, the National Development and Reform Com¬ 
mission, the National Inspection and Quarantine 
Agency, and the State Environmental Protection 
Administration. It is responsible for the coordination 
of key issues related to the biosafety of agricultural 
GMOs, the examination and approval of the applica¬ 
tions for GMO commercialization, determining the list 
of GMOs for labeling, and establishing import or 
export policies for agricultural GMOs and their prod¬ 
ucts. In addition, under the new regulation, foreign 
investment in biotechnology has been prohibited [5]. 

The NBC remains the major player in the process of 
biosafety management. Currently, the NBC is com¬ 
posed of 56 members who come from different admin¬ 
istrative departments, academic institutions, etc. 
They are experts in biological research, production, 
processing, inspection and quarantine, public health, 
and environmental protection with respect to agricul¬ 
tural GMOs [31] . The committee meets twice each year 
to evaluate all biosafety assessment applications related 
to experimental research, field trials, environmental 
release, pre-production trials, and commercialization 
of agricultural GMOs. It makes recommendations to 
the OGEBA based on the results of its biosafety assess¬ 
ments. OGEBA is responsible for the final approval of 
decisions, as well as handling routine work and daily 
operations [29]. In 2005, all 31 provinces in China 
established biosafety management offices. These bio¬ 
safety management offices collect local statistics on and 
monitor the performance of research and commercial¬ 
ization of agricultural biotechnology in their provinces 
and assess and approve (or disapprove) all applications 


of GM-related research, field trials, and commerciali¬ 
zation in their provinces. Only those cases that are 
approved by provincial biosafety management offices 
are submitted to the NBC for further assessment [29]. 

In May 2007, the National Development and 
Reform Commission in China announced that it had 
approved the establishment of a National Biosafety 
Research Centre. To be completed by 2009, the Centre 
will manage agricultural and biological-related issues. 
It is to house several research departments, including 
laboratories for high-risk plant pathogens, insects, and 
plants, as well as units for agriculture-related informa¬ 
tion analysis and quarantine facilities. The Centre will 
be supervised by the Plant Protection Institute of the 
Chinese Academy of Agricultural Sciences. 

In 2009, a major development in China was the 
commercialization of transgenic Bt rice, which reached 
pre-production trials but was still pending final com¬ 
mercialization approval for several years. Influenced in 
part by opposition to the technology in Europe and to 
some extent Japan, the final approval of many crops 
stalled at the level of China’s inter-ministerial commit¬ 
tee even as research and field trials continued apace. 
The discovery by Greenpeace that some transgenic seed 
was planted by farmers without authorization caused 
international debates about China’s biosafety system 
and may have contributed to regulatory approval 
delays [33]. 

Commercialization of GM Crops: Chinese Approval 
System 

If the product is for cultivation, applications must be 
authorized before the first import of a specific GMO 
can take place and must be accompanied by a safety 
assessment carried out in the country of origin of the 
GM material [5]. For permanent approval of each 
imported GM product, compulsory field trials are car¬ 
ried out in China in order to re-assess safety within the 
Chinese context [35] . Generally, the practice in China is 
to use a comparative risk assessment approach, in 
which the transgenic crop is compared with the 
corresponding non-transgenic crop in ecological risk 
assessment and hazard identification in transgenic 
foods [36, 37]. Although there are no official guidelines 
in China for risk assessment on food derived from 
transgenic crops, the assessments carried out so far on 
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nutrition, toxicity, and allergenicity generally followed 
the relevant Codex principles and guidelines [31]. 

In China, agricultural GMOs also need to satisfy the 
procedures governing the release of new seed varieties. 
These procedures are governed by the Seed Law in 
China. Only agricultural GMOs that have previously 
obtained a biosafety certificate are eligible to be classi¬ 
fied as a new seed variety in accordance with the Seed 
Law and relevant regulations. After the GMO has 
passed seed variety testing and received the permission 
for production, it is eligible to enter into the chain of 
production and marketing [31]. 

India 

In India, a wide variety of crops have been field trialed, 
but most have not yet been commercialized. The first 
approval for the commercial production of any GM 
crop in India occurred in March 2002 when the Indian 
competent authority approved three varieties of 
Bacillus thuringiensis (Bt) cotton (MECH 12, MECH 
162, and MECH 184 expressing the cry 1 Ac gene for 
insect resistance) amid widespread protests by anti- 
GM activists. This was followed by a significant 
increase in the availability of Bt cotton (currently 809 
hybrid varieties) [38] better suited to Indian cultiva¬ 
tion. By 2003, more than 34 genes were being tested in 
a wide variety of crops, including cotton, rice, mustard, 
maize, potatoes, eggplant, tomatoes, pigeon pea, and 
cabbage [29]. Most of the varieties initially introduced 
included insect-resistant cotton varieties as well as 
some crops modified with herbicide tolerance. Several 
Bt crop varieties have passed through the field trial 
stage and have received approval for commercial 
planting. The primary technology used in India origi¬ 
nated in the Monsanto Company, which partnered 
with Maharashtra Hybrids Company (MAHYCO) to 
develop transgenic hybrid Bt cotton for sale in India. 
Once some Bt varieties had been approved for com¬ 
mercial planting, it was discovered that 800,000 ha of 
unapproved BT cotton had been planted, weakening 
confidence in the biosafety system [29]. The latest 
GM crop awaiting regulatory approval for commercial 
release is Bt brinjal (known also as aubergine or 
eggplant), which received positive assessments by the 
regulatory bodies based on years of field trials [3] but 
was refused authorization by the Environment Minis¬ 
ter at the last stage of the commercialization process. 


Regulatory Oversight in India 

With the signing of the CPB in 2001, India became 
committed to introducing structures and procedures 
commensurate with the conditions laid down in the 
CPB agreement - one of the main guiding principles 
for India when dealing with products derived from 
agricultural biotechnology. This commitment pro¬ 
vided India with the incentive to strengthen its bio¬ 
safety capacity and have relevant institutional 
mechanisms at hand to enable the proficient dealing 
of GMOs. As the CPB places due importance to 
national legislations, provided it is developed in accor¬ 
dance with the former, the existing domestic policy on 
GMOs was required to be fine-tuned and amended 
wherever necessary. The goal of the Indian regulatory 
system is therefore to ensure that their GM crops pose 
no major risk to food safety, environmental safety, or 
agricultural production and that there are no adverse 
economic impacts on farmers [29]. As such, the 
Government of India has adopted a policy of careful 
assessment of the benefits and risks of GMOs at various 
stages of their development and field release to ensure 
biosafety [39]. 

The existing regulatory framework takes the form 
of rules and guidelines and is based upon three specific 
provisions of the Environment Protection Act of 1986 
(EPA). These are sections 6, 8, and 25. While Section 6 
of the Act empowers the Central Government to make 
rules on procedures, safeguards, prohibition, and 
restrictions for handling of hazardous substances, 
Section 8 of the Act prohibits a person from handling 
hazardous substances, except in accordance with 
procedures and after complying with safeguards. 
Section 25 of the EPA empowers the Central Govern¬ 
ment to lay down rules regarding procedures and safe¬ 
guards for handling hazardous substances. Thus, the 
biosafety rules in India are statutory in nature as they 
originate from the EPA. These provisions of the EPA led 
to the adoption of the 1989 Rules for the Manufacture, 
Use, Import, Export and Storage of Hazardous Micro 
organisms Genetically Engineered Organisms or Cells 
(“1989 Rules”) [39, 40]. 

In 1994, the Department of Biotechnology revised 
its earlier guidelines of 1990, entitled “Revised Guide¬ 
lines for Safety in Biotechnology.” These revised guide¬ 
lines aimed at regulating large-scale production and the 


c 




2296 


c 


Commercialisation of GM Crops: Comparison of Regulatory Frameworks 


deliberate release of GMOs, plants, animals, and prod¬ 
ucts into the environment and shipment and importa¬ 
tion of GMOs for laboratory research [6]. By 2002, 
an array of legislation likely to impact biosafety 
regulations had come into existence. This included 
the National Biodiversity Act 2002 (NBA) and the 
Protection of Plant Varieties and Farmers’ Rights Act 
2001 (PPVFR), the latter of which derived from 
a broad-based consultation with a view to incorporate 
a form of farmers’ rights into the national plant variety 
rights legislation. The biosafety rules have since been 
supplemented by the Biotechnology Safety Guidelines 
issued by the Department of Biotechnology (DBT). 
These guidelines have been issued in pursuance of 
Rule 4 [2] of the Biosafety Rules, which require manuals 
of guidelines to be brought out by the Review 
Committee on Genetic Manipulation [40]. 

Therefore, the Indian biosafety regulatory frame¬ 
work, comprising the 1989 Rules and the 1990, 1994, 
and 1998 DBT guidelines, covers the entire spectrum of 
activities relating to GMOs. This includes “research 
involving GMOs, as well as genetic transformations of 
green plants, recombinant DNA (rDNA) technology in 
vaccine development, and large-scale production and 
deliberate/accidental release into the environment of 
organisms, plants, animals and products derived from 
rDNA technology.” Production facilities such as distill¬ 
eries and tanneries that use GMOs are also covered. In 
India, the risk assessment and regulatory approval for 
releases of GMOs and GM products are mandatory. 
The concept of “biosafety” used in the regulations is 
a broad one, covering the health safety of humans and 
livestock, environmental safety (ecology and biodiver¬ 
sity), and economic impact. The first two safety aspects 
dominate the regulations, while economic impact is 
given less prominence. 

Two nodal agencies, the Ministry of Environment 
and Forests (MoEF) and the DBT at the Ministry of 
Science and Technology, are responsible for the imple¬ 
mentation of the regulations [39]. The life cycle of 
a GM product features four domains, pre-research, 
research, release, and post-release, and is characterized 
by the presence of six competent authorities [41, 42]. 
The Recombinant DNA Advisory Committee (RDAC) 
is in the pre-research domain as it triggers research 
through its initial approval mechanisms. The Review 
Committee on Genetic Manipulation (RCGM) resides 


in the DBT and functions in the research domain, 
closely monitoring the process of research and experi¬ 
mental releases. It requests food biosafety, environmen¬ 
tal impact, and agronomic data from applicants who 
wish to do research or conduct field trials and will give 
permits to import GM material for research. Pursuant 
to Rule 4 [2] of the 1989 “Rules,” the RCGM is also 
required to produce manuals of guidelines. The RCGM 
is primarily made up of scientists (including agricul¬ 
tural scientists) and can request people with specialized 
knowledge to review cases. It has a Monitoring cum 
Evaluation Committee (MEC) that monitors limited 
and large-scale field trials of GM crops and is primarily 
made up of agricultural scientists. Commercial pro¬ 
duction of GM crops, large-scale field trials of GM 
crops, and the imports of GM commercial products 
and GM-derived products (e.g., foodstuffs, ingredients 
in foodstuffs, and additives including processing aids 
containing or consisting of GMOs) come under the 
authority of the Genetic Engineering Approval Com¬ 
mittee (GEAC) at the MoEF. The committee members 
are primarily bureaucrats representing different minis¬ 
tries, and they draw on the scientific expertise of each 
ministry. Additional to these national committees are 
the State Biotechnology Coordination Committee 
(SBCC) and the District Level Biotechnology Commit¬ 
tee (DLC), who, along with the MEC, basically occupy 
the post-release domain, although they also contribute 
to the research domain activities through data provi¬ 
sioning to the RCGM. Completing the regulatory 
apparatus are the Institutional Biosafety Committees 
(IBSC) which undertake the monitoring and imple¬ 
mentation of safeguards at the R8cD sites, under the 
close supervision of the RCGM, the SBCC, and the 
DLC. IBSCs must be established in any public or 
private institute using rDNA in their research and 
comprise scientists from their respective institutes 
and a member from the DBT. There are more than 
230 IBCs in India, of which 70 deal with agricultural 
biotechnology. They can approve contained research at 
institutes unless the research uses a particularly hazard¬ 
ous gene or technique which will require specific 
approval from the RCGM [29]. In general, these 
authorities are vested with non-overlapping responsi¬ 
bilities [39, 43]. 

Under the Constitution of India, it is not the central 
Government of India but the state governments that 
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exercise formal authority over agriculture. Thus, while 
the national government may take the initiative in the 
policy arena and formulate policies concerning agricul¬ 
tural biotechnology and GM crops (in R&D as well as 
commercialization), as well as being where the deci¬ 
sion-making process resides, the agreement and active 
cooperation of state governments are indispensable for 
their implementation [44]. 

The multitude of rules and regulations underline 
the complexities involved in biosafety as it cuts across 
ministries and agencies and does not merely govern 
environmental issues. Most of these regulations deal 
with GMOs in seclusion without referring to a 
common agency or secretariat to deal with the risks 
that are associated with the organism [46], resulting in 
the biosafety regulations being subjected to criticism 
both by industry and civil society groups. While indus¬ 
try associations consider these regulations as affecting 
their growth, civil society groups consider biosafety 
regulations as not being strong enough to check the 
introduction of potentially harmful biotechnology 
products. Since 2004, there have been serious discus¬ 
sions in India on re-engineering the structure of 
biosafety regulations. The primary objective of the 
exercise is to cut down red tape and ensure greater 
transparency in decision-making. Calls have come for 
the replacement of the present regulatory system (with 
its dispersed, unclear, and confusing mandates; respon¬ 
sibilities; and powers) by a new, single, integrated, and 
professionally led authority, the National Biotechnol¬ 
ogy Regulatory Authority (NBRA), with a comprehen¬ 
sive mandate and a wide range of responsibilities, with 
the power to implement the regulatory regime with 
speed and efficiency [40,44] and to help the assessment 
of risks and benefits associated with GM crops in 
a credible and transparent manner [47]. In May 2007, 
it was announced that the NBRA will be fully functional 
in 2 years time and will be administered by the DBT to 
expedite the application of biotechnology in the agri¬ 
culture, veterinary, and medicine sectors [48]. 

The current amendments or changes that have 
favored the industry relate to changes in the 1998 
revised guidelines for research in transgenic plants, 
whereupon a relaxation was permitted regarding the 
concept of deliberate release. This amendment, by 
conferring powers to the RCGM to permit limited 
conduct of field trials in multi-locations, was at 


variance with the 1989 Rules that prohibited deliberate 
or unintentional release for experimental purposes, 
except where the GEAC approved it as a special case. 
The distinction between small-scale and large-scale 
releases brought about by the changed guidelines was 
unusual and was designed to ensure the control of the 
DBT and the RCGM over initial field testing of trans¬ 
genic crops. An amendment was made by the DBT in 
September 1999 conferring rights to the RCGM to 
approve small experimental field trials for research, 
limited to a total area of 20 acres in multi-locations 
with any one location not exceeding 1 acre. Through 
this amendment, the DBT removed small experimental 
trials for research from the deliberate release clause of 
the 1989 Rules [40]. 

The changes that have been made to accommodate 
civil society concerns are basically twofold. The first 
relates to the formation of the MEC by the DBT in 1998 
in order to closely and objectively monitor private 
sector biosafety data and through the mandatory 
involvement of state-level agricultural university scien¬ 
tists. The second change, which was induced by the Bt 
cotton controversy in India, has been the introduction 
of allergenicity tests of transgenic seeds, leaves, and 
vegetables on rodents, rabbits, guinea pigs, and goats 
in the 1998 version of the Biotechnology Safety Guide¬ 
lines [42]. This precautionary step is viewed by the 
industry as having contributed to the delay in the 
regulatory approval for Bt cotton [40]. Additional reg¬ 
ulations have recently been added to the PPVFR Act of 
2001, requiring applicants to provide relevant GEAC 
clearances and approvals for registering transgenic 
varieties, as well as an affidavit stating that the “Termi¬ 
nator” Technology or the Genetic Use Restriction 
Technology is not involved [52]. Notably, the deci¬ 
sion-making circle does not include the participation 
of industry, civil society, or consumer groups. While 
the 1989 Rules explicitly say that the RCGM, the GEAC, 
the SBCCs, and the DLCs may co-opt other members/ 
experts as necessary, they neither include nor exclude 
representatives of NGOs and the private sector. In 
practice, however, these non-governmental stake¬ 
holders have been excluded [44]. However, following 
the “Terminator” controversy, the National Bureau of 
Plant Genetic Resources is now mandated by the gov¬ 
ernment of India to develop probes to detect the pres¬ 
ence of terminator genes in imported material, 
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highlighting how the force of public opinion can still 
shape biosafety rules in India. 

To keep up with the rapid pace of developments in 
plant biotechnology, especially GMOs, the Indian reg¬ 
ulatory system revised its existing guidelines in 2008. 
These were to provide greater clarity on data require¬ 
ments and include: Standard Operating Procedures 
(SOPs) for Confined Field Trials of Regulated, Genet¬ 
ically Engineered (GE) Plants [49], Guidelines for the 
Safety Assessment of Foods Derived from Genetically 
Engineered Plants [50], and Protocols for Food and 
Feed Safety Assessment of GE crops [51]. Guidelines 
for Institutional Biosafety Committees (IBSCs) are also 
currently under review. 

While the overall regulatory system remains 
unchanged, a notable difference is the classification of 
all GM field trials into two categories, based on size. 
The RCGM, operating in the DBT, is now the regula¬ 
tory authority for Biosafety Research Level I (BRLI) 
trials. BRLI trials are limited in size to no more than 
1 acre (0.4 ha) per location and a maximum cumulative 
total of 20 acres (8.1 ha) for all locations for each plant 
species/construct combination (e.g., one or more 
events originating from the transformation of a plant 
species with the same genetic construct), per applicant, 
per crop season. The GEAC, operating in the MoEF, is 
now the regulatory authority for Biosafety Research 
Level II (BRLII) trials. BRLII trials are limited in size 
to no more than 2.5 acres (1 ha) per location, and the 
number of locations is decided on a case-by-case basis 
for each plant species/construct combination, per 
applicant, per crop season. 

Members of the MEC, SBCCs, and DLCs and mon¬ 
itoring teams of SAUs have the authority to inspect and 
monitor confined field trials at the time of planting, 
during the growing and harvesting season, and during 
the period of post-harvest land-use restriction for com¬ 
pliance with the terms and conditions of authorization. 

Commercialization of GM Crops: Indian Approval 
Process 

The approval process in India begins with the submis¬ 
sion of an application regarding a new LMO event with 
potential benefits over the conventional variety/hybrid 
in terms of economic benefit to the farmer and/or the 
environment. The developer is required to follow a set 


procedure that involves providing all the necessary 
information specified by the regulatory body. Such an 
application is reviewed by the RCGM which, in the first 
instance, may recommend various limited contained or 
open field trials to be undertaken in order to generate 
specific biosafety data which may be lacking in the 
original submission. Once the full dossier of informa¬ 
tion from experiments undertaken under confined 
conditions is submitted, the RCGM will then ascertain 
as to whether the LMO presents any immediate adverse 
effects, either to humans, animals, and the environ¬ 
ment (including the likely impact of large-scale culti¬ 
vation on biodiversity). If considered as presenting 
minimal risk, the RCGM may permit large-scale open 
field trials to be conducted to generate data concerned 
with the agronomic performance of the LMO. Once 
more, possible adverse impacts on the environment, 
including on non-target organisms, are evaluated. The 
unconfined, open field trials are conducted either by 
the applicant or by the ICAR, involving their institutes/ 
State Agriculture Universities, and are monitored by 
the MEC. The MEC reports their observations directly 
to the RCGM and the GEAC. Based on the biosafety 
data and the field performance, the RCGM may rec¬ 
ommend the case to the GEAC for further evaluation. 
The GEAC will consider all the data provided and may 
ask the company to furnish additional data or repeat 
the trials in multi-locations during the next season. 
Based on the overall recorded benefits, the GEAC can 
approve the commercialization of the GM crop for 
a limited period and in a specified geographical zone. 
Data collected during this period will form the basis of 
the review undertaken by the GEAC for any extension 
or expansion to the set conditions of commercializa¬ 
tion. Any adverse impact on human, animals, and 
environment derived from such large-scale cultivation 
is required to be immediately notified by any individual 
or organization directly to the GEAC. 

All commercial authorizations are for a limited 
period, requiring renewal after the expiry period. Fur¬ 
thermore, approval is conditional upon the observing 
and collecting of relevant information on the risks, if 
any, arising from the commercial use of the GMOs and 
products thereof [42]. A key addition in the 1998 
guidelines is the requirement to generate data on com¬ 
parative economic benefits of a modified plant. Thus, 
the 1998 guidelines call for a demonstration that 
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a transgenic crop is both “environmentally safe and 
economically viable .” An agronomic evaluation of the 
transgenic crop to determine economic advantage to 
farmers is seen as an integral component of the trans¬ 
genic crop approval process, along with the biosafety 
evaluation [42]. Thus, when the government granted 
permission for large-scale field testing of transgenic 
cotton in India in July 2000 (the first crop to receive 
such approval), mandatory data to be generated by the 
applicant included “cost of transgenic seed, projected 
demand, and the area to be covered under transgenic 
cotton cultivation” [45]. 

The Philippines 

The Philippines’ National Agenda for Sustainable 
Development for the twenty-first Century (PA 21) 
provides the policy framework of the country’s strategy 
for sustainable development. In 2001, the Presidential 
Policy Statement on Modern Biotechnology [53] 
reiterated the government policy of promoting the 
safe and responsible use of modern biotechnology 
and its products as one of several means to achieve 
and sustain food security, equitable access to health 
services, sustainable and safe environment, and indus¬ 
try development. The Philippine government formally 
funds biotechnology as part of the annual budget for 
agricultural R&D through legislation [54]. In Decem¬ 
ber 2002, the Philippines became the first country in 
Asia to commercialize a GM crop for use as food, feed, 
or for processing [55] when the Department of 
Agriculture (DA) approved the Bt corn MON810 for 
import and propagation [56]. By summer 2005, the 
Philippines had approved 19 different LMOs for direct 
use as food, feed, or propagation [57], while in 2010, 
this had increased to 53 (when stacked events are also 
included). 

Regulatory Oversight in the Philippines 

The Filipino biotechnology regulatory system was 
established as a result of the recommendations from 
the scientists asking the national government to for¬ 
mulate a national policy on biosafety and create 
a technical body to draft guidelines to ensure that 
experiments using GMOs do not pose unacceptable 
risks to human health and the environment [12]. The 
Philippines has a body of policies aimed at regulating 


the development, importation, transfer, and use of 
GMOs. The first guidelines for biosafety were promul¬ 
gated in October 1990 as Executive Order (EO) 430, 
which established the National Committee on Bio¬ 
safety of the Philippines (NCBP). The NCBP was 
established to “oversee the compliance with policies 
and guidelines in all institutions, public or private, as 
well as to coordinate with the appropriate national 
bodies that have regulatory powers over any violations” 
[58] . At present, the NCBP is concerned with contained 
use (confined laboratory and greenhouse experiments 
on the regulated article), and its primary function is to 
identify and evaluate potential hazards involved in 
initiating genetic modification experiments and rec¬ 
ommend measures to minimize risks [59]. Additional 
guidelines were developed and published by the NCBP 
and the Department of Science and Technology in 
1991, 1998, and 2002 before being incorporated into 
the National Biosafety Framework, which was finalized 
in 2004 and issued as EO 514 in April 2006 [5, 12]. The 
rules and regulations for the import and release into the 
environment of plants and plant products derived from 
the use of modern biotechnology are set out in Admin¬ 
istrative Order no. 8, Series of 2002 of the Philippine 
Department of Agriculture (AO 8) [35, 55]. The fol¬ 
lowing year, the DA issued Memorandum Circular 
No. 8, which outlined the import requirements for 
biotech products. This was quickly followed by the 
issuance of Memorandum Circulars 11 and 12 in 
August 2003, which further clarified the import rules 
for biotech products for direct use as seed, food, feed, 
or for further processing [57]. Importers of GM 
plants for contained use, field testing, and propagation 
(or commercial planting), as well as GMOs for direct 
use as food, feed, and processing, are required to 
obtain an approval permit [15, 55] which stipulates 
that the performance of the GM crop and its effect on 
the environment as well as human and animal health 
have been positively assessed [11]. 

The decision-making process is vested in multiple 
national competent authorities (NCAs) after consul¬ 
tation with other agencies and/or with a multi¬ 
stakeholder advisory body. The Department of Agri¬ 
culture (DA) is the competent national authority 
responsible for biosafety decisions concerning plants 
and plant products derived from modern biotechnol¬ 
ogy, fisheries and other aquatic resources, domesticated 
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animals and biological products used for animal hus¬ 
bandry or veterinary purposes, and biological agents 
used for biocontrol. It is the government institution 
with mandatory responsibility for GM crop field 
releases and commercialization. Likewise, the Depart¬ 
ment of Science and Technology is responsible for 
research and development, the Department of Health 
for pharmaceuticals which are not addressed by other 
relevant international agreements or organizations, 
and the Department of Environment and Natural 
Resources concerning regulated organisms intended 
for bioremediation, the improvement of forest genetic 
resources, and wildlife genetic resources, and appli¬ 
cations of modern biotechnology with potential 
impact on the conservation and sustainable use of 
biodiversity [12]. 

Commercialization of GM Crops: Filipino Approval 
Process 

In consultation with the NCBP, the Bureau of Plant and 
Industry (BPI) of the DA is responsible for the granting 
of permits issued under AO 8 and are classified 
according to the intended use of the regulated article: 
(a) importation for contained use, (b) field testing, 
(c) release for propagation, and (d) importation for 
direct use as food or feed or for processing [61]. Appli¬ 
cations for import must be accompanied by a certificate 
from the competent authority in the country of origin 
stating that the regulated article has been locally 
approved and a notification in accordance with interna¬ 
tional obligations. Local applications must be supported 
with the necessary technical and scientific dossiers, 
a public information sheet (PIS), and a certificate from 
the BPI stating that the regulated article has undergone 
satisfactory field testing in the Philippines. The AO 
8 policy for commercial propagation stipulates that no 
regulated article will be released unless (a) field testing 
showed that the GM crop will not pose any significant 
risks to the environment, (b) food and/or feed safety 
studies showed that the GM crop will not pose any 
significant risks to human and animal health, and 
(c) a permit for propagation has been secured from the 
DA. If the GM crop has transgene-derived pesticidal 
properties, it must also be duly registered with the 
Fertilizer and Pesticide Authority (FPA) [61]. Upon 
receipt of an application, the BPI has 5 days to process 


and evaluate all of the documentation to ensure that it 
is sufficient in form and substance. If it is found to be 
defective, then the applicant is given a 60-day grace 
period to correct or provide further necessary informa¬ 
tion. Only a complete application will be accepted for 
evaluation by a multi-stakeholder advisory body, the 
results of which must be reported to the BPI within 
30 days of acceptance. For the duration of the evalua¬ 
tion process, the reviewers remain anonymous to both 
the public and the BPI. The Scientific and Technical 
Review Panel (STRP) and the Bureau of Agriculture 
and Fisheries Products Standards (BAFPS) evaluate all 
applications; the STRP comprises of at least three 
experts from a roster of independent scientists and 
particularly evaluates the risk assessment and risk man¬ 
agement strategies outlined by the applicant, whereas 
the BAFPS will make a determination of compliance 
with food-safety standards. Additionally, the Fertilizer 
and Pesticide Authority (FPA) and the Bureau of Ani¬ 
mal Industry (BAI) will also evaluate applications of 
those regulated articles which are also pest-resistant 
plants or to be used as feed, respectively. Concurrently, 
the applicant must carry out a public consultation by 
publishing the PIS in two newspapers of general circu¬ 
lation and inviting the public to make comments 
directly to the BPI within 60 days of posting the notice. 
A decision, together with any agreed permit conditions, 
is made within 120 days of publication of the notice 
[62]. Approved products are then included in the reg¬ 
istry for direct use maintained by the BPI. Once in the 
registry, for imported articles, the applicant is no lon¬ 
ger required to secure an import permit for succeeding 
shipments. However, a notification of shipment to 
BPI is required within 15 days before its arrival at 
a Philippine port [57]. 

South Africa 

South Africa is a biotechnology leader in Africa and is 
involved in sophisticated biotechnology activities, 
including those pertaining to the development and 
commercialization of GMOs [6, 63]. In 2000, the 
South African government began to focus on, and 
substantially increased, its research support for bio¬ 
technology. This led to the adoption of the 2001 
National Biotechnology Strategy (NBS), a policy 
framework to create incentives for the biotechnology 
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sector [63], involving several government departments 

[64] . The NBS commits more than US$300 million per 
year from government to finance a variety of biotech¬ 
nology initiatives [6]. 

South Africa is among only three African countries 
in which GM crops are commercially grown [2], In 
2006, the commercial release of insect-resistant (Bt) 
cotton and maize; herbicide-tolerant (RR) soybeans, 
cotton, and maize; and cotton with the “stacked gene” 
(Bt and RR) had been approved. At the time, it was 
estimated that these GM crops accounted for the culti¬ 
vation of 30.5% of yellow maize, 28.8% of white maize, 
59% of soybean, and 90% of cotton in South Africa 

[65] . The total area of commercialized transgenic crops 
increased in 2010 to 2.2 million hectares [2], which is 
mostly due to white and yellow maize, followed by RR 
soybeans and insect-resistant cotton. Giving an insight 
into potential future commercial releases, several vari¬ 
eties of these crops were also in field trials as of 2009, 
along with additional new crops. Between January and 
September 2009, the number of field trial permits 
issued totaled 267 [3]. Maize topped the list with 
222 approvals, followed by 24 permits for cotton, 15 
for vaccines, 3 for soybeans, and one each for sugar 
cane, sorghum, and table grapes. The traits associated 
in these approvals included drought tolerance and her¬ 
bicide tolerance in maize, herbicide tolerance in cotton, 
biofortified sorghum, fungus resistance in table grape, 
alternative sugar production pathways in sugar cane, 
and cassava with altered starch content. 


inspectorate to monitor function. According to the 
legislation, no person may import or export from 
South Africa, or develop, produce, use, release, or dis¬ 
tribute any GMO in South Africa, other than under 
a permit for undertaking such an activity [67]. Such 
a permit is to be issued after a technical assessment and 
risk analysis report has been submitted by the applicant 
and has been approved by the Executive Council. This 
Council is responsible for making regulatory decisions 
and is comprised of ten members: one representative 
from each of eight government ministries (Agriculture, 
Science and Technology, Health, Environmental Affairs 
and Tourism, Trade and Industry, Labour, Water 
Affairs and Forestry, and the Department of Arts and 
Culture), the chair of SAGENE who provides scientific 
and technical analysis of risk assessment data, and the 
GMO Registrar [68]. The GMO Regulations provide 
that an applicant shall notify the public of any pro¬ 
posed release of GMOs prior to the application for 
a permit for such release. Public notifications shall be 
in the form of a standard notice published in the 
printed media informing the public of the intended 
release. It is worth noting, however, that the first field 
trials were allowed in 1994, and since 1997, several 
multinational companies have been permitted to 
grow and import GMOs even before the GMO Act 
was belatedly implemented in November 1999 [63, 69] . 
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Regulatory Oversight in South Africa 

Concerns regarding the commercial release of GMOs 
led to South Africa enacting legislation to regulate 
the development, importation, and application of 
GMOs. The Genetically Modified Organism Act 1997 
(GMO Act) was passed in 1997, and it was subsequently 
modified in 2006 to bring it in line with the CPB [66]. 
Regulations for its implementation were initially 
adopted in 1999, and then, amended regulations took 
effect in February 2010. The formal structures for the 
implementation of this act include an Executive 
Council, which reviews applications for GMO work; 
a scientific advisory committee (South African Com¬ 
mittee for Genetic Experimentation [SAGENE]); 
a registrar to administer the GMO Act; and an 


Commercial activities concerning GMOs all require 
a permit, including those for: import, export, 
contained use (including development, production, 
distribution, transport, but not those under contain¬ 
ment levels 1 8c 2, i.e., in the laboratory or growth 
chamber), deliberate release of GMOs into the envi¬ 
ronment (trial and general release), and commodity 
clearance [70]. All applications must be submitted to 
the GMO Registrar at the DA, along with a copy of the 
public notice and proof (newspaper clippings) in order 
for the application to be processed. The public notice 
allows interested parties to submit comments or objec¬ 
tions in connection with the intended release to the 
Registrar within 30 days after the date of the notifica¬ 
tion [71]. The Registrar then undertakes an initial 
review of the application to determine compliance 
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with the provisions of the GMO Act. If the application 
is not compliant, the application is referred back to 
the applicant. Once compliant, the application is 
forwarded to a committee (expertise nominated by 
SAGENE chairperson) formed under SAGENE to con¬ 
duct a review of the proposed activity. The review 
includes an evaluation of the risk assessment data, 
including food safety (if applicable), submitted in the 
application. Conclusions of the assessment are detailed 
in a recommendation report, which is sent to the 
Registrar on completion of the review. At this stage, 
the application can be referred back to the applicant to 
address any concerns raised or to supply additional 
information, and the response returned to the commit¬ 
tee. Once all concerns have been addressed, the com¬ 
mittee makes a recommendation on the application. 
The recommendation document, public input, and 
a copy of the application is forwarded to the Executive 
Council for consideration, who will also take into 
account the socioeconomical impact that the GMO 
may have. The Council then submits its decision in 
writing to the Registrar. Should the Council raise any 
concerns, the Registrar will once again refer the appli¬ 
cation back to the applicant for clarification. Based on 
the information received from the applicant and the 
assessment done by the Council, the application will 
be approved or rejected. If the application is approved, 
the Council authorizes the Registrar to issue a permit to 
the applicant. This permit will be accompanied by 
specific containment conditions as prescribed by the 
Council. If the application was rejected, the Registrar 
will communicate the decision back to the applicant 
with reasons for the rejection [70]. Regulations of 
the Department of Health of 2004 provides for the 
labeling of foodstuff with genetically modified ingredi¬ 
ents that are significantly different to the non-GMO 
ingredients. The Consumer Protection Act of 2008 also 
addresses this issue. 

United States of America (USA) 

In the USA, GM crops have been sold since 1994 and in 
2006 were already planted on 54.6 million hectares 
(soybean, maize, cotton, canola, squash, papaya, and 
alfalfa), confirming the USAs role as the world leader in 
agro-biotechnology [10]. The regulatory system in the 
USA relative to biotechnology products is rather 


different from the one put in place in the EU, and 
the discrepancies mainly reflecting the different 
approaches taken by the governmental authorities, cit¬ 
izens, and firms toward GMOs and GM food, especially 
in the initial years of the biotechnology revolution. 

In the USA, agricultural biotechnology politics has 
been dominated by a strong and cohesive coalition of 
pro-biotechnology upstream and downstream pro¬ 
ducers and farmers. Lower public outrage has made 
mobilization of NGOs in the United States difficult 
and, in combination with a less-favorable institutional 
environment (notably, centralized regulatory policy¬ 
making), has resulted largely in their exclusion from 
agri-biotech policy-making [72]. 

Regulatory Oversight in the USA 

Taking the approach that GM products are essentially 
an extension of conventional products, the US Govern¬ 
ment has made use of existing laws to ensure the safety 
of GM products [ 15] . US regulatory authorities operate 
under the assumption that the fact that a plant has been 
genetically modified is less important than the specific 
effects of the modification [4]; therefore, the regulation 
focuses on the characteristics of the products rather 
than the way in which the product was produced 
[35]. The current system was delineated by the White 
House Office of Science and Technology Policy under 
the 1986 Coordinated Framework for Regulation of 
Biotechnology [73]. It is still the key document for 
regulating gene technology in the United States and 
provides the basis for the regulation of crop varieties 
produced by rDNA techniques. Under the Framework, 
the US Department of Agriculture (USDA), the Food 
and Drug Agency (USFDA), and the Environmental 
Protection Agency (USEPA) that were responsible for 
regulatory oversight of certain product categories or for 
certain product uses are also responsible for evaluating 
those same kinds of products developed using genetic 
engineering techniques. Transportation, growing 
(including field testing), and propagation of GM 
crops are governed by the USDA’s Animal and Plant 
Health Inspection Service (APHIS) under the Federal 
Plant Pest Act 1996 (FPPA) and the National Environ¬ 
mental Policy Act 1969. Specifically, the APHIS has two 
responsibilities: deciding on which GM seeds to over¬ 
see, so-called regulated articles, and which GM seeds 
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are safe enough to be free from the agency’s oversight, 
so-called deregulated articles. Deciding on GM seeds to 
be regulated depends on “familiarity,” gained from 
direct experience through field testing under regulated 
conditions. Any eventual deregulation, that is, 
exempting a GMO from the oversight of APHIS, 
involves a petition process whereby an advertisement 
is published in the US Federal Register, and a period to 
comment is provided to the public [74]. 

If a GM plant is not intended for human consump¬ 
tion and is not modified to contain a pesticide, the 
USDA is the leading agency. For plants genetically 
modified to produce their own pest protection, 
APHIS coordinates its evaluation with the USEPA. 
Pest-resistant GM crops fall under the authority of 
the USEPA under the Federal Insecticide, Fungicide, 
and Rodenticide Act 1996 (FIFRA) and the Toxic Sub¬ 
stances Control Act 1976. They are subject to a strict 
testing regime, where producers must submit testing 
data to the USEPA who determines the quantity of 
pesticidal substances that may be present in food 
[75]. In fact, industry is required to obtain an Experi¬ 
mental Use Permit (EUP) from the USEPA to test any 
pest-resistant plant in a field larger than 10 acres. Once 
the permit is granted, firms are expected to consult the 
USEPA on the details of the field experiment. While the 
APHIS analyzes data on the source of the new gene, 
the nature of the pesticidal substance produced, differ¬ 
ences with its natural equivalent, effects on non-target 
organisms, and environmental fate, the USEPA focuses 
on the toxicology, the digestive fate, and the potential 
allergenicity of the toxin. In line with the product- 
based approach, the USEPA does not assess the GM 
plant per se but the toxin produced by the plant. As for 
any pesticides, the USEPA subjects these toxins to 
a registration process. Pest-resistant plants whose 
toxin falls under this process are mostly of the Bt 
variety. Following this registration logic, if the toxin 
produced by a GM plant has been approved previously 
as a regular pesticide, a new registration is not required. 
The USEPA has been requesting for years new regula¬ 
tions to obtain a wider role in the assessment and the 
management of GM plants, but thus far with only 
limited success [11, 72, 74]. The US Food and Drug 
Administration (USFDA) regulates food applications 
of GM crops and relies on existing laws that hold food 
manufacturers responsible for food safety. Of the three 


agencies, the USFDA has had the most influence on 
biotechnology policy because most biotechnology 
products on the American market are health care or 
food products [72]. In 1999, public meetings were 
held by the agency with the aim of sharing its experi¬ 
ence regarding GM foods and soliciting views on 
whether its policies and procedures should be 
modified. Public comments indicated considerable 
public support for a mandatory and more transparent 
process [4]. 

Commercialization of GM Crops: US Approval 
Process 

The APHIS oversees the confined and unconfined 
release of transgenic plants as well as any importation 
and interstate movement under the FPPA. In addition 
to the FPPA, the USDA issued rules in 1987 for the 
“introduction of organisms and products altered or 
produced through genetic engineering which are 
plant pests or which there is reason to believe are 
plant pests” [76]. By these rules, the introduction of 
a crop produced by rDNA techniques into the environ¬ 
ment is only legal with an authorization by the APHIS. 
The APHIS grants a release permit after preparing an 
environmental impact assessment and “Finding Of No 
Significant Impact” (“FONSI”). Exempt from these 
rules are experiments with plants produced by rDNA 
technology in a contained environment (e.g., labora¬ 
tory, green house). 

In 1997, the USDA simplified the procedure for the 
unconfined release of GM crops into the environment 
by allowing the applicant to petition the APHIS for 
a “determination of non-regulated status” [77]. When 
receiving a petition, the APHIS prepares an environ¬ 
mental impact assessment taking into account the fol¬ 
lowing eligibility criteria: 

• The crop must not be listed as noxious weed or 
weed. 

• The introduced genetic material must be stable and 
characterized. 

• The introduced genetic materials must not 

- Result in any plant disease 

- Confer an infectious entity or encode toxic 
substances to non-target organisms 

- Encode products intended for pharmaceutical 
use 
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• Any plant virus-derived sequences must not pose 

a significant risk for new plant virus creation. 

• The GM crop must be free of known human and 

animal pathogens or allergens. 

After a complete petition is filed, it is published in 
the Federal Register to solicit comments from the 
public. Thereafter, the APHIS reviews the data, taking 
into account public comments and takes a final deci¬ 
sion, which is again announced in the Federal Register. 
The issuance of a “non-regulated” status for a trans¬ 
genic crop means that it is deregulated and can be freely 
commercialized in the US (unconfined release, import, 
interstate movement), except if it contains a pesticidal 
substance. In that case, an additional “plant pesticide” 
approval by the USEPA is required. 

The responsibility of the USEPA is to evaluate the 
risks of GM crops “producing their own pesticide” for 
human consumption under the FIFRA. The evaluation 
process is held to the same standards as those for 
pesticides applied to plants. To be registered under 
the FIFRA, a pesticide must not cause “unreasonable 
adverse effects” on the environment and on human 
health [78]. Transgenic insect- and virus-resistant 
plants fall under the jurisdiction of the USEPA; how¬ 
ever, viral coat proteins are normally exempted from 
the requirements as the USEPA considers these proteins 
as “low-risk applications” based on the principle of 
familiarity and their ubiquitous presence in the food 
supply. Today, Bt toxins, one viral coat protein, and the 
potato leaf roll virus protein are registered as pesticides 
and supervised by the USEPA. The agency evaluates the 
risks of these “plant-incorporated protectants” by tak¬ 
ing into account the following criteria: toxicological 
effects, effects on non-target organisms, insect resis¬ 
tance management, and persistence of the substance 
in the environment. The evaluation process lasts 
approximately 1 year. If adverse effects of insect- or 
virus-resistant plants are observed after commerciali¬ 
zation, the USEPA has the legal power to amend 
existing registrations. Moreover, the USEPA may 
impose new measures such as new pest resistance 
schemes [79]. 

Besides the pesticide registration under FIFRA, 
Section 408 of the FFDCA requires the USEPA to deter¬ 
mine tolerance limits for substances used as pesticides 
on and in food and feed [78,80] . “Nucleic acids that are 


part of a plant-incorporated protectant” are exempted 
from this requirement because the USEPA considers 
them as “safe” [80]. Once approvals from the APHIS, 
and from the USEPA when pesticidal substances are 
used, have been granted, it is legal to commercialize 
the GM plant or derived product in the USA. However, 
applicants normally engage in a voluntary consultation 
process with the USFDA before marketing of the 
GM plant or derived products. This policy is now 
under review. 

International Obligations Relevant to the 
Biosafety Frameworks 

Countries do not have complete discretion when decid¬ 
ing how to set up their biosafety regulatory system as 
there are several international treaties and agreements 
that relate to biosafety. If a country is bound by any or 
all of those international agreements, then their bio¬ 
safety regulatory system must be compliant with those 
obligations [81]. 

Cartagena Protocol on Biosafety 

The conclusion of the CPB was broadly recognized as 
a step forward in providing an international regulatory 
framework to reconcile trade with environmental pro¬ 
tection by creating an enabling environment for the 
environmentally sound application of biotechnology 

[82] . The Convention on Biological Diversity (CBD) 

[83] in Article 19 [3] states that: 

► The Parties shall consider the need for and modalities 
of a protocol setting out appropriate procedures, 
including, in particular, advance informed agreement, 
in the field of the safe transfer, handling and use of and 
living modified organism resulting from biotechnology 
that may have adverse effect on the conservation and 
sustainable use of biological diversity. 

In 1995, the second Conference of the Parties 
(COP) to the CBD began the consideration of the 
need for and modalities of such a protocol. COP Deci¬ 
sion II/5 commenced the negotiations for the protocol 
by launching an open-ended ad hoc Working Group on 
Biosafety [1]. Meeting six times between 1996 and 
1999, the Working Group concluded its work with the 
submission of a draft protocol for consideration by the 
first extraordinary meeting of the COP, convened with 
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the purpose of adopting a protocol on biosafety to the 
CBD. The result of this first extraordinary meeting, 
which took place in two separate meetings in 1999 
and 2000, was the adoption of the Protocol [84]. In 
accordance with its Article 36, the Protocol was opened 
for signature by States and regional economic integra¬ 
tion organizations from 15 to 26 May 2000 and 
remained open for signature from 5 June 2000 to 
4 June 2001. By that date, the Protocol had received 
103 signatures. The Protocol entered into force on 
11 September 2003, 90 days after receipt of the 50th 
instrument of ratification [85]. There are presently 
160 parties to the Protocol [86], with the COP to the 
CBD serving as the meeting of the Parties to the Pro¬ 
tocol (COP-MOP), the Protocol’s governing body. 
Since the coming into force of the Protocol, the COP- 
MOP has met five times. The fifth meeting of the COP- 
MOP took place from 11 to 15 October 2010 in 
Nagoya, Japan, and it approved a Supplementary Pro¬ 
tocol on Liability and Redress to the Cartagena Proto¬ 
col on Biosafety [87]. 

The Protocol’s scope is the “transboundary move¬ 
ment, transit, handling, and use of all living modified 
organisms that may have adverse effects on the conser¬ 
vation and sustainable use of biological diversity, 
taking also into account risks to human health” [82]. 
To ensure the safe transfer, handling, and use of GMOs, 
the Protocol sets up two separate procedures. The first 
time that a GMO is to be intentionally introduced into 
the environment, the Protocol sets up an Advance 
Informed Agreement (AIA) procedure. That procedure 
requires that the exporter of the GMO provides a notice 
with detailed information about the GMO to the 
importing country. The importing country then 
reviews the information, conducts a risk assessment, 
and decides, based on the risk assessment results, 
whether to approve or reject the GMO. The second 
procedure set up by the Protocol is for GMOs to be 
used for food or feed or for processing (such as corn, 
soybeans, wheat, or other grains that will be fed directly 
to humans or animals or used for processing). For 
those GMOs, the AIA procedure is not required. 
Instead, the Protocol establishes a simpler system that 
reflects the decreased likelihood that those GMOs will 
affect the importing country’s biodiversity. Before the 
GMO can be exported to another country, the only 
requirement is that the safety decision in the exporting 


country is communicated to other countries through 
the Biosafety Clearing-House. For LMOs used for other 
purposes, such as LMOs used in the laboratory, the 
Protocol leaves any regulation to the discretion of the 
individual country. The Protocol also does not cover 
products derived from LMOs, such as processed foods 
that have ingredients that came from LMOs. Although 
the Protocol comprehensively covers many issues, there 
are a few remaining to be addressed by the individual 
Party when establishing their biosafety regulatory 
regime [81]. 

In addition to the CPB, other relevant international 
agreements exist. Under international law, countries 
shall comply with all treaties to which they are parties, 
provided that the provisions of these treaties are not 
contradictory (principle of accumulation of interna¬ 
tional obligations). According to the principle of 
integration contained in the Rio Declaration on Envi¬ 
ronment and Development (Principle 4) and the 2002 
Plan of Implementation of the World Summit on Sus¬ 
tainable Development (Paragraph 92), environmental 
treaties and trade goals shall mutually support each 
other. In fact, the various agreements and instruments 
on the topic of the development and use of agro¬ 
biotechnology are all closely interrelated and need to 
be carefully considered in the decision-making process, 
both individually and collectively. This regulatory and 
public policy background consists of relevant interna¬ 
tional instruments and processes, such as: the entry 
into force of the CPB, the Codex Alimentarius Guide¬ 
lines on Food Safety and Labelling of GMOs, and the 
World Trade Organization (WTO) Agreements on 
Sanitary and Phytosanitary Measures and Technical 
Barriers to Trade. Table 1 presents a summary of the 
main international instruments and processes of 
relevance for biosafety. 

A Comparative Analysis of Regulatory Processes 
in All Study Countries 

Attributes of a Modern Regulatory Biosafety System 

Although the development of regulatory regimes is 
essentially based on national specific practices, legal 
systems, public management strategies, and local 
socioeconomic considerations, national regulatory 
biosafety frameworks will generally always consider 
the following elements [110-112]: 
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Commercialisation of GM Crops: Comparison of Regulatory Frameworks. Table 1 Relevant international instruments 
and processes (Taken from [88]) 


Agreement, declaration, 
or process 

Content 

Rio Declaration on 
Environment and 
Development 

The precautionary principle is a fundamental instrument for the safe use of biotechnology. 
This principle is contained in Principle 15, which sets forth that: " In order to protect the 
environment, the precautionary approach shall be widely applied by States according to their 
capabilities. Where there are threats of serious or irreversible damage, lack of full scientific 
certainty shall not be used as a reason for postponing cost-effective measures to prevent 
environmental degradation". 

United Nations Environment 
Program - International 
Technical Guidelines for 
Safety in Biotechnology 

Enacted in Cairo in 1995, these guidelines may assist governments, intergovernmental 
organizations, private organizations, and others to strengthen capacities and exchange 
biosafety information. 

The guidelines are based on the following principles: (a) identification of hazards; (b) risk 
assessment, taking into account the probability of any hazards arising and the potential 
consequences of such hazards; and (c) risk management, applying adequate management 
strategies which include developing procedures and methods to minimize risks and their 
consequences, or making decisions not to proceed. Such management strategies shall be 
proportional to the risk assessment results. 

World Trade Organization 
(WTO) Agreement 

The WTO Agreement recognizes the goal of sustainable development, considering that 
free trade shall protect and preserve the environment [89]. There are three WTO 
Agreements which can be associated with GMOs: The Agreement on the Application of 
Sanitary and Phytosanitary Measures (SPS; [90]), the Agreement on Technical Barriers to 
Trade (TBT; [91]), and the Agreement on Trade-Related Aspects of Intellectual Property 
Rights (TRIPS; [92]). It is also important to understand the regulations and procedures that 
govern Dispute Settlement Understanding (DSU) in the WTO context [93]. 

Agreement on Technical 
Barriers to Trade (TBT) 

The TBT Agreement is relevant to biotechnological products as it applies to technical 
regulations and rules, including requirements for packaging and labeling [91]. The TBT 
Agreement recognizes that no country shall be prevented from taking necessary measures 
to ensure the quality of its exports, the prevention of deceptive practices, and the 
protection of the environment and human, animal, and plant life or health. However, such 
measures shall not be taken as means of arbitrary or unjustifiable discrimination between 
countries, where the same conditions prevail, or as disguised restrictions on international 
trade. Furthermore, such measures must otherwise comply with the provisions of the 
Agreement [91]. 

The Agreement on the 
Application of Sanitary and 
Phytosanitary Measures 
(SPS) 

SPS rules are summarized in the following manner [90]: 

1. Members have the right to take necessary sanitary and phytosanitary measures to 
protect human, animal, and plant life or health, provided that such measures are not 
inconsistent with the provisions of the Agreement. 

2. Any sanitary or phytosanitary measures are applied only to the extent that is necessary 
to protect human, animal, and plant life or health, based on scientific principles. 
Furthermore, such measures cannot be maintained in the absence of sufficient scientific 
evidence of their necessity. 

3. Members shall ensure that their sanitary and phytosanitary measures do not arbitrarily or 
unjustifiably discriminate between members among whom identical or similar conditions 
prevail. Moreover, sanitary and phytosanitary measures shall not be applied in a manner 
that would constitute a disguised restriction on international trade. 
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Commercialisation of GM Crops: Comparison of Regulatory Frameworks. Table 1 (Continued) 


Agreement, declaration, 
or process 

Content 


4. Sanitary or phytosanitary measures, which are taken in accordance with the relevant 
provisions of the Agreement, are presumed to comply with the members' obligations 
under GATT 1994, which relate to the use of sanitary or phytosanitary measures, in 
particular, the provisions of Article XX(b) [94]. 

5. In principle, members shall base their sanitary or phytosanitary measures on 
international standards, guidelines, or recommendations, where these exist. 

6. Sanitary or phytosanitary measures which comply with international standards, 
guidelines, or recommendations are deemed necessary to protect human, animal, and 
plant life or health and are presumed to be consistent with the relevant provisions of GATT 
1994 [94] and the SPS Agreement [90]. 

7. Members may introduce or maintain sanitary or phytosanitary measures which result in 
a higher level of sanitary or phytosanitary protection than the level that is required by the 
relevant international standards, guidelines, or recommendations, provided that there is 
a scientific justification to this heightened level of protection or that this level of protection 
is determined by a member country to be appropriate according to Article 5 of the SPS 
Agreement regarding risk assessment and adequate protection levels [90]. However, 
measures which result in a heightened level of sanitary or phytosanitary protection need 
not go against any other provisions in the Agreement. 

The above-mentioned standards, guidelines, and recommendations are defined as those 
established by international organizations, such as the Codex Alimentarius Commission, 
the Office International des Epizooties, and the Secretariat of the International Plant 
Protection Convention. As for other subjects that are not within the scope of the 
aforementioned organizations, they shall be regulated by "other international 
organizations" identified by the WTO Committee on Sanitary and Phytosanitary Measures 
as being open to the Parties' membership. 

The Codex AHmentarius 

Created by the World Health Organization (WHO) and the Food and Agriculture 
Organization (FAO) of the United Nations, the Codex Alimentarius (http://www. 
codexalimentarius.net) is the organization which is in charge of establishing regulations 
related to food safety. Codex Alimentarius standards need to ensure fair trade practices in 
food trade, according to WTO rules. 

The Codex Alimentarius Commission established an Ad Hoc Intergovernmental Task Force 
on Food Derived from Biotechnology [95] to handle issues associated with food obtained 
through biotechnological processes, particularly food used for health and nutrition 
purposes. In July 2003, the Codex Alimentarius Commission held a meeting during which 
three risk assessment standards for food resulting from biotechnology were approved [96]. 
These standards establish risk assessment principles for food derived from modern 
biotechnology. The principles refer to the concept of "tracing," which is a risk assessment 
tool whose meaning is the subject of an important debate. In fact, the United States of 
America consider this concept to be different from traceability and limit themselves to 
following only the previous and subsequent link in the LMO movement chain. The 
approved standards are the following: 

- Principles for the Risk Analysis of Foods Derived from Modern Biotechnology [97] 

- Guideline for the Conduct of Food Safety Assessment of Foods Derived from - 
Recombinant-DNA Plants [98] 

- Guideline for the Conduct of Food Safety Assessment of Foods Produced Using 
Recombinant-DNA Micro-organisms [99] 
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Commercialisation of GM Crops: Comparison of Regulatory Frameworks. Table 1 (Continued) 


Agreement, declaration, 
or process 

Content 


In 2008, the Ad Hoc Intergovernmental Task Force on Food Derived from Biotechnology 
completed a Guideline for the Conduct of Food Safety Assessment of Foods Derived from 
Recombinant-DNA Animals [100]. At its 31st session, the Commission noted that the Task 
Force had completed its work, one year ahead of schedule, and agreed to its dissolution 
[101]. 

Likewise, the Codex Committee on Food Labelling is working on draft guidelines for the 
Labelling of Food and Food Ingredients Obtained through Certain Techniques of Genetic 
Modification/Genetic Engineering, as well as on a Draft Amendment to the General 
Standard for the Labelling of Pre-packaged Foods (in order to address the issue of 
genetically modified foods) [102]. Finally, in 2007, the Codex Alimentarius Commission 
adopted the "Working Principles for Risk Analysis for Food Safety for Application by 
Governments" that were proposed by the Codex Committee on General Principles [103]. 
These principles allude to the precautionary approach. For more information on Codex 
activities, see www.codexalimentarius.net. 

International Plant 

Protection Convention 
(IPPC) 

The IPPC (https://www.ippc.int/) aims to prevent the international spread and introduction 
of pests of plants and plant products and to promote appropriate measures for their 
control. The Convention was reviewed in 1997 and entered into force in 2005 [104]. 

A Commission Working Group studied plant pest risks associated to LMO issues and an 
international standard (ISPM No.11 pest risk analysis for quarantine pests, including 
analysis of environmental risk and LMOs) [105]. This text intends to protect plants and 
ecosystems from LMO-related risks. According to the text, such protection measures 
should be cost effective, nondiscriminatory, and feasible and should not limit basic trade 
needs. 

World Organization for 
Animal Health (OIE) 

The World Organization for Animal Health, also known as the Office International des 
Epizooties (OIE; http://www.oie.int/), is an intergovernmental organization created by the 
International Agreement signed on 25 January 1924. OlE's mission is to ensure the 
transparency of animal health conditions worldwide. Member countries promise to declare 
animal diseases detected on their territory. In addition, the OIE is in charge of safeguarding 
international trade through the elaboration of health regulations destined to be applied to 
international transboundary movements of animals and animal products. The WTO 
recognizes such regulations as international reference rules. 

Regional International 
Organization for Plant 
Protection and Animal 

Health (OIRSA) 

In 2000, the Regional International Organization for Plant Protection and Animal Health 
(OIRSA; http://www.oirsa.org) produced a Regional Guideline on the Safety of Plant 
Biotechnology [1 06]. In fact, the Guideline is based on the CPB and essentially regulates the 
same issues as the Protocol. However, in view of its nature, the guideline also aims to 
harmonize the various regional laws and practices pertaining to the issue of biosafety. 

Central American 
Commission for 

Environment and 
Development (CCAD) 

The Central American Commission for Environment and Development (CCAD), which is the 
environmental authority of the Central American Integration System (SICA), through its 
biodiversity program and territorial legal system, prepared a Central American Protocol on 
the Safety of Modern Biotechnology. A series of technical consultations was undertaken 
and led to the adoption of a draft Protocol, which was then approved by the Central 
American Ministries of Environment in 2002. Although the Central American Protocol is 
based on the CPB, some of its provisions go beyond the scope of the CPB. In fact, 
provisions regarding labeling, documentation, liability, contained use, and transit, as well 
as various biosafety principles, have been included in the Central American Protocol in an 
attempt to avoid the inconsistencies and ambiguities which resulted from the multilateral 
negotiations that lead up to the adoption of the CPB. 
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Commercialisation of GM Crops: Comparison of Regulatory Frameworks. Table 1 (Continued) 


Agreement, declaration, 
or process 

Content 

The Inter-American Institute 
for Cooperation in 
Agriculture (IICA) 

As requested by the Ministries of Agriculture in 2002, IICA, in collaboration with the OIRSA 
and the Tropical Agricultural Research and Higher Education Centre (CATIE), was put in 
charge of preparing a regional biosafety regulatory framework, in response to the lack of 
regulations in certain countries that are receiving, or may potentially receive, requests for 
field trials or trade in LMOs. A group comprising of various agencies was formed, along 
with a consultative process, which led to the preparation of a Regulatory Framework Draft 
for Living Modified Organisms for Agricultural and Livestock Use in Central American 
Countries. This initiative intends to promote a model regulation on agricultural 
biotechnology that is to be adapted by each country, according to its particular needs and 
situation. There is hope that this regulation will be enacted in a similar fashion by the 
various nations, thereby triggering a process of harmonization of regulatory frameworks in 
the region. 

Other relevant agreements 
and declarations 

- CGRFA draft Code of Conduct on Biotechnology [107] 

- United Nations Model Regulations on the Transport of Dangerous Goods (UN 
Recommendations) [108] 

- The Aarhus Convention on Access to Information, Public Participation in Decision¬ 
making and Access to Justice in Environmental Matters, which contains specific guidelines 
on public participation in LMO-related decision-making processes [109]. The Convention 
entered into force in 2001. Although it was enacted by the United Nations Economic 
Commission for Europe, the convention is open to other nations. The Convention was 
amended in 2005 in order to set out more precise provisions on the deliberate release of 
genetically modified organisms, but the amendment has not yet entered into force. The 
amendment will enter into force once it has been ratified by at least three-quarters of the 
Parties. In September 2007, the amendment had only been ratified by four countries. The 
third meeting of the Parties was held in Riga, Latvia, on 11-13 June 2008. The meeting 
adopted the Riga Declaration and a strategic plan for the Convention, resolved the issue of 
how to calculate ratification of amendments, and renewed the mandates of taskforces 
dealing with access to justice, electronic information tools, and public participation in 
international forums. 


• Biosafety policy, occasionally as part of a broader 
biotechnology or biodiversity policy 

• Regulatory policy 

• A mechanism to handle requests and permits 
related to LMO use, in particular, notifications 
required by the CPB (i.e., an administrative system) 

• A mechanism for monitoring and inspections 

• A system that allows public participation and 
information 

According to Jaffe [113]: 

► The purposes of a national biosafety regulatory system 
are to scientifically assess the safety of genetically 
engineered (GE) organisms to humans and the envi¬ 
ronment, manage any potential risks, and authorize the 


development and marketing of safe GE organisms and 
their products. To develop such a regulatory system, 
a government can use existing laws or develop new 
laws. Any national biosafety regulatory system that is 
proposed, however, must be functional, protective, 
and also comply with international trade standards 
that are evolving in recognition of the growing impor¬ 
tance of GE organisms in world affairs. 

A national biosafety regulatory system is a regula¬ 
tory regime responsible for assessing and managing the 
full range of potential risks that might be posed by 
a GMO and its products. It addresses potential risks 
to the environment and biological diversity as well as 
any food/feed risks or other safety-related issues 
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involving GMOs (e.g., worker health, drug safety, etc.). 
A protective biosafety regulatory system ensures that 
any risks from GMOs are managed and allows safe 
GMOs to be developed, marketed, and utilized for 
their intended purpose. Such a system, however, must 
also be functional, which means that it should be 
understandable, workable, equitable, fair, adaptive, 
and enforceable [114]. 

Existing biosafety regulatory systems from around 
the world reflect, among other things, the type of 
government in the country, the politics of the country, 
the country’s view on the relative safety of GMOs, and 
the country’s regulation of food, agriculture, and envi¬ 
ronmental issues. Establishing those systems required 
balancing numerous goals and trading off different 
interests. Through an analysis and comparison of dif¬ 
ferent existing biosafety regulatory systems, however, 
one can identify key characteristics and components 
that are generally important to a functional and pro¬ 
tective biosafety regulatory system [113]. Incorporat¬ 
ing each of those characteristics and components in 
a functional and protective biosafety regulatory system 
involves problem solving because there can be tensions 
between the different characteristics. 

Jaffe [113] indicates the following attributes of 
a system: 

• Comprehensive. 

• Adequate legal authority to subject each GMO to 
a food-safety and environmental risk assessment 
approval. 

• A clear safety standard. 

• Proportionate risk-based reviews. 

• Transparent and understandable. 

• Participatory. 

• Post-approval oversight. A biosafety regulatory 
system does not stop its oversight once a GMO 
has been approved for a confined field trial or for 
a commercial release. 

• Flexible and adaptable. 

• Efficient, workable, and fair. 

Decision-makers are facing an important challenge 
because biotechnology and biosafety fields are ever 
evolving at a dramatic pace. In this context, overly 
precise and detailed regulatory frameworks can easily 
become obsolete in a short period of time. In order to 


avoid forever having to enact new legislation and pol¬ 
icies, it is preferable to develop a regulatory framework 
that takes on the form of a general guide, thus regulat¬ 
ing more specific biosafety aspects by way of by-laws 
and other regulations. In fact, this alternative allows 
more efficient regulation of future situations but has 
the effect of conferring the power of regulating specific 
aspects of biotechnology to the executive branch of 
government and other administrative institutions, 
instead of to the Parliament [115]. 

General Comments on Evolution of Biosafety 
Regulatory Systems 

The nature of GM crop regulations around the world 
has as much to do with social and political values as 
with concerns about health and safety. Consumers’ 
growing awareness of their rights and farmers’ increas¬ 
ing fear of dependence on multinational companies are 
symptoms of a deeper concern about values and prior¬ 
ities, the type of environment that people want, the role 
of biodiversity, their tolerance to risk, and the price that 
people are prepared to pay for regulation. Therefore, 
the regulations in the study countries were formed, or 
amended, during this period of heightened public 
awareness and are a reflection of how the various gov¬ 
ernments have attempted, or not, to address these 
concerns. A comprehensive discussion of regulatory 
requirements for GM crops at the national and inter¬ 
national levels is a broader topic than can be covered 
here, and previous studies have addressed them in 
detail [5, 116]. 

In general terms of regulatory systems, several 
approaches have been considered worldwide with 
respect to the safe use of modern biotechnology: 
(a) certain countries decided to apply preexistent 
plant or animal health regulations to GMO-related 
issues or to incorporate biosafety provisions into 
plant and animal health protection laws; (b) other 
countries decided to adopt specific biosafety regula¬ 
tions that need to be enacted by the Parliament, the 
executive power (through agreements or resolutions), 
or public sector institutions; (c) certain countries 
decided to mention biosafety in their environmental 
regulations; (d) some countries decided to apply seeds, 
pest control, and plant or animal regulations to 
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modern biotechnology, enacting but a few coordina¬ 
tion rules and specific provisions associated to 
biosafety. 

The original trend of agricultural biosafety regula¬ 
tions focusing on transgenic plants continues to exist in 
some countries but is in the process of being replaced 
by a broader focus that comprises biosafety regulation 
of animals, microorganisms, fish, and forest species. 
This progressive expansion of biosafety toward new 
areas results from the need to regulate research and 
eventual trade of other types of organisms created 
through genetic engineering. The CPB, whose scope 
extends beyond the issue of plants, and the subsequently 
developed regulatory frameworks implementing its 
provisions, brought on a gradual increase in the 
number and variety of organisms and activities being 
regulated. In addition, health authorities have 
increased participation in biosafety advisory commis¬ 
sions and committees. 

In view of the complexity of biosafety issues, par¬ 
ticularly issues of risk assessment and risk manage¬ 
ment, and several other issues associated to the 
introduction of GMOs into the environment, almost 
all of the countries have established a committee or 
a national commission to assist decision-makers in 
the development of biosafety regulations. The compo¬ 
sition of these committees and commissions varies 
considerably in the region, mainly with respect to the 
inclusion of the productive sector, consumers, and 
non-governmental organizations. However, the trend 
points toward the inclusion of these organizations, 
even though such an inclusion can potentially lead to 
conflicts of interest. Currently, these committees or 
commissions are for the most part directly connected 
to the regulatory authorities in place. Nevertheless, 
independent national commissions (commissions that 
are not connected to the regulatory authorities) were 
created in some countries and were assigned political 
and coordinative functions, leaving the issue of 
technical recommendations to be dealt with by other 
sector-based authorities established by each country’s 
competent Ministry or Secretariat. 

The entry into force of the CPB had the effect of 
introducing the Ministries of Environment to the bio¬ 
safety debate. For the reasons mentioned in the previ¬ 
ous paragraph, the Ministries of Agriculture have 


traditionally been the authorities involved in the deci¬ 
sion-making process regarding GMOs. Gradually, 
environmental authorities started demanding and 
assuming an active role in the regulation of GMOs in 
view of their mandate to protect the environment. This 
trend has been strengthened by the incorporation of 
biosafety provisions into environmental laws and 
regulations. In some cases, GMOs are even subject to 
environmental impact assessments (at least based on 
a literal interpretation of the pertinent legal provisions). 

Comparison of Approval Systems 

In Asia, the only major GM crops approved for com¬ 
mercial release are Bt cotton, which is grown commer¬ 
cially in China, India, and Indonesia, and GM corn 
recently approved in the Philippines. To date, no 
Asian government has given official permission to 
plant GM soybeans or rice. 

In the context of GM crops, the concept of “bio¬ 
safety” is, in principle, a broad one, covering three 
areas: the health safety of humans and livestock, the 
safety of the environment (i.e., ecology and biodiver¬ 
sity), and socioeconomic safety (i.e., the economic and 
social impact on farmers, consumers, and different 
social classes, as well as on trade and economy in 
general) [44]. While the biosafety regulations in force 
in industrialized countries (e.g., Canada, the European 
Union, and the USA) address only the health and 
environmental risks and exclude socioeconomic con¬ 
siderations, the regulations in developing countries 
(e.g., India, Argentina, SA, and the Philippines) tend 
to include all three areas. 

Countries have responded differently to the oppor¬ 
tunities presented by GM crops and the potential risks 
associated with them (Table 2). The composition of the 
“trade off” between potential benefits and risks in each 
case depends upon whether a government adopts 
a permissive, precautionary, or prohibitive policy 
approach to GM crops. Three basic conditions may 
thus trigger application of protective measures: uncer¬ 
tainty, risk, and lack of proof of direct causal link [5]. 
As major agricultural exporters, Argentina, Canada, 
and the USA have each adopted a permissive attitude 
very early on, widely authorizing most GM products 
for production and consumption, thereby benefiting 
from lower production costs and greater export profits. 
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Commercialisation of GM Crops: Comparison of Regulatory Frameworks. Table 2 Main characteristics of 
commercialization approval systems in study countries 



Country/region 







Element of approval system 

Argentina 

Canada 

China 

Europe 

India 

Philippines 

S. Africa 

USA 

Biosafety/GMO-specific law/ 
act 

X 

X 

X 

/ 

X 

X 


X 

Regulatory trigger: process (A) 
or product (B) 

B 

B a 

B 

A 

A 

A 

A 

B 

Responsible ministry or 
government department: 
agriculture (A), environment 
(E), or health (H) 

A 

A,H 

A 

A or E b 

E 

A 


A, E c 

ERA committee composition: 
academia (A), commercial (C), 
government (G), or public (P) 
representatives 

A,C, G,P 

G 

A,G 

A 

G 

A,G 

A,G 

G 

Obligatory domestic field 
testing 

/ 

x d 

/ 

x d 

/ 

/ e 



Obligatory prior approval in 
export country 


X 

/ 

X 


/ 

/ 


Compulsory compliance with 
food-safety requirements 

/ 

/ 

/ 

/ 

/ 

/ 

/ 

x f 

Socioeconomic impacts 
considered 

/ 

X 

X 9 

X 

/ 


/ 

X 

Compulsory variety 
registration 

/ 

/ 

/ 


/ 




Mandatory post-market 
monitoring 11 


X 


/ 

/ 



X 

Time- or spatially restricted 
authorization 


/ 


/ 

/ 



X 

Public consultation (days) 




30 

30' 

60 

30 

/ 


a ln contrast to all other countries, Canada relies on the concept of novelty to trigger regulatory oversight and has declared that all plants 
derived through genetic modification are considered novel 

b The European Commission sends its draft approval to the Council of Ministers (agricultural or environmental ministers), which has three 
months to reject or adopt it. If they do not act within this time, the Commission may adopt its own decision and authorize the new GM 
product 

c For those GM plants producing their own pesticide, the evaluation is coordinated between APHIS and the USEPA 

d Where data from field studies on other continents are supplied, the applicant should submit a reasoned argument that the data is 

applicable to domestic conditions 

e For local applications only, not for applications for import 

f Developers of GM crops engage in a voluntary, but recommended, consultation process with the USFDA (voluntary pre-market review). 
This process is currently under review 

9 Taking socioeconomic considerations into account during the risk assessment process is not legally required in China [30] 

V - Mandatory requirement for approved post-marketing monitoring plans and reporting. X - No specific approval requirement, but the 
developer is expected to monitor for existing and emerging risks that may be associated with its product and notify the regulatory 
authorities whenever new information is uncovered 

'Thirty-day time period is provided for public consultation after the formal approval 
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Regulators in India, Europe, and the Philippines, on the 
other hand, have taken a more cautious approach based 
on guaranteeing a very low level of risk to human 
health and the environment. They have therefore 
imposed strict control measures on approval and mar¬ 
keting of GMOs and GM products [5]. While China 
had initially moved quickly on the approval of GM 
crops for environmental and commercial releases, the 
approval process has slowed considerably since 2000, 
and strict regulations have been implemented for GMO 
imports [15]. 

Further differences are obvious. Process-based reg¬ 
ulation is the rule in almost all countries that have 
developed national biosafety regulatory systems. Even 
in countries employing a product-focused RA process, 
the scope of regulatory oversight is defined by the 
process of genetic modification. Canada is the only 
country in which regulatory oversight is triggered 
solely by the novelty of the trait(s) expressed by plants, 
irrespective of the means by which the novel traits were 
introduced - an approach that is most consistent with 
the scientific principle that the risks associated with 
GM crops are not inherently different than those 
posed by more conventional crops [117, 118]. Indeed, 
the US National Research Council has explicitly 
recommended using objective compositional changes, 
not breeding method, as the basis for regulatory scru¬ 
tiny and even then, only “when warranted” [119]. 
India, Argentina, Canada, most EU countries, and 
South Africa have all used non-statutory guidelines to 
manage the environmental impact of GM crops before 
promulgating new acts or regulations. There is no 
evidence that this approach has ever compromised 
environmental safety. India is the exception among 
the study countries in locating its biosafety decision¬ 
making authority solely within the ministry responsible 
for the environment, while the Ministry of Agriculture 
predominates among the remaining study countries. 
As a rule, environment ministries have a more 
precautions or preventative approach to introducing 
new technologies, as compared to those with the 
responsibility for agriculture. Different structural 
approaches are used to secure the necessary scientific 
advice for the decision-making process. The EU has 
implemented a system of expert advisory committees, 
while others, such as India, Canada, and the USA, rely 


primarily on scientists and professionals working 
within government departments and agencies. Other 
countries, e.g., Argentina, China, and South Africa, 
have a combination of both. Only India and the EU 
mandate post-market monitoring in attempts to gauge 
the impacts of the introduction of GM crops over 
the long-term as well as larger spatial scales. Other 
countries may address this indirectly by authorizing 
time-limited or geographically limited introductions 
(e.g., Canada), whereas the USA does neither. 

Conclusion and Future Directions 

Governments have an important role in ensuring that 
novel foods are safe for human consumption and that 
novel agricultural inputs do not cause major negative 
impacts on the environment and long-term agricul¬ 
tural production. The adoption of biosafety regulatory 
frameworks is a challenging task since decision¬ 
makers are faced with numerous difficulties, such as 
ever-evolving technology, which can quickly render 
specific regulations obsolete. Countries and policy¬ 
makers are responding to these various problems with 
different legislative and policy-based strategies. As seen 
above, most countries, with the notable exception of 
the USA, consider GM crops to be novel foods, regard¬ 
less of the characteristics of their final product. Hence, 
new laws and institutions to regulate potential bio¬ 
safety and food-safety issues have and continue to be 
established, requiring that GM products be approved 
before they may be grown in, consumed in, or 
imported into a country. Concurrently, public opinion 
in many parts of the world still regards the use of GM 
crops as controversial. Concerns about new risks have 
led biosafety, food-safety, and labeling regulations to 
become complex and costly, with no tiered mecha¬ 
nisms in place to regulate the various and different 
GMOs based upon the level of risk presented and the 
amount of regulatory experience gained with similar 
products. As a result, regulation has evolved which 
implicitly assumes that all GMOs have the ability to 
present the same (high) risk unless proven otherwise, 
requiring the over-production of data of questionable 
value to decision-making. This regulatory position has 
become a real threat to the future development of GM 
crops in the non-corporate and public sectors, 
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especially for those subsistence crops involving toler¬ 
ance to abiotic stress and higher nutrient contents 
being specifically developed for the benefit of farmers 
and consumers in the developing world. Should 
regulatory safety standards be set to an impossibly 
high threshold (e.g., at zero risk), these GM crops are 
unlikely to be approved in those countries who stand to 
gain most from their potential benefits. 

The requirements of setting up effective and effi¬ 
cient regulations and legislative systems pertaining to 
products of rDNA technology inevitably involve addi¬ 
tional costs, e.g., the development and maintenance of 
institutions, procedures, and management tools, costs 
which many developing countries cannot afford. Even 
should developing countries decide to form their reg¬ 
ulatory frameworks by adapting regulatory guidance 
already implemented elsewhere, cost sharing still 
carries a financial burden. Once a system is in place, 
other relevant costs include the cost of compliance with 
biosafety regulations and risk management conditions, 
as well as the economic, environmental, and health 
costs related to delayed access to new technologies 
and products and their associated benefits. The variety 
and disparity of potential frameworks call for a nor¬ 
malization of information requirements and, wherever 
synergies and cooperation mechanisms are promoted, 
a harmonization of the scientific and technical aspects 
of regulatory oversight at the national and subregional 
level. As such, an expanded use of internationally 
accepted consensus can promote the acceptance of 
regional approaches to regulation. Examples include 
OECD documents on scientific aspects of risk assess¬ 
ment, as well as guidelines issued by international stan¬ 
dard-setting bodies such as the International Plant 
Protection Convention and the Codex Alimentarius. 
Programs that create networks on regional and subre¬ 
gional levels may also facilitate the acquisition and dis¬ 
semination of biosafety expertise at reduced costs. 
Providing useful, relevant information is a worthwhile 
task, but it can also prove expensive to provide training 
on how best to utilize such information in decision¬ 
making. It is therefore imperative that external funding 
streams continue to be directed toward supporting the 
establishment of effective regulatory systems in develop¬ 
ing countries that respond to national needs and poli¬ 
cies, until such time that they become self-sustainable. 


Disclaimer 

Opinions and views expressed in this chapter are 
strictly those of the authors and may not necessarily 
represent those of the organizations where the authors 
are currently employed. 
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Glossary 

Community-owned water resources Water sources 
that can be easily found within the boundaries of 
a community such as rainwater or reclaimed water. 
Non-potable water Water use for all applications 
except a direct potable application such as drinking. 
Potable water Water that is suitable for any kind 
human consumption. Simply, the term can be 
defined as drinking water. 

Public perception As applied to water aggregate of 
individual attitudes or beliefs held by general public 
on water or wastewater. 

Rainwater harvesting Collection, storage, and use of 
rainwater for various water applications. 
Reclaimed water Already used water that has been 
recovered by various treatment processes to meet 
specific water quality criteria. 

Reuse As applied to water, the use of treated water for 
a beneficial use including domestic, urban, indus¬ 
trial, or agricultural application. 
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Runoff water Part of the precipitation that appears 
in surface and does not soak into the ground 
where it falls. 

Sustainable water use Optimizing the benefits of 
present water use without diminishing the ability 
of future generation to meet their water needs. 

Definition of the Subject and Its Importance 

Observational records and climate projections provide 
abundant evidence that climate change affects the 
hydrological systems such as changing precipitation 
patterns, intensity and extremes, increasing evapora¬ 
tion, changes in soil moisture and runoff, etc. Thus, 
global freshwater resources are being under growing 
pressure owing to climate change effect on hydrological 
cycle. Population growth, rapid economic develop¬ 
ment, and land use change along with climate change 
bring water issues to the forefront. The insufficient 
water availability may cause the degradation in 
human health, ecosystems, agricultural and industrial 
output, and increase the potential of water-related con¬ 
flict. Adaptation measures have to be pursued to deal 
with the water scarcity problem and to manage water 
resources sustainably. It is better to explore more stable, 
economical, and environmental friendly ways to meet 
water demand through the proper evaluation. Rainwa¬ 
ter and reclaimed water have been receiving growing 
interests as complementary water sources for urban 
communities. 

Water quality (WQ) is the most important issue in 
such complementary waters that determines the 
acceptable and safety use for a given application. How¬ 
ever, general public often does not understand the 
quality difference among different types of possible 
water resources and hence many communities do not 
have a positive image on alternative water sources such 
as reclaimed water and rainwater. Complexity of 
understanding the WQ causes people to hesitate use 
of reclaimed water or rainwater. Therefore, it is impor¬ 
tant to provide WQ information from trusted scientific 
and regulatory sources in a simple manner so that 
general public can easily understand the quality char¬ 
acteristics of water resources for different water use. 
The proposed WQ information platform integrates the 
multivariate nature of WQ data by providing scoring 
system that will logically combine all WQ parameters 


and provide a simple and easily understood description 
of community-owned water resources. 

Introduction 

Impact of Climate Change on Freshwater Resources 

The United Nation Framework Convention on Climate 
Change (UNFCCC), in its Article 1, defines “climate 
change” as a change of climate which is attributed 
directly or indirectly to human activity that alters the 
composition of the global atmosphere and which is in 
addition to natural climate variability observed over 
comparable time periods [1]. The scientific panel of 
International Panel on Climate Change (IPCC) 
revealed that the global average surface temperature 
has increased since 1861 and over the twentieth cen¬ 
tury, the increase has been 0.6°C zb 0.2°C [2]. Knowing 
the future global threat, at the United Nations Climate 
Change Conference in Copenhagen in the end of year 
2009, world leaders agreed to cap the global tempera¬ 
ture rise by committing to significant emission reduc¬ 
tions, and to raise finance to kick-start action in the 
developing world to deal with climate change [3]. 
Therefore, it is now well evident that climate change 
is already happening and going to be one of the greatest 
environmental, social, and economic threats facing the 
whole world. 

While many factors continue to influence climate, 
scientists have shown that human activities are the 
most dominant force, and responsible for most of the 
climate change observed over the past 50 years [4]. 
Human activities contribute to climate change by caus¬ 
ing changes in Earth’s atmosphere in the amounts 
of greenhouse gases, aerosols, and cloudiness [5]. 
Increases in the concentrations of greenhouse gases 
and aerosols will affect climate by altering incoming 
solar radiation and outgoing infrared radiation that are 
parts of Earth’s energy balance [6]. The overall effect 
has caused to warm the lower atmosphere and earth 
surface and is likely to continue to increase the mean 
temperature about 1°C by 2025 and 3°C by the end of 
the next century [7]. The impacts of climate change are 
across many sectors including health, ecosystem, agri¬ 
culture, and water. The Second Climate Conference, 
which was held in 1990, in Geneva, recognized that 
among the most important impacts of climate change 
were its effects on the hydrological cycle and on water 
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management systems [8]. Thus, climate change and its 
potential impacts on water resources have become an 
increasingly common topic among the general public, 
engineers, water managers, as well as scientists. 

Since the water is involved in all components of the 
climate system including atmosphere, hydrosphere, 
cryosphere, land surface, and biosphere, climate 
change affects water through many ways. Climate 
change observed over the past several decades is poten¬ 
tially associated with changes in several processes of the 
hydrological cycle and systems such as changing pre¬ 
cipitation patterns, intensity and extremes, widespread 
melting of snow and ice, increasing atmospheric water 
vapor, increasing evaporation. It leads to changes in 
soil moisture and runoff [9-11]. 

Quantity The Earth’s surface radiates part of the 
solar energy back to the atmosphere in the form of 
infrared radiation. Most of this infrared radiation is 
trapped in the atmosphere by greenhouse gases, such 
as carbon dioxide, methane, and nitrous oxide and 
directed back toward the Earth’s surface. This process 
causes warming the planet and ultimately will lead to 
change in hydrological cycle. As suggested by some 
research, warmer temperature will cause more water 


to escape from earth surface and return to atmosphere 
through evaporation and transpiration. Thus, precipi¬ 
tation changes are expected and will be highly complex 
and variable than the usual pattern. 

Figure 1 shows the unstable precipitation patterns 
in Japan and it seems that the fluctuations have grad¬ 
ually become larger in last 50 years. The consequent 
change in the hydrological cycle adversely affects the 
stability of groundwater and surface water sources, 
accounting for nearly 31% of total fresh water budget 
in the world. As a result of the overall effects, some 
regions will receive more precipitation, while other 
regions receive less. Therefore, depending on the fluc¬ 
tuating precipitation, severe droughts and floods may 
occur more frequently. The flood frequency was 
projected to increase over many regions except those 
including North America and central to western Eur¬ 
asia, whereas the drought frequency was projected to 
increase globally except northern high latitude, eastern 
Australia, and eastern Eurasia. Both flood frequency 
and drought frequency are projected to increase even 
in the same region of the world. Such regions 
show a decrease in the number of precipitation days, 
but an increase in days with heavy rainfalls [ 13] . At the 
end of the twenty-first century, the annual mean 
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Annual precipitation ratios from 1898 to 2008 in Japan (The Japan Meteorological Agency [12]). Note: The bars indicate 
annual precipitation ratios to normal (1971-2000 average) averaged of the 51 stations. The green line indicates 5-year 
running mean 
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precipitation, evaporation, and runoff increase in high 
latitudes of the Northern Hemisphere, southern to 
eastern Asia, and central Africa, whereas they decrease 
in the Mediterranean region, southern Africa, southern 
North America, and Central America [14]. Changes in 
precipitation patterns and evapotranspiration will also 
directly affect the soil moisture condition. In a summer 
season, high evaporation may exceed the precipitation 
and hence subsurface water storage would act as 
a moisture source to the atmosphere. This may cause 
the upward movement of water from shallow aquifers, 
resulting in a steep decline of the water table. 

As climate change increases ocean temperatures, 
the seawater will expand, contributing to sea level rise 
due to thermal expansion. The IPCC has found that the 
global mean sea level has risen at an average rate of 
about 1.7 ± 0.5 mm/year during the twentieth century 
[15]. Changes in sea level will increase coastal erosion, 
and flooding of wetlands and lowlands, leading to 
significant impacts on the availability of freshwater. 
Climate changes will also have impact on saltwater 
instruction into aquifers due to the sea level rise, and 
thus adversely affect the groundwater in coastal aqui¬ 
fers [16]. 

Quality In addition to the water availability, changes 
in climate could lead to important changes in water 
quality as well. Current changes in climate possibly 
influence water quality by altering the balance between 
atmospheric, terrestrial, and aquatic processes in 
a watershed in a variety of ways. Frequent heavy pre¬ 
cipitation in some regions may cause wash-off of more 
nonpoint pollutants and soil erosion into water bodies 
further deteriorating the water quality. 

Increasing water temperature caused by increased 
air temperatures may affect the reaction rate in biogeo- 
chemical processes and the chemical properties of 
water [11, 16, 17]. Among water quality parameters, 
dissolved organic matter, nutrients, micropollutants, 
and pathogens are susceptible to rise in concentration 
because of temperature increase [18]. Since warmer 
water can hold less oxygen, global warming would 
lead to lower dissolved oxygen contents in surface 
waters [19]. The warmer temperature will also foster 
algal blooms [20] in water bodies. In addition, evapo¬ 
rative water losses could increase the salinity of surface 
waters, especially in lakes and reservoirs. Warmer water 


combines with excess nutrients washed by heavy rain¬ 
falls and increase of algae, bacteria, and fungi content 
will ultimately deteriorate the ecosystem of lakes, 
streams, and their biological communities. Water qual¬ 
ity impairment has been also observed in rivers, such as 
increased BOD and NH 4 , because of the loss of stream 
dilution capacity and low water velocities derived from 
the reduced stream flows [21]. Increased salinity is 
another potential problem in rivers and estuaries, 
where streamflows are reduced. Figure 2 is a simple 
illustration showing the influence of climate change 
on water quality of freshwater resources. 

Adaptive Measures Against Climate Change Effect on 
Water Resources 

There is absolutely no doubt that climate change is 
possibly the major environmental challenge for the 
coming decades. It is also well evident that climate 
change has the potential to be affecting the quantitative 
and qualitative status of freshwater resources by 
altering hydrological cycles with wide-ranging conse¬ 
quences for human societies and ecosystems. There¬ 
fore, in order to cope with the climate change effect on 
water resources, there have been two main responses: 
mitigation and adaptation. Mitigation is needed in 
order to prevent future anthropogenic interference 
with the climate system that can influence water 
resources and their management. On the other hand, 
adaptation is an adjustment in natural or human sys¬ 
tems to a new or changing environment, including 
water resources. Adapting or coping with climate 
change will therefore become necessary in sustainable 
water use. Adaptation measures may differ across and 
within local, national, regional, or global level and must 
be addressed under complex and uncertain conditions. 
These measures can be undertaken at different scales 
and may vary from individual households and local 
communities to a range of public and private sectors 
through policies, investments in infrastructure and 
technologies, and behavioral change. 

As the impacts of climate change on water resources 
begin to be realized, attention on adaptation technol¬ 
ogies is also becoming increasingly important and 
must be increased as a matter of urgency. The report 
of Working Group II, Fourth Assessment of IPCC 
(2007), addresses the future impacts of climate change 
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Influence of climate change on water quality of freshwater resources 


c 


on various sectors including water resources and the 
potential for adaptation and the implications for sus¬ 
tainability [22]. Adaptation to climate change impact 
on water resources can adopt a variety of forms, such as 
water demand control through better education, train¬ 
ing, and awareness of climate change as well as more 
technical measures for water resource management, 
water withdrawal, transportation, treatment, supply, 
water uses, etc. Some possible adaptation initiatives 
that take into account the potential impacts of climate 
change on water resources can be listed as follows. 

1. Water demand and supply management 

(a) Water demand management through metering 
and pricing 

(b) Water leakage reduction 

(c) Efficient water usage in agriculture 

2. Water resource conservation 

(a) Soil moisture conservation 

(b) Coastal defense design and implementation to 
protect water supply against contamination 

(c) Eutrophication control 

(d) Conservation of groundwater through artifi¬ 
cial recharge 

(e) Conservation of upstream forest 

3. Enhancement and development of new water 
resources 

(a) Stimulate artificial rainfall 

(b) Seawater desalination 


(c) Reclaimed water usage and rainwater 
harvesting 

4. Other measures 

(a) Public participation in water-saving activities 

(b) Education for sustainable water use 

(c) Continuous water quality monitoring 

Climate change effect, coupled with current water 
use patterns and increased freshwater demands with 
increased population, will worsen the water challenges 
already existent. Therefore, coping with the effects of 
climate change on water will require better demand 
management measures to enhance efficiency of water 
use. There are ranges of demand management tech¬ 
niques that may be engaged to reduce water use and 
encourage greater productivity of water usage. Reduc¬ 
ing the quantity of water required to accomplish 
a specific water application is one of effective way to 
mange water demand. For example, toilet flushing or 
landscaping does not require high-quality water, used 
for potable purpose. Water metering and pricing 
assures the optimal allocation of water as it provides 
a clue to both water producers and consumers, hence 
indirectly reducing the water consumption. Public par¬ 
ticipation in water-saving activity plays a major role in 
adaptation measures. Significant water savings can be 
achieved by introducing a dual water supply system 
such as toilet flushing by reclaimed water. It is required 
that communities, scientists, and policy makers work 




































2324 


c 


Community-Owned Water Resource and Climate Change, Quality Management 


together to implement adaptation activities. Therefore, 
better information, communication, and public aware¬ 
ness are necessary to produce changes in the behavior 
of water users to complement sustainable water use. 
Moreover, the development of public understanding 
on the links between climate change and its impacts 
on water resources is vital in order to promote adaptive 
behavior to climate change, which are happening now, 
and potential future impacts. 

Reducing the water loss in distribution system is an 
important adaptation practice. Control of water leak¬ 
age is one of the most critical aspects in sustainable 
water use. On the other hand, leakage is an energy loss 
that results in unnecessary carbon dioxide emission, 
ultimately contributing to global climate change. 
Detecting and repairing water leaks in Tokyo metro¬ 
politan area has reduced the amount of water wastage 
from 8.0% (1998) to 3.1% (2008) [23]. 

The water available for agriculture becomes limit¬ 
ing due to competition with other water users, precip¬ 
itation changes, and water quality degradation. Since 
the agricultural sector is the largest water user, efficient 
water usage in agriculture will be essential. Educating 
efficient water use within farming communities is 
also an important factor so that they could adapt to 
climate change. In addition, better information on 
challenges and solutions should be provided toward 
improvement in the agricultural irrigation systems 
through better maintenance and adapting the schedul¬ 
ing of plant watering. 

Groundwater resources are vulnerable and have the 
potential to be strongly impacted by climate change 
effect in terms of quantity as well as quality. Moreover, 
rapid urbanization with land use changes has decreased 
significantly the infiltration rates of rainfall into the 
soil, which causes to diminish the natural recharging 
of aquifers. One of adaptive methods of sustaining 
groundwater resources is recharge of aquifers naturally 
as well as artificially. Rainwater or reclaimed water have 
been successfully used for augmentation of groundwa¬ 
ter in many regions. In Tokyo metropolitan area, infil¬ 
tration facilities have been constructed with the 
objective of groundwater recharge as well as storm 
water runoff control [24]. Artificial groundwater 
recharge with reclaimed water has been effectively 
practiced in Orange County, California and 


contributed to the prevention of water shortages in 
the area [25]. 

Seawater desalination can produce good quality 
water, which can be used as even potable water. One 
of the leading desalination methods is reverse osmosis 
technique using membranes that filter out the salt, but 
the main drawback is high-energy consumption as well 
as the conventional desalination by distillation method. 
The high-energy need ultimately will result in acceler¬ 
ation of greenhouse effect. However, it is being discussed 
now the possibility of coupling the desalination technol¬ 
ogy with renewable energy such as solar, wind, etc. [26] . 

Domestic and municipal applications of rainwater 
harvesting and water recycling and reuse for non- 
potable usage have been receiving increased interests 
as a reliable solution to cope with climate change and 
rapid urbanization issues. Reclaimed water use has 
been successfully implemented in several countries 
such as US and Japan for a number of years (see section 
“Reclaimed Water Use” in this article). Thus, wastewa¬ 
ter reclamation and reuse can be used as economically 
promising adaptation measure for water scarcity and 
it plays a significant role to achieve a sustainable 
development, especially in arid regions. Rainwater 
harvesting as a method of utilizing rainwater for both 
potable and non-potable use is widely used throughout 
the world. It is a technique that has been used since 
ancient times and is increasingly being accepted as 
a practical method in urban and peri-urban areas, 
where water supply service is still limited (see section 
“Rainwater Use”). One of the advantages of rainwater 
harvesting is very simple to use and consumes less 
energy. In a feasibility study in Tokyo, it has been 
estimated that rainwater usage for non-potable water 
application could reduce the demand for traditional 
potable water supply by 77% while cutting down 
the emission of greenhouse gas C0 2 by as much as 
36% [27]. 

As part of adaptation against climate change effect 
on water resources, the potential use of supplementary 
water resources such as rainwater, brackish water, gray 
water, even reclaimed water for the purposes of non- 
potable application will need to be assessed. This paper 
discusses how to assess rainwater and reclaimed water, 
available for urban communities. These alternative 
water resources in this paper are defined as 
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“community-owned water resources” and it is impor¬ 
tant to evaluate their water quality from the view¬ 
point of sustainable water use. A water quality 
labeling approach is explained focusing on a case 
study in Japan, where they are deeply dependent on 
surface water resources. 

Water Availability in the Future 

Climate change is regarded as one of many processes 
affecting water availability. Population growth, eco¬ 
nomic development, land use change, aging infrastruc¬ 
ture, and changing social values along with climate 
change bring water scarcity to the top problem. As 
depicted in Fig. 3, it is also anticipated that the future 
increase in world populations is expected in developing 
countries, especially in Asia and Africa. With the urban 
population increases, remarkable increase of water 
demand by population growth and elevated water con¬ 
sumption per capita for these regions can be expected. 
This may also lead to increased competition for water, 
and conflicts between water uses and users [19]. 

The second assessment report of the IPCC 
(1995) discussed the uncertainties on water availability 


as to the impacts of population and economical 
growth, and climate change and Fig. 4 depicts the 
water availability per capita in 21 countries as of 1990 
and forecasts for 2050 [29]. 

According to Falkenmark’s water crowding indica¬ 
tor Aw (= Q/C; where Q and C are renewable freshwater 
resources and population) [30], regions with more 
than 1,700 m 3 /capita/year were categorized as “no 
water stress,” which means that only local water man¬ 
agement problems occur. Below this threshold limit, 
countries may begin to experience frequent water stress 
issues. The range of 1,000-1,700 m 3 /capita/year indi¬ 
cates “water stress,” while 500-1,000 m 3 /capita/year 
means “water scarcity” and less than 500 m 3 /capita/ 
year “absolute water scarcity.” As in the Fig. 4, some 
countries, particularly in Africa, Asia, and the 
Middle East, will shift from “no water stress” toward 
“water stress” or “water scarcity” as combination 
results of population and economic growth and climate 
change effects. It is noticeable that if the current con¬ 
sumption trends continue, several countries will 
face water availability limitations by the middle of 
this century. 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 3 

Projected world population (Adapted from population data from population division of the department of economic and 
social affairs of the United Nations, 2009 [28]) 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 4 

Forecast for water availability (m 3 /year/capita) in 2050 for the present climatic conditions (Adapted from the second 
assessment report of the IPCC (1995) cited in [29]) 


Climate change is expected to accelerate water 
cycles and thereby increase available renewable fresh¬ 
water resources. As a result, the available renewable 
freshwater resources are expected to increase at 
a higher rate than water demand, calculated from pop¬ 
ulation and economic growth. Because of this, two 
conventional water stress indexes, Falkenmark’s water 
crowding indicator Aw (<1,000 m 3 /year/capita) and 
water scarcity index Rws (= [W — S]/Q > 0.4; Wand 
S, are water withdrawal and water generated by desali¬ 
nation) showed that climate change contributes to slow 
down increase of the number of people living under 
water stress on a macro scale [31]. It is, however, 
important to note that these indexes are based on 
estimates of annual average volume. As described 
above, precipitation will become more intense and 
intermittent, and the risks of both floods and droughts 
will increase even in the same region owing to 
climate change. Oki and Kanae pointed out that sev¬ 
eral anticipated impacts of climate change on water 


resources, such as seasonal variation, degradation of 
water quality, and associated changes in water resource 
management, are not taken into account and that these 
changes in risks are not well considered in current 
global assessments on future water resource manage¬ 
ment [31]. 

Future challenges are required to respond to such 
anticipated impacts of climate changes and changes in 
risks. Water engineers and managers as well as different 
water users have to deal with those challenges in the 
development of sustainable water resource manage¬ 
ment. The achievement of the sustainability is required 
ensuring a long-term supply of water with adequate 
quality for all designated purposes minimizing adverse 
economic, social, and ecological impacts [32]. It is very 
clear now that merely dependence on a single water 
resource and centralized piped system cannot cope 
with these challenges for sustainable water use. These 
uncertainties stimulate research on alternative ways of 
providing water for a community. Reclaimed water and 
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runoff water are two types of promising water resources 
that can be adapted as complementary sources. 

Importance of Community-Owned Water 
Resources 

General Status of Rainwater and Reclaimed Water 
Use in Water Balance 

Understanding of water balance is essential for effec¬ 
tive water management including water supply and 
sewerage systems. The water balance approach helps 
us to evaluate the significance of rainwater and 
reclaimed water. 

As an example of increase of water demand by 
population increases, water balance in Tokyo 23 
wards and Japan are shown in Fig. 5. The district of 
Tokyo 23 wards is the most populated metropolitan 
city, and the population density is 14,000 persons/km 2 . 
Amount of water use (m 3 /year) is divided by the 
city area (km 2 ) and illustrated at the unit of mm/year. 
It is hence easy to compare the amount of water use 
with that of precipitation. The average annual precip¬ 
itation in Tokyo 23 wards is 1,467 mm, and is distrib¬ 
uted to 906 mm (62%) of runoff, 438 mm (30%) of 
evapotranspiration, and only 123 mm (8%) of infiltra¬ 
tion. Since the imperviousness is estimated to be 
greater than 80% in Tokyo 23 wards, the infiltration 


volume (123 mm) is remarkably lower in Tokyo 23 
wards than the nationwide average (361-542 mm), 
which is approximately 20-30% of the precipitation. 
On the other hand, total water demand in Tokyo 23 
wards is 1,811 mm and larger than precipitation 
(1,467 mm). Large amount of water is supplied 
from outside of Tokyo 23 wards to meet the water 
demand. In response, water recycling has been much 
higher in the field of domestic water in Tokyo, while the 
recycling level is similar for Tokyo and nationwide 
average. 

Figure 6 shows temporal trends of three types of 
reused water in Tokyo; building-scale, community- 
scale, and large-scale. All types of reused water have 
been increasing, especially within a recent decade. 
Reuse rates of domestic water (building-scale + 
community-scale), and reclaimed water and rainwater 
are 2.4% (43/1,811 mm), 2.0% (56/2,816 mm), and 
0.05% (0.8/1,467 mm) in Tokyo 23 wards, respectively 
and much higher than nationwide averages (0.044%, 
1.4%, 0.001%), as shown in Fig. 5. Nonetheless, the 
reuse rates are not high enough to cover the large 
amount of the water demand in Tokyo 23 wards. 
To minimize the risks of droughts, it is now expected 
to increase community-owned water resources 
such as reclaimed water and rainwater as well as 
groundwater. 


c 



Community-Owned Water Resource and Climate Change, Quality Management. Figure 5 

Water balance in (a) Tokyo (23 Wards) and (b) Japan [unit: mm/year]. DWTP drinking water treatment plant, WWTP 
wastewater treatment plant, PT primary treatment, ST secondary treatment, AT advanced treatment (Data were quoted 
with modifications from [32-34]) 

































































2328 


c 


Community-Owned Water Resource and Climate Change, Quality Management 


120000 - 


Large-scale ■ Community-scale □ Building-scale 


100000 - 


& 

P 80000 


£ 60000 - 


~o 

8 40000 

=3 

CD 

oc 

20000 - 



O CM ^ (D CO 

00000000000 ) 0 ) 0 ) 0 ) 0)00000 
0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0 ) 0)00000 
1— 1— ■»“ 1- T- T- T— 1— 1— 1— T— T— C\1C\1C\1C\1C\1 

>>>>>>>>>>>>>>>>> 


Community-Owned Water Resource and Climate Change, Quality Management. Figure 6 

Temporal trends of reused water in Tokyo (Adapted from [35]) 


Reclaimed Water Use 

It is better to explore more stable, economical, and 
environmental friendly ways to meet water demand 
through the proper evaluation. Reclaimed water use 
has been practiced in various ways for decades in 
several countries. The rapid development of new tech¬ 
nology in wastewater treatment, such as membrane 
technology, makes it possible to deal with various con¬ 
taminants including pathogenic organisms and viruses 
present in wastewater, gaining more attention toward 
the reclaimed water. 

As shown in the Fig. 7, reclaimed water has been 
identified as a readily available source that covers 
a wide range, including industrial, residential, recrea¬ 
tional, and environmental enhancement purposes. 
Reclaimed water use has several advantages; it reduces 
demand for higher quality water; it reduces wastewater 
discharge, subsequently reducing water pollution; and 
it is economically efficient since it minimizes the water 
transporting cost from distant sources [36, 37]. 

However, reclaimed water or reuse water systems 
may also link with some drawbacks. Additional costs 
and energies are needed for advanced treatment and if 
not well managed, reclaimed water may generate num¬ 
ber of risks associated with public health. 

Geographic, climate, and economical factors dic¬ 
tate the degree and form of water reclamation and reuse 


in different regions or countries [38]. Table 1 demon¬ 
strates the water reuse quantity in Israel, Italy, Japan, 
South Korea, Spain, and USA. As in the Table 1, the 
water reuse in Israel is considered as one of the highest 
rates and water reuse per capita in the world [39, 40]. 
Spain shows by far the highest water reuse rate in 
European Union (EU) followed by Italy [41]. In terms 
of quantity, USA is the largest producer, with a volume 
of reused water estimated as 3,650 million cubic meters 
per year [42]. 

Figure 8 shows breakdown of reclaimed water use 
in Japan, South Korea, and California. Although the 
water use category is different among three countries, 
reclaimed water can be applied in agriculture, industry, 
maintenance of river water level, groundwater 
recharge, and municipal use including toilet flushing, 
and landscape irrigation. In Japan, reclaimed water is 
mainly used for maintenance of water levels of rivers, 
landscaping, and snow melting. The total amount of 
wastewater treated in wastewater treatment plant 
(WWTP) is 14.1 billion cubic meters per year in 
Japan. However, reuse rate is only 1.4% (0.19 billion 
cubic meters). In Korea, total amount of wastewater 
treated is 6.6 billion cubic meters per year. Reuse rate is 
4.7% (0.31 billion cubic meters). Reclaimed water is 
mainly used to maintain water level of rivers, similar to 
Japan. However, the rate of reclaimed water used for 
agriculture is higher in South Korea than in Japan. 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 7 

Tree of water resources recycling [43] 


Community-Owned Water Resource and Climate 
Change, Quality Management. Table 1 Water reuse sta¬ 
tus in some countries [42, 44, 45] 


Country 

Water reuse quantity 

Reuse water 

Percentage 

(%) 

Total quantity 
(million m 3 /year) 

amount per 

capita 

(L/day) a 

Israel 

83 

280 

107 

Italy 

7 

230 

11 

Japan 

1.4 

200 

4 

South 

4.7 

310 

18 

Korea 




Spain 

12 

350 

21 

US 

6 

3,650 

32 


a Based on population data from population division of the 
department of economic and social affairs of the United Nations, 
2009 [28] 


It is known that reclaimed water use for agriculture 
is remarkably high in the State of California as shown 
in Fig. 8. By the end of 2001, reclaimed water use in 
California had reached over 0.65 billion cubic meters 


per year [46]. Compared to Japan and Korea, agricul¬ 
tural and landscape irrigation is the dominant use of 
reclaimed water in California, which accounts for more 
than two third of the total water reuse by volume. 
Artificial groundwater recharge is also practiced suc¬ 
cessfully in California. Groundwater replenishment 
system at the Orange County, California is a frontier 
region where the largest water reclamation facilities 
have been operated with advanced treatment process 
that includes reverse osmosis, and treatment by ultra¬ 
violet light and hydrogen peroxide [25, 46]. 

Reclaimed water is also being used to maintain 
constructed wetlands, to enhance natural and artificial 
wetlands, and to sustain or augment stream flows. 
Reclaimed water has been successfully used in Tokyo 
for regeneration and restoration of the dried-up rivers 
and waterways. This gives multiple benefits such as 
restoring ecological habitats, enhancing landscape, 
and providing opportunity for citizens for environ¬ 
mental education (Fig. 9). 

Another new type of water reuse application is 
practiced in Tokyo. Extensive built-up of subways 
and underground structures have occupied large 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 8 

Percentage and purpose of reclaimed water use in Japan, South Korea, and California [44-46] 


surface volume in Tokyo and hence raise the ground- 
water level causing seepage. Railway companies 
collect this seepage water in tunnels and drain into 
sewerage system with payment. This was considered 
as waste of good quality water and additional 
financial burden for the railway companies and 
extra water to treatment system. Recently, collected 
seepage water began to be introduced into neighbor¬ 
ing recreational ponds and waterways for their 
restoration. 

Rainwater Use 

Rainwater is widely recognized as a valuable water 
resource and it can provide a readily accessible water 


supply. Rainwater harvesting systems vary from the 
small and simple to the large and complex ones. 
Where a suitable harvesting surface exists, such as 
roof or even road, rainwater collection can provide 
sufficient and low-cost source of water for non-potable 
usage. Effective use of rainwater as a supplementary 
water supply can indirectly be beneficial for flood con¬ 
trol too. Generally, rainwater can be regard as better 
quality water than treated wastewater and is likely to be 
suitable for wide range of potable and non-potable 
applications, including cooking, laundry use, toilet 
flushing, car and equipment washing, garden irriga¬ 
tion, cooling and heating, and many industrial pro¬ 
cesses. Rainwater usually gets better public acceptance 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 9 

Created waterways using reclaimed water in urban zones, which function as amenity and community activity places 



Community-Owned Water Resource and Climate Change, Quality Management. Figure 10 

Rainwater harvesting at (a) Kokugikan for Sumo wrestling and (b) Tokyo Dome Stadium 


for domestic use than reclaimed water. However, it is 
important to realize that air pollutants, particularly in 
urban areas, from industrial discharges, vehicle exhaust 
fumes, and agricultural chemicals used at farms or in- 
home gardens may wash off with rainwater. Moreover, 
dust, leaf debris, animal feces, and leaching from roof¬ 
ing, guttering and plumbing materials, or materials 
of storage tank may further degrade the quality of 
rainwater. 

Roof runoff for miscellaneous use such as toilet 
flushing and water cooling is successfully employed in 
various regions in the world, in an individual scale as 
well as in a large scale. Although road runoff generally 
has higher pollutant concentrations than roof runoffs, 
there is a trend to use the relatively polluted rainwater 
draining from streets and courtyard surfaces as service 
water in some countries [47] . Roof runoff water collec¬ 
tion can be successfully practiced under different scales 


in both urban and rural areas. A frontier attempt of 
the large-scale harvesting in Japan can be seen at 
Kokugikan (indoor Sumo stadium in Tokyo’s Ryogoku 
area) for Sumo wrestling as shown in on Fig. 10a. This 
system was introduced in 1984 and utilizes a tank with 
effective capacity of 750 m 3 to store the rainwater 
collected from the roof with an area of 8,400 m 2 . The 
collected water is used for cooling system and toilet 
flushing. Other early examples are City Office Building 
in Sumida ward, Tokyo in 1988, and Tokyo Dome 
Stadium in 1988. Figure 10b shows the system at the 
Tokyo Dome Stadium having a storage tank with 
a capacity of 3,000 m 3 . The maximum capacity for 
storage is controlled at 2,000 m 3 . It means that one 
third of its capacity is used for flood control (this capac¬ 
ity is left empty for the case), another 1,000 m 3 for 
miscellaneous water use, and the remaining 1,000 m 3 
for urgent water use in earthquake and fire events. 
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Water retention pavement as a countermeasure 
against heat island phenomena is one of the new 
types of rainwater use adopted in Tokyo. Rainwater 
falling on square block flows into side ditch and is 
stored in an underground tank as shown in Fig. 11. 
Water from the tank is then supplied to paving blocks 
via sprinkling pipes. 

Another application of rainwater is groundwater 
recharge via infiltration-type sewerage system. Tokyo 
Metropolitan Sewerage Bureau constructed infiltration 
facilities over 1,400 ha area [24, 48, 49] to cope with 
a heavy rainfall of 50 mm/h and hence provide benefits 
such as groundwater recharge and reduction of non¬ 
point pollution loads from urban surfaces. 

However, rainwater has a limitation as a stable 
water source because the amount largely is dependent 
on the variability of weather and climate. It is also 
expected to greatly vary in precipitation in future due 
to the effect of climate change. Thus, during a long 
drought season, the storage tank may dry up. As rainfall 
is typically unevenly distributed throughout the year, 
rainwater collection can serve as a supplementary 
method to support household and municipal water 
use. Therefore, rainwater may be used along with 
other stable water supply from surface water or/and 
groundwater at household or community-scale level 
from the viewpoint of sustainable water use. 


Standards and Guidelines for Reclaimed Water and 
Rainwater 

It is well evident that global freshwater resources are 
being under growing pressure as a result of rapid 
urbanization, industrial development, and climate 
change effect. As a result, supplementary water 
resources such as reclaimed water can no longer be 
neglected. As discussed in the previous section, 
reclaimed water has been used successfully in many 
regions, where wastewater collection and treatment 
have been the common practice. However, water recla¬ 
mation and reuse activities should be in practice with 
proper attention to the environment, public health, 
and esthetic consideration. Therefore, it is necessary 
to consider use of reclaimed water in terms of not 
only quantity aspect but also quality aspect. The water 
quality should be characterized and evaluated in terms 
of its intended application and practice. 

Standards and guidelines for reclaimed water 
applications have been established in various localities 
and countries, for example, by the World Health 
Organization (WHO), US Environmental Protection 
Agency (USEPA), Australia, and Japan. In general, the 
implementation of standards and guidelines for 
reclaimed water is based on local environmental and 
socioeconomic conditions in each country. However, 
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Rainwater use for water retention block pavement in Tokyo 
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these guidelines may also provide insight for other 
countries or localities. In 1973, WHO first developed 
guidelines for the reuse of effluents, considering 
methods of wastewater treatment and health safe¬ 
guard This guideline provided direction on how 
to protect public health and how to facilitate the 
rational use of wastewater and excreta for agriculture 
and aquaculture purposes [50]. When this criterion 
was revised in 1989 and most recently in 2006, 
more epidemiological and microbiological evidence 
concerning health risks had been taken into account 
[51,52]. 

In 1992, USEPA together with USAID (United 
States Agency for International Development) 
published guidelines for reclaimed water and these 
guidelines were updated in 2004, including many new 
and updated research activities and results, new infor¬ 
mation on treatment and disinfection technologies, 
emerging chemicals and pathogens of concern [53, 
54]. This new regulation and guideline in 2004 also 
summarized the water reuse regulations of various 
states throughout the USA. In general, USEPA suggests 
guidelines on reuse of wastewater for urban, industrial, 
agricultural, recreational and habitat restoration usage, 
groundwater recharge, and augmentation of potable 
supplies [54]. 

Comprehensive national guideline was established 
in Australia in 2006. It was designed for how water 
recycling can be safely and sustainably achieved for 
agriculture, fire control, municipal, residential and 
commercial property, and industry usage. The guide¬ 
lines consider management of risks to human health 
and environmental health and focus on two specific 
situations; water recycled from a centralized sewage 
treatment plant and from gray water [55]. This guide¬ 
line was extended later in 2008, considering the use of 
treated wastewater and storm water to augment water 
resources for potable purpose. The guideline also 
focuses on the source of water, initial treatment pro¬ 
cesses, and blending of recycled water with drinking 
water sources [56]. 

At the moment, there are no guidelines or regula¬ 
tions of water reuse at European Union (EU) level. 
Despite the fact that no guidelines or regulations 
yet exist, several member states or autonomy regions 
in EU have now published their own standards or 
regulations [57]. 


The water quality criteria and guidelines on the 
reclaimed water use for miscellaneous purposes in 
Japan were established by the Ministry of Land, Infra¬ 
structure and Transportation since 1981. Any guideline 
or standard should be dynamic in nature and subjected 
to revision as new scientific information becomes avail¬ 
able. The standard and guideline for reclaimed water 
quality in Japan were established for four respective 
miscellaneous applications in 2005 [58]. Water quality 
standard for rainwater and treated gray water under the 
Building Sanitization Law were amended in 2004. The 
latest values for water quality of reclaimed water, used 
for miscellaneous purposes, and water quality of rain¬ 
water and treated gray water are summarized in Table 2. 
These standards help to enforce the water quality stan¬ 
dards for reclaimed water and also help to ensure the 
restrictions for its safe use. Primarily, the reclaimed 
water should be utilized securing hygienic safety to 
protect the public health. The usual practice is that 
the greater the contact or potential for contact with 
humans, the greater the risk and hence the more exact¬ 
ing the water quality requirements. Therefore, where 
there is likely to be more human contact with reclaimed 
water, advanced treatment and high level of disinfec¬ 
tion are needed to minimize health risks [46]. In the 
case of human-related uses, pathogens including virus 
should be considered as quality guideline parameters. 
The introduction of virus parameter in the reclaimed 
water quality standard in Japan has been discussed in 
the Explorer Committee for the standard establish¬ 
ment, but has not been included because of absence 
of commonly available detection method of viruses 
[37]. 

These guidelines and standards propose on mini¬ 
mum standards for reclaimed water and draw risk 
management protocols to protect the public health 
and environment. Implementation of sustainable 
water use will likely require combination of policy/ 
regulation and technology and participation of general 
public. However, general public’s perception on alter¬ 
native water sources such as reclaimed water or rain¬ 
water is influenced by attitudes, emotions, and their 
accepted practice. Therefore, it is important to provide 
water quality information from trusted scientific and 
regulatory sources in a simple manner so that general 
public can easily understand the quality of different 
water resources. 
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Community-Owned Water Resource and Climate Change, Quality Management. Table 2 Water quality standards and 
guidelines for reclaimed water, rainwater, and treated gray water for miscellaneous use in Japan 



Guidelines for reclaimed water quality 


Standards for rainwater and 
treated gray water quality 3 


Toilet 

flushing 

Sprinkling 

water 

Landscaping 

water 

Amenity water 

Toilet 

flushing 

Sprinkling/ 
landscaping/ 
cleaning water b 

Escherichia coli 

Not 

detectable 

Not 

detectable 

<1,000 
CFU/100 mL 
(tentative 
value) 

Not detectable 

Not 

detectable 

Not detectable 

pH 

5.8-8.6 

5.8-8.6 

5.8-8.6 

5.8-8.6 

5.8-8.6 

5.8-8.6 

Appearance 

Not be 
distasteful 

Not be 
distasteful 

Not be 
distasteful 

Not be distasteful 

Almost 
clear and 
colorless 

Almost clear and 
colorless 

Turbidity 

(degree) 

(Control 
target) <2 

(Control 
target) <2 

(Control 
target) <2 

<2 

- 

<2 

Chromaticity 

(degree) 

C 

C 

<40 d 

<10 d 

- 

- 

Odor 

Not be 
distasteful 0 

Not be 
distasteful 

Not be 
distasteful 

Not be distasteful 0 

Not be 
abnormal 

Not be abnormal 

Residual 
chlorine (mg/L) 

(Control 
target) free: 
>0.1 or 
combined: 
>0.4 

(Control 
target) 6 free: 
>0.1 or 
combined: 

>0.4 

Not be 
stipulated 

(Control target) 6 
free: >0.1 or 
combined >0.4 

Free: >0.1 

or 

combined: 

>0.4 

Free >0.1 or 
combined: >0.4 

Facility 

standards 

Sand filtration 
or its 

equivalent 

Sand filtration 
or its 

equivalent 

Sand 

filtration or its 
equivalent 

Coagulation and 
sedimentation + 
sand filtration or its 
equivalent 




a The standard values stated by the Building Sanitation Laws. 

b For this purpose, raw water should not include human wastes. 

c Based on the intent of the users, standard values shall be set as required. 

d Based on the intent of the users, more stringent value shall be set as required. 

e This guideline value is not applied if disinfection is not required. 


Evaluation of Water Quality in Community- 
Owned Water Resources 

Community-owned water resources can be an attrac¬ 
tive solution for many regions as a competitive option 
to increase water availability and adapt to water short¬ 
age due to climate change. Reclaimed water and rain¬ 
water have been receiving growing interests as 
community-owned water resources as a substitution 
for conventional water resources. Water quality is the 
most important issue in such a community-owned 


water system that determines the acceptable and safety 
use for a given application. However, general public 
still fears or hesitates to use this kind of alternative 
water sources as they usually do not understand the 
quality difference among different types of waters. 

Public Perception to Reclaimed Water and Rainwater 

Successful execution of a reclaimed water project 
depends on several aspects such as economical envi¬ 
ronmental feasibility and as well as the social aspects 
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related to using reclaimed water. In addition to the 
scientific and engineering-based factors, public accep¬ 
tance can be considered as the key to cause a reclaimed 
water system in a community to fail or succeed, 
because general public has to be willing to pay for 
such a reclaimed system [59]. Reclaimed water or run¬ 
off water system may face public opposition resulting 
from a combination of prejudiced beliefs, fear, atti¬ 
tudes, lack of knowledge, and distrust. 

The main public objection is related to the quality 
of the reclaimed water. Because the primary source of 
recycled water is municipal wastewater that carries 
what humans excrete and discharge to the drain from 
sources such as toilet, bathroom, kitchen, and laundry, 
or miscellaneous dumps of household, a psychological 
objection to close contact frequently occurs [38, 61]. 
This has prompted the community to always be 
concerned about water quality. However, general pub¬ 
lic often does not understand the quality difference 
among different types of possible water resources and 
hence many communities do not have a positive image 
on alternative water sources such as reclaimed water. 
Lack of knowledge on the water quality of water sources 
causes people to hesitate to use water sources such as 
reclaimed water or rainwater. Figure 12 shows a survey 
result of public opinion toward reclaimed water and 
rainwater harvesting in Tokyo. The survey was 
conducted in 2008 with 1,839 randomly selected 
Tokyo residents. In general, results revealed that 86% 
of Tokyo residents are positive to use reclaimed water 
or rainwater. However, majority of them (59%) are 
ready to use reclaimed water or rainwater if only 
water quality is good enough for the intended applica¬ 
tion [60]. Therefore, public awareness of water quality 


information of these alternative water sources plays 
a key role in sustainable water use project. 

Kantanoleon et al. also emphasized that there is 
a major knowledge gap affecting the public perception 
of wastewater nature, treatment, and reuse [62]. This 
public’s fears related to water quality can be overcome 
with clear and adequate scientific explanations in terms 
of benefits on the use of reclaimed water from wastewa¬ 
ter or rainwater. The challenge is how to share scientific 
knowledge such as water quality information with pub¬ 
lic and communicate with them in simple manner. 

Development Method of Water Quality Information 
Platform 

One of the difficult tasks that environmental engineers 
are facing is how to transfer complex water quality 
(WQ) data into information that is understandable 
and useful to technical and policy persons as well as 
the general public. Therefore, it is important to develop 
a simple way to represent the difference of WQ among 
various water resources available in a community, 
including reclaimed water. The objective of this section 
is to propose a methodology to develop a WQ infor¬ 
mation platform. The WQ information platform pro¬ 
vides WQ criteria reference to be compared among 
different waters using WQ labeling and ranking. The 
WQ labeling and ranking is a better tool to evaluate 
WQ level of possible water resources available in 
a community including reclaimed waters. 

The general practice is to describe WQ of water 
sources along with official guidelines on the basis of 
different WQ parameters. Although these traditional 
practices consist of bunch data and information, the 


c 


Total positive, 86.4 % Total negative, 12.1% 


1 1 

1 l 

1 \ 

Wiiling to use, 27.8% 

Ready to use if the water quality is 
good enough, 58.6% 

No idea, 1.4% 

Reluctant 
to use, 
9.3 % 

|l\lot at all, 2.8% 


t 

Other, 0.1% 


Community-Owned Water Resource and Climate Change, Quality Management. Figure 12 

Public opinion on reclaimed water and rainwater in Tokyo [60] 
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general public has neither the inclination nor the train¬ 
ing to study these details. The new proposed WQ 
information platform reduce the multivariate nature 
of WQ data by providing scoring system that will 
logically combine all WQ measures and provide 
a simple and easily understood description of different 
waters [27]. The proposed WQ evaluation method is 
based on five steps: 

1. Data Gathering: Selecting WQ parameters of water 
sources and collecting WQ standard values 

2. WQ Scoring: Determining five-level scores based on 
scientific evidences such as drinking and ambient 
WQ standards 

3. WQ Characterization: Characterizing WQ using 
multiple scores based on scientific evidence 

4. Water Use Ranking: Ranking by water availability 
and WQ characterizing information for several 
water usage 

5. WQ Labeling and User judgment: Judging the type 
of water usage such as gardening, toilet flushing, car 
washing, landscape, etc. 

This framework can accurately evaluate and sum¬ 
marize WQ data in a way that should be scientifically 
accepted and easily understood by both professionals 
and the public. Relative evaluation of WQ of different 
water sources is beneficial rather than absolute 
values of WQ parameter. If the relative evaluation is 
conducted on familiar water sources to community 
such as well-known river waters, it is easy for citizens 
to understand the WQ levels of community-owned 
water resources. Thus, this kind of WQ information 
platform will be useful for general public to change 
their attitude and behavior toward the quality of 
reclaimed water system. The evaluation is also a useful 
tool for describing the WQ state of the different water 
sources and for ranking the suitability of waters for 
different usage. The next entry will provide details of 
WQ information platform development using a case 
study in Japan. 

Characterizing and Ranking of Different Water 
Resources - A Case Study on WQ Information 
Platform in Japan 

The ultimate aim of developing WQ information 
platform is to develop effective management and 


monitoring system for community-based water reuse 
system. The WQ information platform provides a WQ 
reference for a community to evaluate reclaimed waters 
or any alternative water sources. Therefore, WQ infor¬ 
mation platform should integrate the significant phys¬ 
icochemical and biological constituents of different 
water resources available for a community and present 
them in a simple and scientifically defensible way. The 
case study in Japan explains five steps to develop 
a nationwide WQ information platform that indicates 
the ranking and labeling of water resources. The steps 
contained in this case study can be used as a guide for 
a process of evaluation of WQ in any community or 
a region, depending on the needs or objectives of that 
particular community. In this case study, WQ of 
advanced treated wastewater, runoff waters, and 
groundwater was evaluated and compared with the 
first-class rivers in Japan, in terms of their usage. 

Data Gathering 

Appropriate assessment of several WQ data is essential 
to develop WQ information platform. The first 
important step is to make WQ reference to be com¬ 
pared with WQ level of different water sources. In this 
study, river waters were selected as reference water 
resource, because surface water is the major source 
of water supply in Japan. Out of 109 rivers through¬ 
out Japan that are designated as “the first-class” by 
the River Bureau Ministry of Land Infrastructure 
Transport and Tourism, 37 rivers were selected to 
develop the reference profile [63]. Figure 13 shows 
the selected rivers and sample locations, which repre¬ 
sent nine administered regions in Japan. Water samples 
were collected during dry season from the most down¬ 
stream section. 

To confirm the representativeness of the selected 
rivers, catchment areas, population density, concentra¬ 
tions of total nitrogen (T-N) and total phosphorus 
(T-P) of the selected 37 rivers of this study and the 
annual average of 109 rivers based on monthly moni¬ 
toring were compared [64]. Figure 14 shows that the 
distributions of catchment areas, population density, 
T-N and T-P concentrations in this study agreed well 
with their annual averages. Therefore, the selected riv¬ 
ers could be regarded as being representative, and 
hence can be used as WQ criteria reference. 
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Rumoi River 
Ishikari River 


Yura River 


Saru River 
Yoneshiro River 

Aka River 
Ara River 




Kita River 
Yamato River 
Asahi River 

Ashida River 
Gouno River 
Oze River^ 

Chikugo River 

Honmyo River 

Oono River 
Kimotsuki River 


Tokoro River 

Naruse River 
Kitakami River 


Tone River 

Tama River 

Tsurumi River 


Shounai River 


Yodo River 

'Monobe River \ yoshino River 
Doki River 
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Selected rivers in Japan for sampling in this study 

The first stage was to select WQ parameters that will 
be used to evaluate the water resources. In order to 
develop the WQ information platform in this study, the 
data related to several chemical analysis and estrogenic 
activity (bioassay) as shown below were used. It is 
better to use assessment using multiple bioassay tests 
in more advanced study. However, in this case study, 
estrogenic activity alone was used as bioassay. 

Chemical analysis: 

• Organic matter (TOC, COD, BOD) 

• Nitrogen (TN, N0 2 , N0 3 , NH 4 ) 

• Phosphorus (TP, P0 4 ) 

• Metals (Zn, Cu, Mn, Ni, Pb, Cd, As, etc.) 

• Endocrine disrupting chemicals (El, a-E2, (3-E2, 
E3, BPA, NP, OP) 

• Polycyclic aromatic hydrocarbons (benzo(a) 
pyrene, etc.) 

• Perfluorinated surfactants (PFOS, PFOA, etc.) 

• Pharmaceuticals and personal care products (ibu- 
profen, carbamazepine, triclosan, etc.) 

• Synthetic detergents (LAS, SPC, etc.) 


Quality Management. Figure 13 

Bioassays: 

• Estrogenic activities 

Only use of traditional chemical parameters is not 
sufficient for WQ assessments, since reclaimed water 
contains complex mixtures of trace contaminants. 
Emerging organic and microbial contaminants usually 
represent major concerns in water reclamation as they 
pose either acute or chronic health impacts to the 
consumers. Therefore, apart from the typical chemical 
WQ parameters of organic matters, nitrogen and phos¬ 
phorus, other micropollutants such as heavy metals, 
endocrine disrupting chemicals (EDCs), polycyclic 
aromatic hydrocarbons (PAHs), perfluorinated surfac¬ 
tants (PFSs), pharmaceuticals and personal care prod¬ 
ucts (PPCPs), and synthetic detergents were also taken 
into account for developing WQ information platform 
[63, 65-68]. It is always better to use multiple bioassay 
data along with chemical data, in more advanced 
assessment study. Bioassays are important for effective 
integrated assessment of WQ. Because chemical 
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O Selected 37 rivers • 109 rivers 



T-N concentration (mgN/L) T-P concentration (mgP/L) 

Community-Owned Water Resource and Climate Change, Quality Management. Figure 14 

Comparison of catchment areas, population density, T-N concentrations, and T-P concentrations between 109 rivers and 
selected 37 rivers 


analysis gives only quantitative statements for a limited 
number of chemicals, bioassays such as estrogenic 
activities, algal growth inhibition, algal growth poten¬ 
tial, Microtox, and mutagen formation potential 
(MFP) are required for an integrated assessment of 
effects or toxic potentials of chemical substances. How¬ 
ever, this case study will be explained only with the data 
of estrogenic activities along with chemical data, 
because bioassay data are not so easily available in 
most cases. 

The selection of parameters to evaluate the water 
resources depends on the usage that is intended. If the 
water usage is only limited for purposes such as toilet 


flushing, the coloring and appearance are also focused 
on to develop the WQ information platform. However, 
additional parameters are recommended when the WQ 
information platform is being used as a decision tool 
for more advanced water usage such as amenity pur¬ 
pose with human contact. 

Subsequently, WQ references of each parameter 
were represented as percentile distributions. Percentiles 
are values that divide a set of observations into 100 
equal parts. The percentile rank is the proportion of 
values in a distribution that a specific value is greater 
than or equal to. Therefore, the lower percentile rank¬ 
ings here indicate the better WQ while higher percentile 
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rankings indicate poorer WQ. Thus, the Oth and 100th 
percentiles describe the best and worst of the WQ 
distribution, respectively, while the median or 50th 
percentile describes the WQ value at the middle of the 
distribution. These percentile distributions can be used 
for WQ criteria reference to be compared with different 
water resources such as reclaimed waters. In this study, 
different types of water resources; infiltrated water of 
road runoff [69] secondary treated effluent [70], 
advanced treated wastewater by sand filtration [70], 
infiltrated water of secondary treated effluent 
[71, 72], advanced treated wastewater by ozonation 
[70], and groundwater [68, 73] were compared with 
the river WQ references. 

Figure 15 shows the example of percentile distribu¬ 
tion of TN and two PPCP parameters (ibuprofen and 
carbamazepine) of 37 rivers in Japan. The figure also 
compares the secondary effluents, sand filtered efflu¬ 
ents, and ozonated effluent of WWTP with the refer¬ 
ence river WQ. As in this example of Fig. 15, there is no 
significant difference in TN among three types of 
treated wastewater effluents and value is equal to 
100th percentile rank of the selected rivers. Ibuprofen 
concentration in ozonated effluent is equal to 35% of 
the percentile distribution. The 35 th percentile 
describes a value (or 0.5 ng/L of ibuprofen) for which 
around 35% of selected rivers in the distribution are 
equal to, or lower than, ibuprofen concentration in the 
ozonated effluent. In the case of carbamazepine, the 
percentile rank of the ozonated effluent is 20%, 


indicating 80% of the river water contains higher con¬ 
centration of carbamazepine than the ozonated efflu¬ 
ent. Since no WQ standards are set so far for emerging 
contaminants such as PPCP, this kind of comparison 
with known river waters can be used to reflect the WQ 
condition of the relevant water source. 

Water Quality Scoring 

Once the WQ references have been developed as the 
percentile distribution, the next step is the scoring 
process. The first stage of this step is to divide the 
WQ percentile distribution of rivers into six factors 
according to the criteria of each WQ parameters, 
which describe the state of WQ based on scientific 
evidence such as drinking and ambient WQ standards. 
Then, the other water resources were compared with 
reference scale to determine their scores. 

All the WQ parameters available for chemical anal¬ 
ysis and bioassay were classified into six different WQ 
factors, such as: 

WQ factors based on chemical analysis: 

1. Coloring and appearance 

2. Eutrophication 

3. Conservation of aquatic ecosystem 

4. Chronic toxicity to human 

5. Acute toxicity to human 

WQ factors based on bioassay: 

6. Estrogenic activity 
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Community-Owned Water Resource and Climate Change, Quality Management. Figure 15 

Comparison of the TN, PPCP (ibuprofen and carbamazepine) of secondary effluents, sand filtered effluents, and ozonated 
effluent with those of percentile distribution of reference river waters 
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These six factors are combined to form the score 
matrix, which can fall into one of the five level score 
index. Although six different factors were used in this 
study, the number of factors may depend on the avail¬ 
ability of data and purpose of WQ evaluation. The 
factors based on chemical analysis were scored 
according to the WQ standard or guideline, while the 
factors based on bioassays were scored according to 
the potential effect of biological and on ecosystems. 
The five level scores index, which gives a number 
between 0 and 4, was used, with 0 being the best and 
4 indicating the poorest WQ within the distribution. 
The score “0” implies the water is safe with a virtual 
absence of threat or impairment. The score “1” implies 
there is a minor degree of threat or impairment and 
continuous monitoring is needed for that particular 
water. The score “2” to “4” means the water is threat¬ 
ened or impaired in terms of particular WQ parameter 
and some action is needed to improve the WQ. 
Depending on the degree of action level, scores were 
given as 2, 3, or 4. 

The example of the Fig. 16 shows the score distri¬ 
bution for Zn concentration considering the WQ factor 
for “conserving of aquatic ecosystem” within the same 
percentile ranking in the first-class rivers in Japan. 
Since 0.03 mg/L is the guideline value in the rivers for 



Zn concentrations (mg/L) 


Community-Owned Water Resource and Climate 
Change, Quality Management. Figure 16 

Distribution of Zn concentrations and their scores in the 
first-class rivers based on the WQ factors for conservation 
of aquatic ecosystem 


conservation of aquatic ecosystem in Japan, less than 
0.003 mg/L of Zn in waters is regarded as score 0, 
0.003-0.03 mg/L as score 1, 0.03-0.3 mg/L as score 2, 
0.3-3 mg/L as score 3, and more than 3 mg/L as score 4. 
The waters of score 3 represent the water even after ten 
times dilution is threatened from the view point of 
conservation of aquatic ecosystem. Similarly, scores 
can be given to all other parameters related to each 
WQ factors. 

Table 3 summarizes the example of WQ thresholds 
value for WQ factors and their scores with respect to 
the relevant WQ parameters. In this way, once all the six 
factors have been divided into five levels index in the 
reference distributions, scores of different types of 
water resources could be determined. Table 4 shows 
the example of final score matrix of different water 
resources. 

Water Quality Characterization 

This is the step to summarize the scores given to the 
each water resource in the previous step. Character¬ 
ization can be considered as similar to a WQ label 
in a drinking or mineral water bottle. As several 
WQ parameters (nitrogen, fluoride, etc.) are seen 
in a water bottle to show the quality of water, five 
WQ factors (coloring, eutrophication, acute toxic¬ 
ity, etc.) were used in this study to characterize the 
different water resources including reclaimed 
waters. Based on scores in Table 4, characterizing 
indexes were calculated. In the case of the WQ factor 
of “Acute toxicity to human,” the characterizing 
indexes were determined by taking the maximum 
score value, while the index values of all the other five 
factors might be determined by taking the average of 
each score under a particular WQ factor. It is also 
possible to give weighting coefficients to WQ parame¬ 
ters of each factor. 

In order to understand the WQ characterizing 
indexes easily, radar charts (also known as spider 
charts) were developed to express characterizing 
WQ graphically. In the radar charts, each axis repre¬ 
sents the WQ factor developed at the step 2 of WQ 
Scoring. The six factors are plotted along the chart 
axes from the center of the chart. For a given WQ 
factor, score of “4” is located in the center and “0” at 
the edge. Lines connect the axes at the position of 
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Community-Owned Water Resource and Climate Change, Quality Management. Table 3 Example of water quality 
thresholds value for scoring 



Unit 

Score 0 

Score 1 

Score 2 

Score 3 

Score 4 

Coloring and appearance 

pH 

- 

5.8-8.6 

4.8-5.8, 8.6-9.6 

3.8-4.8, 9.6-10.6 

2.8-3.8, 10.6-11.6 

<2.8, >11.6 

TOC 

mg/L 

<0.5 

0.5-5 

5-50 

50-500 

>500 

Cu 

mg/L 

<0.1 

0.1-1 

1-10 

10-100 

>100 

Zn 

mg/L 

<0.1 

0.1-1 

1-10 

10-100 

>100 








Conservation of aquatic ecosystem 

nh 4 

mgN/L 

<0.02 

0.02-0.2 

0.2-2 

2-20 

>20 

Zn 

mg/L 

<0.003 

0.003-0.03 

0.03-0.3 

0.3-3 

>3 

Triclosan 

ng/L 

<6.3 

6.3-63 

63-630 

630-6,300 

>6,300 








Estrogenic activity 

Yeast estrogen screen assay 

ng-E2/L a 

<1 

1-10 

10-100 

100-1,000 

>1,000 


a E2 (17|3-estradiol) equivalent concentration. 


Community-Owned Water Resource and Climate Change, Quality Management. Table 4 Example of scores in several 
water resources 



100% 
percentile 
rank river 

50% 

percentile 
rank river 

0% 

percentile 
rank river 

Groundwater 

Wastewater 
treated by 
sand 
filtration 

Wastewater 
treated by 
ozonation 

Infiltrated 
water of 
secondary 
wastewater 

Infiltrated 
water of 
road 
runoff 

Coloring and appearance 

pH 

1 

0 

0 

0 

0 

0 

0 

0 

TOC 

2 

2 

1 

2 

1 

1 

1 

1 

Cu 

0 

0 

0 

0 

0 

0 

0 

0 

Zn 

1 

0 

0 

0 

0 

0 

0 

0 










Conservation of aquatic ecosystem 

nh 4 

2 

1 

0 

0 

1 

1 

1 

0 

Zn 

2 

1 

0 

0 

2 

2 

1 

1 

Triclosan 

1 

1 

0 

0 

2 

1 

1 

0 










Estrogenic activity 

Yeast 

estrogen 

screen 

assay 

2 

0 

0 

0 

1 

0 

0 

0 
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each data object, forming a spiral around the center. 
The larger the area of the spiral, better the WQ of 
a particular water source. 

Figure 17 shows the radar chart related to Oth, 50th, 
and 100th percentile rank of the first-class rivers in 
Japan as well as groundwater, ozonated wastewater 
effluent, and infiltrated water of road runoff. According 
to the graphical representation, overall WQ of ground- 
water is roughly equal to 50th percentile rank of the 
first-class rivers. Radar charts of ozonated wastewater 
effluent and infiltrated water of road runoff depict that 
their overall WQ is between 50th and 100th percentile 
rank of the first-class rivers. 

However, each WQ factor should be carefully 
evaluated in the viewpoint of the water usage pur¬ 
pose. For example, ozonated wastewater effluent 
shows poorer WQ for eutrophication than the 100% 
percentile rank river. This indicates that the ozonated 
wastewater effluent in this case is not suitable to apply 
to water usage such as landscaping pond water, in 
which the eutrophication would be a possible impair¬ 
ment factor. 


Water Use Ranking 

The aim of this step is to integrate the characterized 
water resources with the objectives of water usage. 
Since water for amenity, landscaping, toilet flushing, 
and fire protection has different WQ requirements, 
suitability of water resources depends on the usage for 
which it is intended. Under this step, the characterized 
values of WQ factors were used to provide a water use 
ranking as “good,” “fair,” or “not applicable” for differ¬ 
ent water usages. Table 5 describes an example of 
criteria to judge the ranking for water usage. It is not 
necessary to consider all the factors for each water 
usage. As in the Table 5, for amenity water usage only, 
four factors were considered, while six factors were 
considered in water for landscaping usage. 

Table 6 shows an example for ranking of different 
water usage of ozonated wastewater effluent in terms of 
considered WQ factors. Regarding the WQ factor of 
“acute toxicity to human,” ozonated wastewater efflu¬ 
ent can be ranked as “good” for most of water usage, 
but “fair” for water for emergency usage, in which 


Coloring/ 

appearance 


Estrogenic 

activity 


Conservation of 
aquatic ecosystem 

Eutrophication 

100% percentile rank river 


Coloring/ 

appearance 


0 


Estrogenic i 

>0 



Estrogenic 


Acute toxicity 

activity 



Acute toxicity 

activitv^^ 


to human 


2 


1 to human 



3^ 



3^ 






Chronic toxicity 

Conservation of ^ 


Chronic toxicity 

Conservation of^\. 


to human 

aquatic ecosystem 


to human 

aquatic ecosystem 


Coloring/ 

appearance 


Chronic toxicity 
to human 


Eutrophication 

50% percentile rank river 


Eutrophication 

0% percentile rank river 


Estrogenic 

activity 



Conservation of 
aquatic ecosystem 

Eutrophication 

Groundwater 


Acute toxicity 
to human 


Chronic toxicity 
to human 


Estrogenic 

activity 



Conservation of 
aquatic ecosystem 

Eutrophication 


Acute toxicity 
to human 


Chronic toxicity 
to human 



Conservation of 
aquatic ecosystem 


Chronic toxicity 
to human 


Wastewater treated by ozonation 


Eutrophication 

Infiltrated water of road runoff 


Community-Owned Water Resource and Climate Change, Quality Management. Figure 17 

Example of WQ characterization of several water resources 
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Community-Owned Water Resource and Climate Change, Quality Management. Table 5 Example of criteria for 
several water applications 



Coloration/ 

appearance 

Acute 
toxicity to 
human 

Chronic 
toxicity to 
human 

Eutrophication 

Conservation of 
aquatic ecosystem 

Estrogenic 

activity 


Amenity water 

>2: n.a. 

>4: n.a. 

>3: n.a. 

>3: n.a. 




1-2: fair 

3-4: fair 

2-3: fair 

2-3: fair 

<1: good 

<3: good 

<2: good 

<2: good 

Water sprinkling 
and car wash 

>3: n.a. 

>4: n.a. 

>4: n.a. 





2-3: fair 

3-4: fair 

3-4: fair 

<2: good 

<3: good 

<3: good 

Toilet water 

>3: n.a. 

>4: n.a. 

>4: n.a. 





2-3: fair 

3-4: fair 

3-4: fair 

<2: good 

<3: good 

<3: good 

Landscaping 

water 

>3: n.a. 

>4: n.a. 

>4: n.a. 

>3: n.a. 

>3: n.a. 

>3: n.a. 


2-3: fair 

3-4: fair 

3-4: fair 

2-3: fair 

2-3: fair 

2-3: fair 

<2: good 

<3: good 

<3: good 

<2: good 

<2: good 

<2: good 

Water for 
emergency use 

>2: n.a. 

>2: n.a. 

>3: n.a. 





1-2: fair 

1-2: fair 

2-3: fair 

<1: good 

<1: good 

<2: good 

Water for fire 

>4: n.a. 

>4: n.a. 

>4: n.a. 

- 

- 

- 


protection 

3-4: fair 

3-4: fair 

3-4: fair 






<3: good 

<3: good 

<3: good 






n.a. not applicable, - not considered 


household water use is expected, including toilet flush¬ 
ing, washing, and bathing. 

Water Quality Labeling and User Judgment 

The last but the most important step is the judgment of 
water usage by users using WQ labeling information. 
In the previous step, each water resource was ranked 
with respect to the each factor such as “coloration,” 
“acute toxicity,” etc. The aim of this step is to decide the 
overall judgment for water usage of each water 
resources. As shown in Table 6, use of ozonated waste- 
water effluent for amenity purpose has been ranked as 
“good” for WQ factors of coloration, acute toxicity to 


human, chronic toxicity to human, but been ranked as 
“fair” for WQ factor of eutrophication. Therefore, 
overall WQ rank of ozonated wastewater effluent for 
amenity purpose was labeled as “fair” taking the low¬ 
est rank in each factor. Similarly, all the water 
resources subjected to evaluation were ranked and 
labeled in Table 7. 

As summarized in Table 7, water usage labeling 
evaluates each water source and simply describes 
“how good or how bad” water sources are for different 
water applications. The outcome of this labeling infor¬ 
mation provides easy expression to compare WQ of 
different water resources directly with those of river 
WQ reference. Hence, it leads to overcome public 


2343 


c 

































2344 



Community-Owned Water Resource and Climate Change, Quality Management 


Community-Owned Water Resource and Climate Change, Quality Management. Table 6 Example for ranking of 
wastewater treated by ozonation for several water applications 



Coloration/ 

appearance 

Acute 
toxicity to 
human 

Chronic 
toxicity to 
human 

Eutrophication 

Conservation 
of aquatic 
ecosystem 

Estrogenic 

activity 


Example of 

user 

judgment 

Amenity 

water 

Good 

Good 

Good 

Fair 

- 

- 


Fair 

Water 
sprinkling 
and car wash 

Good 

Good 

Good 





Good 

Toilet water 

Good 

Good 

Good 

- 

- 

- 


Good 

Landscaping 

water 

Good 

Good 

Good 

Fair 

Good 

Good 


Fair 

Water for 
emergency 

use 

Good 

Fair 

Good 





Fair 

Water for fire 
protection 

Good 

Good 

Good 

- 

- 

- 


Good 


not considered. 


Community-Owned Water Resource and Climate Change, Quality Management. Table 7 Example of labeling of 
several water resources 



Amenity 

water 

Water sprinkling 
and car wash 

Toilet 

water 

Landscaping 

water 

Water for 
emergency use 

Water for fire 
protection 

100% percentile rank 
river 

Fair 

Good 

Good 

Not 

applicable 

Not applicable 

Good 

50% percentile rank river 

Good 

Good 

Good 

Fair 

Fair 

Good 

0% percentile rank river 

Good 

Good 

Good 

Good 

Good 

Good 

Groundwater 

Good 

Good 

Good 

Good 

Fair 

Good 

Wastewater treated by 
sand filtration 

Fair 

Good 

Good 

Fair 

Fair 

Good 

Wastewater treated by 
ozonation 

Fair 

Good 

Good 

Fair 

Fair 

Good 

Infiltrated water of road 
runoff 

Good 

Good 

Good 

Fair 

Fair 

Good 

Infiltrated water of 

wastewater 

Fair 

Good 

Good 

Fair 

Not applicable 

Good 


uncertainty in the use of alternative water sources 
(reclaimed water, runoff, etc.). The concept and pro¬ 
cess described in this case study can be used as 
a guide for WQ evaluation of community-based water 


resources. It should be mindful that required WQ 
might depend on the needs or objectives of that partic¬ 
ular community and should be combined with effective 
management and monitoring system. 
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Although the WQ labeling framework indicates the 
relative WQ of a particular water source and can be 
used to compare and rank the water sources, the WQ 
labeling should never stand alone for the full interpre¬ 
tation of the feasibility of a water source. Therefore, in 
order to optimize the community-owned water use, 
it is beneficial to analyze both the quality and the 
quantity of water resources against potential reuse 
applications and WQ requirements. Moreover, energy 
consumption for water intake, purification, and distri¬ 
bution should be further considered to compare an 
existing water supply system and community-owned 
water resources [27]. 

Future Directions 

It is apparent that climate change is already happening 
and going to be one of the greatest environmental, 
social, and economic threats facing the whole world. 
Since the water is involved in all components of the 
climate system, including atmosphere, hydrosphere, 
cryosphere, land surface, and biosphere, climate 
change affects water through many ways. Therefore, 
the climate change has the potential to be affecting 
the quantitative and qualitative status of global fresh¬ 
water resources by altering hydrological cycles with 
wide-ranging effects on human societies and ecosys¬ 
tems. It is also anticipated that growing pressure on 
water resources will stimulate by the population 
growth, urbanization, land use changes, and some 
other drivers. 

Various proposals have already been made to help 
for mitigation and adaptation measures focusing par¬ 
ticularly on impacts of climate change on water 
resources. As part of adaptation against climate change 
effect on water resources, the potential use of rainwater 
and reclaimed water, which can be found within 
a community, is an attractive solution for many regions 
to meet the increase in water demand. However, pub¬ 
lic’s perception on the community-owned water 
resources is influenced by attitudes, emotions, and 
accepted practice, and hence makes general public to 
hesitate to use them due to the lack of knowledge or 
complexity of understanding the WQ. 

Therefore, it is important to provide WQ informa¬ 
tion from trusted scientific and regulatory sources in 
a simple manner so that general public can easily 


understand the quality of different water resources. 
The proposed WQ information platform in the last 
section reduces the multivariate nature of WQ data by 
providing scoring system that will logically combine all 
WQ information and provide a simple and easily 
understood description of different waters. The con¬ 
cept and process in this evaluation will assess each 
water resource and simply describes “how good or 
how bad” water resources are for different water appli¬ 
cations. However, it should be mindful that required 
WQ might depend on the needs or objectives of 
a particular community and should be combined 
with effective management and monitoring system. 
Moreover, quantitative analysis of water resources and 
energy consumption for water intake, purification, and 
distribution should be further considered to compare 
an existing water supply system and community- 
owned water resources. 
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Glossary 

Absorber Device which absorbs concentrated solar 
radiation and produces heat. 

Black body Ideal body which would absorb all of the 
radiation falling upon it. 

Blocking, blocking losses Power which is lost due 
to interception of part of reflected sunlight 
from one heliostat by the backside of another 
heliostat. 

Capacity factor Energy produced over a specified time 
interval. 

Central receiver system, concentrating receiver 
system (CRS) Solar power plant system in which 
solar radiation is converted by a heliostat field onto 
a tower-mounted solar receiver. 

Concentration ratio Ratio of radiant flux density out¬ 
put to radiant flux density input. 

Direct normal irradiation (DNI) Direct part of 
energy carried by sun rays on a given area. 
Dispatchability, dispatchable Ability to dispatch on- 
demand produced electricity to the grid. 


Heat recovery steam generator (HRSG) A conven¬ 
tional power plant component which produces steam. 
Heat transfer fluid (HTF) in a CRS Fluid used to 
absorb heat in the solar central receiver and to 
transport and to deliver it to another component 
of a power plant (e.g., Heat recovery steam genera¬ 
tor, storage) 

Heliostat Device consisting of an assembly of mirrors, 
support structure, drive mechanism, and mounting 
foundation, which follows continuously the sun in 
two dimensions and reflects the sun rays to a fixed 
direction. 

Heliostat field Sum of all heliostats of a CRS. 

Hybrid system Any energy system which operates on 
two or more energy input sources, or which pro¬ 
vides more than one form of energy output. 
Parasitic power, parasitic losses Power required to 
operate an energy conversion system (e.g., pumps, 
buildings, blowers, electric devices). 

Radiation Emission and propagation of electromag¬ 
netic energy through space or material. 

Receiver efficiency Ratio of the thermal output deliv¬ 
ered by the receiver HTF to the incident solar radi¬ 
ant flux under reference conditions. 

Receiver, solar receiver Radiation absorbing system 
that works like a heat exchanger as it accepts solar 
radiation and delivers heat to a HTF. 

Spillage, spillage losses Fraction of concentrated solar 
radiation which misses the absorber of the solar 
receiver. 

Thermocline Zone or layer in a thermal storage vol¬ 
ume in which the vertical temperature profile 
changes rapidly. 

Definition of the Subject and Its Importance 

This contribution is an attempt to write an original 
comprehensive yet concise introduction to concen¬ 
trated solar receiver technologies, with the specific 
aim to find ways to solve the rising energy demand 
with using clean energy. 

Among others in the world today there are two huge 
problems: climate change and rising energy demand. 
Solar-thermal power plants constitute a significant 
technological option for a sustainable future energy- 
mix. Solar-thermal energy is booming all around the 
world as many power plants are being built or planned. 
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Since the decade of the 1980s power production 
with concentrated solar tower power plants, as for 
example solar towers, has been a way to substitute fossil 
fuels. 

From the beginning of the 1980s until the end of the 
1990s many research activities in the field of solar tower 
technology took place in North America and Europe. 
The experience gained from these activities provided 
the necessary conditions for commercialization and 
creation of the first solar tower power plants. Pioneers 
in establishment and development of this technology in 
Europe are Germany and Spain. Different solar tower 
technologies concerning the concentrator, heat transfer 
media, and power cycle are competing. 

In order to raise the operation hours of a central 
receiver system (CSR) a combination with a storage 
system or an upgrade to a hybrid system by combining 
the solar operation with different components is pos¬ 
sible. This is a significant advantage over many other 
renewable energy technologies because heat is used as 
an intermediate energy carrier. In order to achieve 
‘round-the-clock, 100% renewable energy, CSP can be 
hybridized with biomass energy. For finding ways to 
solve a further important problem of the water short¬ 
age, desalination systems using the heat and the elec¬ 
tricity from solar power plants can be applied. 

The introduction depicts the huge solar thermal 
potential of the world’s Sunbelt. In the next section 
the advantages of solar tower technology are analyti¬ 
cally described. The third section of this article pre¬ 
sents, in a technical but reader-friendly way, the 
different components of CRS technologies. Although 
the main focus of this article is to describe this tech¬ 
nology and to present the installed solar plants (section 
“Examples of CRS Plants”), there is a diverse coverage 
from solar only operation (section “Providing Firm 
and Dispatchable Power”) to combination with storage 
systems and hybrid solar tower power systems (section 
“Increase of Operation Hours of CRS by Storage and 
Hybridization”). 

Section “Characteristics of Different Heat Transfer 
Fluids” emphasizes the characteristics of different heat 
transport fluids which are used in CRS. Section “Second¬ 
ary Uses of CRS” presents ways to use the heat provided 
by solar central receivers for other applications. 

The chapter closes with a short economical consid¬ 
eration and an overview of the future perspectives, 


emphasizing the potential of future technology and 
the cost estimations. 

The most immediate value of this contribution 
resides in providing important and up-to-date informa¬ 
tion for all readers who maybe interested in the growing 
field of concentrated solar power (CSP) and especially 
CRS technology. The article fills the need for a wide- 
coverage systematic description of the different systems 
as well as of different power plant configurations which 
enable firm and dispatchable energy production, thus 
helping the reader to understand the key technological 
advantages and disadvantages of the central receiver 
technologies. 

It is addressed to experts in engineering profes¬ 
sionals, academics and scientists, technicians, man¬ 
agers, private and public institutions, students, and 
common citizens. The article may even be adopted as 
a text contribution for college courses that deal with 
renewable energy focusing on solar thermal energy 
and/or sustainability. 

It includes tables and colored figures as well as 
equations which describe the thermodynamic behavior 
as well as the efficiencies of CRS technologies and 
applications. Especially, many colored pictures are 
selected in order to show almost all installed solar 
thermal power plants and test facilities worldwide. 
Complete literature is included at the end of the article 
to direct the curious and interested reader to further 
information. 

The main aim of the contribution is to find solu¬ 
tions for main global environmental problems, so that 
the energy future mix may be made more efficient, 
clean, and sustainable. 

Introduction 

In spring 1931, about half a year before his death, 
Thomas Alva Edison, US American inventor, had 
a conversation with Henry Ford and Harvey Firestone, 
in which he apparently said: “I’d put my money on the 
sun and solar energy. What a source of power! I hope 
we don’t have to wait till oil and coal run out before we 
tackle that.” 

This is not surprising since only about 1% of the 
Sahara (used for solar thermal power plants) is suffi¬ 
cient to cover the total current electricity demand of the 
earth (Fig. 1). 
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Potential of solar thermal power plants (Source: DLR) 


According to the International Energy Agency, solar 
thermal power stations are the cheapest option in 
MW-scale for generating electricity from direct solar 
radiation commercially and constitute a pillar for the 
long-term worldwide electricity supply. Contrary to 
other renewable energy technologies the application 
of large thermal storages or, as advised in this project, 
the combination with additional conventional compo¬ 
nents for the hybridization, enable the decoupling of 
solar input. Thus, solar thermal power stations can also 
be used for grid stabilization and a need-based power 
production. 

The parabolic trough, the solar dish, the Fresnel 
collector, and the solar tower belong to the group of 
solar thermal power systems. Parabolic trough and the 
solar tower are already competitive and economically 
feasible. 

In solar thermal tower power plants with nearly 
planar mirrors focus solar radiation and direct it onto 
a receiver, which is located on the top of a tower. Very 
high temperatures in the receiver, resulting from this 
concentrated solar radiation enable generation of 
power plant process steam. The steam can be expanded 


in a steam turbine and generate power. Replacing fossil 
fuels, this clean power generation technology solely 
makes use of highly concentrated sunlight to produce 
electricity. 

Advantages of the Use of CRS for Supply of 
Energy 

The development of solar tower power plants aims to 
use higher concentrating solar radiation compared to 
parabolic trough as the power plant process at higher 
temperature and therefore operates with better effi¬ 
ciency. Higher temperature is also an advantage for 
storage of thermal energy, as storage volume per unit 
of energy can be reduced. Short-term developments 
aim to couple the heat from the solar towers in steam 
power plants. 

Life cycle assessment of the emissions produced, 
together with the land surface impacts of CRS systems, 
shows that they are ideally suited to the task of reducing 
greenhouse gases and other pollutants, and without 
creating other environmental risks or contamination 
(Power 2020). Its low adverse environmental impact, 
with none of the polluting emissions or safety concerns 
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associated with conventional generation technologies is 
a major benefit of solar thermal tower power. Like all 
solar technologies CRS have an inexhaustible fuel 
source and the entire process is not polluting and no 
emissions or greenhouse gases are produced. There is 
hardly any pollution in the form of noise during oper¬ 
ation, and decommissioning a system is not problem¬ 
atic [ 1 ]. Even considering the live cycle of the emission 
of carbon dioxide in the atmosphere during the pro¬ 
duction of the plant components, in the dismantling 
and installation of the power plant it has been found 
that comparatively the solar thermal systems emit only 
12 kg of carbon dioxide per MWh, the hydropower 
plants 14 kg/MWh, the wind turbines 17 kg/MWh, 
and the photovoltaic systems 110 kg/MWh. 

CRS plants have a very good ecological balance. All 
materials used are fully recyclable and most used mate¬ 
rials like glass and steel are available worldwide. Solar 
tower power can therefore make a substantial contri¬ 
bution toward international commitments to reduce 
the steady increase in the level of greenhouse gases and 
their contribution to climate change. 

The use of tower technology is not restricted to level 
surfaces. It can also be installed in hilly landscapes. 
Tower plants can adapt to land diverse in orography, 
ranging from certain undulations in the area for imple¬ 
mentation, to include land characterized by more com¬ 
plex orography, which may involve valley slopes for 
positioning solar fields. An important requirement is 
that the heliostat field should face the south in latitudes 
in the Northern Hemisphere or the north in the 
Southern Hemisphere. In the end, considerable addi¬ 
tional area can be exploited for solar technology 
worldwide. 

A central receiver plant is made up of proven com¬ 
ponents. Operational safety and reliability as well as 
easy maintenance and cutting edge technology can be 
guaranteed. Parasitic pumping losses are smaller due to 
reduced pipe lengths, and the heat transfer fluid can 
also serve as the storage fluid, eliminating the need for 
heat exchangers that increase costs and decrease effi¬ 
ciency [2]. 

A cost-effective series production is possible for 
middle- to large-scale solar tower plants. This feature 
is not only applicable for solar parts like heliostats or 
receiver components but also for conventional parts 
like tubes. Furthermore the solar tower technology 


creates new jobs and contributes to the development 
of a new market. 

Another important advantage is the capability for 
storage at high temperatures and the possibility of 
hybrid operation (see Increase of Operation Hours of 
CRS by Storage and Hybridization). In the future, 
further applications are possible in the fields of solar 
chemistry by the production of fuels and in industrial 
processing, for example, by the melting of metals. 

Concentrating Receiver Systems for Power 
Production 

Main components of a solar tower are the heliostat 
field, the receiver, and the tower itself. 

Heliostat Field 

A heliostat field is the sum of all heliostats of a solar 
tower. 

Heliostats are mirrors which are equipped with 
a two-axes tracking system in order to track the sun’s 
path. A heliostat field provides thermal energy for 
a solar tower power plant (also referred to as a central 
receiver system). Heliostats are named after the Greek 
words helio meaning “sun” and stat meaning station¬ 
ary, because it describes the heliostat’s function which 
is to reflect the solar image and to focus it on a fixed 
position on a tower-mounted absorber, called receiver. 
In order to do so, the heliostat must track the sun’s 
position over the course of the day. As many hundreds 
or thousands of heliostats with a lot of mirror area 
reflect the solar image onto a comparatively small 
area of the receiver, high sunlight concentrations are 
reached. The furthest distance that a heliostat can be 
located away from the tower is approximately 1 km [3] . 

A heliostat has to satisfy criteria like the precision of 
the device and local conditions at the heliostat field site. 
The list of these criteria has a major influence on the 
design and the costs. They include the optical quality 
required, all the extraneous loads to be sustained, the 
required performance of the tracking system, protec¬ 
tive measures for the heliostat and the receiver, and the 
life of individual components, and the heliostat whole 
system [4]. 

Wind loads, temperature, and hail impacts have 
a major impact on the supporting structure of 
a heliostat. 
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Heliostat Components Classical heliostat design is 
dominated by mirrors brought into position by steel 
structures and drives that guarantee high accuracies 
under wind loads and thermal stress situations. Figure 2 
shows the typical parts of a heliostat (shown from the 
back). 

To the structures that hold the mirrors include 
struts, beams, the pylon, and the foundation. The linear 
actuators enable moving of the heliostat in two dimen¬ 
sions (Fig. 3). There exists a zenith and an azimuth 
actuator together with their step motors. 


their tracking systems. Each type has its own charac¬ 
teristics in terms of wind-performance (and tracking 
during windy conditions), costs, and complexity of 
control. A field made up of more small heliostats will 
require a more complex control system than a field of 
fewer heliostats. Thus, large heliostats are more 
cost-efficient than small ones (on a $/m 2 basis). 
Improvements in controllability may cause this to 
change, however, and several companies are attempting 
the strategy of many small heliostats as opposed to 
fewer large heliostats [6]. 


c 


Heliostat Types Several different types of heliostat 
mirrors exist, which can be differentiated according to 
size, shape, basic design concept, or even according to 
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Parts of a heliostat [5] 


Heliostat Shapes 

Canted-Glass Mirror Heliostat A canted-glass mirror 
heliostat is often a single large heliostat that is divided 
into a number of sub-mirrors (it may also be a single, 
large mirror). Each sub-mirror is displaced relative to 
the others so that rays from the center of the sun are 
reflected from analogous points of the sub-mirrors and 
all converge to the same point on the receiver. Thus, 
perfect focusing results in the minimum image size by 
eliminating the contribution of the heliostat size to the 
size of the reflected image. Perfect canting also 
approximates perfect focusing by reducing the total 
heliostat size to that of a single sub-mirror. The 
greater the number of canted sub-mirrors a given 
sized heliostat has, the smaller the contribution of the 
size of the heliostat to the image will be. In other words, 
canting is a Fresnel approximation to focusing (note 
also that the sub-mirrors of a canted heliostat can each 
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View of the main heliostat components (Source: SIJ) 
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be focused in 0-2 dimensions). The curvature or 
displacement required for focusing or canting is time 
dependent, since it depends on the angles between the 
heliostat, sun, and receiver. For most heliostat designs 
the curvature cannot be varied, and the heliostat will be 
perfectly focused or canted for only the one or two 
times of the year when the sun is in the correct 
position. At all other times, the heliostat will produce 
“off-axis aberration” of the image: distortions of the 
ideal image due to operation when the sun is not in the 
design position for the heliostat to be perfectly focused 
[7] (Fig. 4). 

Stretched-Membrane Focused and Non-focused 
Heliostat The reflector of a stretched-membrane 
(SM) focused heliostats is made of either stretched 
polymer (plastic) or metallic foil. Figure 5 shows how 
the image from a SM heliostat is focused. 

The SM design concept has not been very widely 
developed. It uses a thin reflective plastic membrane 
stretched over a hoop. Wind and temperature have 
a direct impact on the supporting structure of a SM 
heliostat. This design must be protected from the 
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Canted heliostat [7] 
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Focused heliostat [7] 



weather but requires considerably less expenditure in 
support structure and the mechanical drive mechanism 
because of its light weight. Membrane renewal and 
cleaning are important considerations with this design. 

McDonnell Douglas, a former major American 
aerospace manufacturer and defense contractor, calcu¬ 
lated that a 20% cost reduction could be achieved with 
developing heliostats with a mirror area of at least 
100 m 2 . In order to further reduce the heliostat costs, 
Sandia National Laboratories (USA) proposed 
a circular SM heliostat with a size of 150 m 2 . 

The proposed SM heliostat was lighter and simpler 
(i.e., requiring fewer parts) than a heliostat constructed 
of glass and metal. 

Moreover, predictions showed that larger sized 
heliostats are more cost-effective [7]. Figure 6 below 
shows an SM heliostat design. 

Flat Type Single Mirror Heliostats The Solar Tower 
Jiilich concentrator field consists of 2,150 heliostats 
with about 8 m 2 of reflective surface densely packed 
in rows on an area of about 18,000 m 2 . The heliostats 
are designed with a single glass facet fixed to a metal 
frame. The small size allows the use of simple linear 
drive units to track the suns position (Fig. 7). 

Heliostat Size The reflective mirror area of heliostats 
varies between 1 and 150 m 2 . The advantage of having 
larger sized heliostats is that since fewer heliostats are 
installed, fewer step motors are required and fewer 
units need to be maintained. The disadvantage how¬ 
ever, is the more difficult installation and maintenance 
of large heliostats, for example, replacement of actua¬ 
tors or broken mirrors. In order to withstand the high 
wind loads, the pylon, foundation, crossbeam and the 
cantilever arm must be designed a lot larger than for 
small heliostats (Fig. 8). 

In the 1980s, it was believed that the way to reduce 
area-specific costs was to build larger heliostats. A more 
recent analysis by Sandia National Laboratories in the 
USA has reconfirmed this; however it does not apply to 
all power tower technologies. 

Several research organizations have predicted that 
much smaller (10-20 m 2 ) heliostats could be cheaper. 
However, these organizations are generally using helio- 
stats within an ultra-high concentration power tower 
system. Such systems require small beam sizes, which 
can be more easily achieved with small heliostats [6]. 
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ASM-150 SM heliostat built by Babcock Borsig Power Environment [3] 


l 



Concentrating Receiver Systems (Solar Power Tower). 
Figure 7 

Flat type heliostat (Source: SIJ) 


The research and demonstration power tower in Jiilich, 
Germany, is a good example of this. The concentration 
factor is more than 750 times, on an ambient- 
air-pressure volumetric receiver. For the lower 
concentration power towers utilizing molten-salt or 
solid-particle receivers, the analysis predicts that low- 
cost heliostats are most likely to be >150 m 2 . 

The 148 m 2 Advanced Thermal Systems (ATS) 
heliostat has successfully operated for the last 20 years 
at the NSTTF (National Solar Thermal Test Facility) in 
Albuquerque, USA. It has survived multiple high-wind 
events, some in excess of 144 km/h, and the quality of 
the beam has not degraded significantly [6]. 

The SM heliostats (ranging between 120 and 
150 m 2 ) proved to be good mechanically, although the 
polymer mirror glued to the metal membrane quickly 
suffered ultraviolet (UV) damage and the face-up stow 
position led to hail-induced membrane damage. As a 
result, Sandia recommended the use of thin glass rather 
than a polymer film. SM heliostats employing glass were 
built and tested at Solar Two and in Spain in the mid 
1990s [6]. 

In 2006 and 2007, in Spain, more than 200,000 m 2 
of heliostats were installed at the PS 10 and PS20 solar 
tower plants. A large-area (121 m 2 ) pedestal-mounted 
heliostat similar to the ATS was selected after an eval¬ 
uation of alternate concepts indicated it was the lowest- 
cost option. 
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Glass/Metal Technology 



Concentrating Receiver Systems (Solar Power Tower). Figure 8 

Heliostats used in solar power plants and their costs (Source: Sandia) 


A recent development toward using small heliostat 
sizes was realized by the company eSolar, which built 
the Sierra SunTower. The advantage of using such small 
mirrors is that they are easily and quickly mountable 
(no cranes are required, see Fig. 9) and the heliostats 
can be mass manufactured. The mirror and 2-axes 
tracking system is deployed on a frame. 

Current research on innovative small-mirror sys¬ 
tems is also being conducted by the Solar-Institut Jiilich 
(SIJ) and German Aerospace Center (DLR) in 
Germany. A possible option to reduce costs lies in 
a heliostat design where all moving parts are protected 
from wind loads. In this way, drives and mechanical 
layout may be kept less robust thereby reducing mate¬ 
rial input and costs. In order to keep the heliostat at an 
appropriate size, small mirrors are used. These are 
placed in a box with a transparent cover. Several designs 
have been examined concerning the drive system and 
the mechanical structures of the mirror array in order 
to develop a cost-effective design. 

Heliostat Field Design The layout of the heliostat 
field is a complex topic. Decisions regarding the best 
position for locating heliostats relative to the receiver 


and how high to place the receiver above the field 
constitute a multifaceted problem, in which costs and 
heliostat loss mechanisms are among the variables, and 
which is solved by an iterative process [8]. 

Figure 10 shows the various basic field positions 
according to the location of the power plant on the 
earth. In the Northern Hemisphere, the suns position 
is to the south of the plant and the field must be placed 
to the north of the tower. In the Southern Hemisphere 
the opposite is true. The general terminology for this 
position would be the “polar side” of the tower, which 
can be applied to towers in both the Northern and 
Southern Hemispheres. The opposite side of the 
tower is the “equatorial side.” With a field on only 
one side of the tower, a cavity receiver is usually used. 
Close to the equator, where the sun passes directly 
overhead, a surrounding-field can be used in combina¬ 
tion with an open receiver. Another possibility is the 
use of two fields - one on either side of the tower, in 
combination with a cavity receiver for each. In a system 
using secondary concentrators, an elliptical field 
should be used, since light reflected from wider-angled 
positions in the field would not enter the secondary 
concentrators. 
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Deployment of mirror field (Source: eSolar) 
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Heliostat field design in dependence of the location [5] 


The aim in the designing of a heliostat field is 
to reliably produce the power required by the power 
block whilst keeping land usage and total costs 
to a minimum. As previously mentioned, the calcula¬ 
tion of the losses, laying out and optimization of the 
field is an iterative process and there are many com¬ 
puter programs which have been designed over the last 
30 years to perform this task. A detailed analysis of 
these computer programs is beyond the scope of this 
article. 


Theoretical Considerations When laying out 
a heliostat field there are several types of losses that 
must be considered. These are not only the optical 
losses, namely, the cosine losses and losses due to 
shadowing, blocking, spillage, and atmospheric atten¬ 
uation, but also technical considerations, that is, the 
mirror reflectivity, mirror surface defects, tracking 
accuracy, wind load, and tower oscillations (due to 
wind load) as well as the heliostat fault rate. The normal 
to the heliostat mirror surface is practically never facing 
the sun, except at the day in the year when at solar noon 
the suns position is directly behind the receiver (in that 
case the particular heliostat would be covered in shade 
from the tower shadow). Over the course of any day the 
heliostat’s azimuth and zenith angle orientation is 
changed every few seconds in order to guarantee accu¬ 
rate tracking of the suns path. Hence, mathematically, 
the effective mirror area is less than the actual mirror 
area. The reduction in effective mirror area is propor¬ 
tional to the cosine angle of the angle of incidence [9]. 

The equation for the loss due to the cosine is given by: 

Qcos,los = 1 COS0 (l) 

The radiant power is calculated with the equation: 

Prad = f dir ' A e ff (2) 
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Figure 11 shows the effect that the cosine loss has on 
the radiant power, which is reflected by a heliostat. The 
cosine loss is the most significant loss of the heliostat field. 

In the illustration, the example heliostat is posi¬ 
tioned directly north of a solar tower. The equation 
for the cosine loss, however, is valid for heliostats in any 
location in a heliostat field. 

Blocking is an effect which occurs for any heliostat 
which has neighboring heliostats in front and front 
sideways of it. Blocking means that a rear heliostat 
cannot reflect a portion of the light rays onto the 
receiver surface because a front heliostat blocks some 
of it, that is, some of the backside of the heliostat is 
illuminated. The lower the sun elevation angle and the 
further a heliostat is located away from the tower, the 
more blocking will occur. 

The following graph illustrates the mechanism of 
blocking (Fig. 12): 

Blocking can be avoided as much as possible by 
increasing the distance between the heliostats Ax as 
shown in Fig. 13. 

To eliminate blocking completely, Ax has to take 
a value such that a line (here illustrated in orange color) 
can be drawn from the center of the receiver to the 
upper tangent of the mirror’s swivel circle of heliostat 


and in continuation to the lower tangent of the mirror 
swivel circle of heliostat i+1 . 

As an approximation, Ax can be calculated as fol¬ 
lows: 


Rearranging for Ax: 

Ax » x • — (4) 

Zt 

where x is the horizontal distance of the heliostat 
pivot point to the center of the surface of the receiver. 
With growing distance x from the tower and with 
increasing ratio of z s /zj the distance between the conse¬ 
quent heliostats must be increased. The distance between 
the heliostats must not be chosen bigger than necessary 
because otherwise the tracking of the sun becomes too 
complex and the land requirement too much [9]. 

Shadowing is similar to blocking except that here 
the light rays, incident on a mirror surface, cast 
a shadow on the heliostat behind or sideways behind 
it. Figure 14 shows the partial shadowing of a mirror. 

The heliostat field reflects and directs the direct 
normal irradiance on the receiver surface. Depending 
on factors such as the mirror quality and tracking 
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Effect of cosine loss on radiant power reflected by heliostat (Source: SIJ) 
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Concentrating Receiver Systems (Solar Power Tower). Figure 12 

Heliostat blocking (Adapted from [9]) 
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Blocking of neighboring heliostats according to [9] 


system accuracy, a portion of radiation may miss the 
receiver surface - this is called spillage. 

These factors contribute to a reflected image at the 
receiver surface that is errant from its intended path to 
the receiver and distorted from its shape as it originates 
from an individual heliostat. 


Figure 15 illustrates the spillage loss for a solar 
tower with north-only field. Spilled radiation is the 
radiation which does not hit the receiver surface (rect¬ 
angular gray area). 

Another field loss is due to atmospheric attenua¬ 
tion, or scattering of beam radiation as the reflected 
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Shadowing loss (Source: SIJ) 


heliostat image passes through air toward the receiver. 
The overall loss due to the atmospheric attenuation 
depends on the distances of the heliostats to receiver. 
The attenuation on the other hand is a function of the 
visibility, which, of course, can vary during the course 
of the day. As an example for the test location Barstow 
(USA), the visibility of a clear day is defined to be 
23 km and for a hazy day 5 km [7]. 

How good a mirror reflects depends not only on the 
quality of the reflective surface, but also on the degree 
of mirror soiling (in certain time intervals the heliostat 
mirrors need to be washed to increase reflectivity) [7]. 

The overall optical loss is the product of the indi¬ 
vidual optical losses, according the following equation: 

VSF,i = Vb,S ' Vc ' Vf ' Vwj ' Vr ' V Fail ( 5 ) 


For computing the overall efficiency of the heliostat 
field rj s F , the following weighted equation is valid: 


Vsf ~ 


n 


Z) VsF,i • Asiv 

i= 1 


Asp 


(6) 


Where n is the number of mirrors, A S F,i the area of 
a single heliostat in m 2 , and ASF the area of the entire 
heliostat field. 

The heliostat field reflects and directs a usable ther¬ 
mal power Quse,SF to the receiver according to the 
following equation: 


Receiver 


Spilled radiation 
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Figure 15 

Spillage loss (Source: SIJ) 


Quse,SF — Asp ' FIsf 1dir 

where T^yis the direct normal irradiance. 


( 7 ) 
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Heliostat Aiming Strategies Aiming strategies serve 
the purpose of distributing the irradiation as evenly as 
possible on the receiver. There are several reasons for 
this: 

• Absorbed radiation is transformed into heat and 
must be removed by a heat transfer medium 

• Resulting temperature of the absorber material of 
the receiver depends on the mass flow of the heat 
transferring fluid 

• If the temperature of the fluid is to be kept as high as 
possible, the mass flow cannot be increased 
arbitrarily 

• Allowable flux density is fixed indirectly by the 
allowable temperature of the absorber material of 
the receiver 

• One (central) aim point in the receiver plane cannot 
be used in most cases. Along with this, high tem¬ 
peratures result in higher-than-average radiation 
losses 

There are several options for aiming the heliostats 
at different points. Many of the computer programs 
used to lay out the heliostat field are also designed to 
control the aiming strategies of the heliostat field as 
well as measure and control the flux density distribu¬ 
tion on the receiver. This makes perfect sense, since the 
flux density incident upon the receiver is a direct result 
of the layout and aiming strategies of the heliostat field. 
As with the heliostat field layout programs, a detailed 
analysis of these programs is also beyond the scope of 
this text and mathematical details can again be found in 
each of the quoted sources [7, 10, 11]. 

The flux density on the receiver depends on several 
factors: 

• Sun position, sun shape (daytime and season) 

• Number, size, and optical quality of the heliostats 

• Focal length and the canting of the heliostats 

• Most important influence: distribution of the indi¬ 
vidual aim points of the heliostats 

The aiming options described below (A to F, in 
Fig. 16) are quoted from the handbook of “Delsol 3” 
[7], which is an early computer code for calculating the 
optical performance and optimal system design for 
solar thermal central receiver plants. The five different 
strategies are generally required for the design of flux- 
limited liquid-(or steam)-carrying receivers. Different 


aiming strategies are also applicable to various types of 
receivers. The receiver types which will be mentioned 
here are: 

• External (cylindrical) 

• Cavity with Single Aperture 

• Single Flat Plate 

• Rectangular Aperture or Flat Plate 

• Elliptical Aperture or Flat Plate 

The aiming options are time dependent; that is, the 
number of aim-points can change over the year if the 
image changes. 

(A) Single Aim Point: This is the simplest of all aiming 
options. All heliostats are pointed at the center of 
the receiver (when viewed from the heliostat sur¬ 
face). This option produces the maximum flux on 
the receiver. 

(B) One-Dimensional Aiming: The heliostat images 
are spread out along the “height” of the receiver 
or aperture until the spillage starts to increase. As 
seen in the figure, the smaller images of the inner 
(or best focused) heliostats can be spread out over 
more aim points than the larger images of the 
outer (or less well focused) heliostats. This option 
reduces both the peak flux and the flux gradients 
on the receiver. This is the aiming option to use 
with external receivers and with cavity or flat plate 
receivers of elliptical shape. Since the size of the 
images from the heliostats can change with time, 
the one-dimensional aiming also changes with 
time, so that both the number and position of 
the aim-points may change. 

(C) Two-Dimensional Aiming: This option is similar 
to (B) except that the images are spread out in two 
dimensions as shown in the figure. This results in 
even smaller peak fluxes than (B). However, this 
option should only be used with rectangular cavity 
apertures or rectangular flat plates. If used with 
elliptical receivers the spillage will increase. Fur¬ 
thermore, if used with external cylinders much of 
the flux will be incident on the receiver at very fine 
incidence angles where the absorption is poor. 

(D) Single Aim Point at the Lower Part of the Receiver: 
The heliostats are aimed as close to the bottom 
of the receiver as is possible without increasing 
spillage significantly, as shown in the figure. 
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Concentrating Receiver Systems (Solar Power Tower). Fig 

Basic heliostat aiming strategies of computer code "Delsol 3 


The aim-points will vary somewhat with time as 
the heliostat image sizes vary, so that spillage 
remains relatively constant. There are several rea¬ 
sons for considering this strategy. First, if the fluid 
enters from the bottom of the receiver the peak 
fluxes will occur near the colder (and presumably 
stronger) end of the piping. In reality, the penalty 
is increased radiation and convection losses, since 
the average receiver temperature is increased. 
However, if the fluid enters from the top of the 
receiver, the radiation and convection losses are 





ure 16 

"[ 6 ] 

minimized, but the peak flux occurs near the hot 
end of the tube. Typically, peak fluxes will be close 
to those levels resulting from the single aim-point 
strategy (A). 

(E) One-Dimensional Aiming at the Lower Part of the 
Receiver: Similar to (D), except that the images are 
spread out along the bottom of the receiver, as 
shown in the figure. This option should only be 
used with rectangular cavity apertures or rectan¬ 
gular flat plates, for the same reasons as described 
in Section (C). 
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(F) User Defined Aiming Strategy: A user defined 
aiming strategy can be defined for each zone. 
Each zone can be assigned a rectangular grid of 
points. This aiming strategy is not time dependent; 
that is, the aim-points are the same no matter what 
time of day or year it is, even though image size 
and thus spillage might change with time. 

A combination of the aiming strategies (C) and (E) 
is also possible. For this aiming strategy, inner helio- 
stats will be aimed at the bottom of the aperture or flat 
plate receiver. However, starting at zones farther out, 
aim-points are spread out over the entire aperture or 
flat plate. This option is one method of reducing the 
internal spillage (flux impinging on un-cooled parts of 
the receiver, such as the roof) in a cavity receiver 
without exceeding flux limits at the bottom of the 
heat absorbing area. 

Examples of systems which have been applied in 
real power plants are [11]: 

• Solar-Two Plant at Bar stow, California: 

Full-System-Experiment with cylindrical salt- 

receiver required a 3-level aim point strategy to 
avoid receiver damage. 

Excess Flux Density Protection System: 

SAPS (Static Aim Processing System). Every 
10 min a new aim point distribution for all 
1926 heliostats is calculated for optimal flux 
density distribution. New aim points are 
transmitted to the heliostat field control 
system. 

DAPS (Dynamic Aim Processing System). Flux 
density distribution is re-calculated under 
current operation conditions. Currently, 
allowable flux density distribution is calcu¬ 
lated in dependence of the current salt tem¬ 
perature at each measuring point. If local 
flux exceeds allowable local flux due to 
abnormal operating conditions, the respon¬ 
sible heliostat will be driven to a stand-by 
position. 

• PHOEBUS-Receiver system on CESA-1 -Tower at 

Almeria, Spain: 

Full-System-Experiment required an aim point 
strategy to avoid deterioration of the receiver 
due to excessive temperatures. Open Volumetric 
Receiver with metal-wire-mesh-absorber. 


Heuristic knowledge-based heliostat field control to 
control the temperature distribution of the 
receiver. 36 thermocouples were used for the 
measurement of the current temperature distri¬ 
bution on the receiver. 

The system used a five-point aiming strategy, with 
displacement of the five aim points on the 
receiver plane to adjust the temperature 
distribution. 

Receiver 

The function of a receiver is to absorb the concentrated 
solar irradiation energy impinging from the heliostat 
field and transfer it to the working fluid. Requirements 
to a solar receiver include high thermal conductivity, 
dark color of the body for high absorptivity, and tem¬ 
perature resistance. Four concepts for receivers are 
described in this section. 

Direct Absorption Receiver: Falling Particle or 
Molten Salt Liquid Film Falling particle or molten 
salt liquid film receivers fall in the category of direct 
absorption receivers, which is a concept where the heat 
transfer medium is directly exposed to and heated by 
concentrated solar irradiation. This concept is a very 
interesting and promising option as the medium of 
either of the two technologies has the ability to handle 
high temperatures up to 2,000°C as well as high fluxes 
up to 2 MW/m 2 , or more. Moreover, no walls, but 
merely containment is required. The design with par¬ 
ticles as a heat transfer fluid has the advantage that 
particles have a selective behavior [12]. 

A concept for a molten salt liquid film receiver was 
worked out by researchers who developed a beam- 
down receiver design. The receiver is designed cone 
shaped and is located near the ground. The incident 
concentrated solar irradiation from the heliostat field is 
reflected by a central reflector and directed onto the 
receiver. The design includes a hot and a cold heat 
storage system. When the plant is in operation, molten 
salt is pumped from the cold storage and fed to the 
receiver’s salt distribution system. After the molten salt 
is heated by the receiver it is passed to the hot storage 
and, when the design temperature has been reached, 
from there is passed to a steam generator (not shown in 
diagram). The molten salt transfers its heat to the steam 
cycle and is pumped to the cold storage. 
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The advantage of this receiver design is that all 
components but the central reflector are located on 
the ground, hence the pumping power requirement is 
lower as no media are pumped to high altitudes. The 
disadvantages are the optical losses associated with the 
central reflector. These are beam dilution due to 
a longer beam path and the increased number of reflec¬ 
tions [13]. 

To date, studies on the concept of falling particle or 
molten salt liquid film receivers exist. Such studies were 
conducted in Germany (German Aerospace Center - 
DLR) and in the USA (Sandia National Laboratories, 
SERI, LBL). Further prototypes for direct absorption 
central receiver have been build and tested. A solid 
particle receiver was built and tested on-sun at Sandia 
National Laboratories. It consists of a 6 m tall cavity 
through which aim wide curtain of spherical ceramic 
particles is dropped and directly heated with concen¬ 
trated solar energy [ 14] . The receiver was tested on top 
pf Sandias 61 m tall central receiver located at the 
National Solar Test Facility in Albuquerque, NM. 

Tube Receiver A tube receiver consists of a large 
number of vertically arranged pipes through which 
the HTF is pumped in upward direction as shown in 
the following illustration (Fig. 17). 

The heat transfer fluids that have been deployed in 
the past years are molten salt, H 2 0 for direct steam 
generation, and sodium. 

An example of such a tube receiver is the solar tower 
power plant Solar Two near Barstow, California that 
used molten salt as HTF. The receiver consists of a great 
number of HTF pipes. The pipes are fixed vertically 
whereby the salt smelt is pumped in upward direction. 
On leaving the receiver, the molten salt is pumped into 
a hot-salt storage tank where it has a temperature of 
approximately 565°C. 

A disadvantage for using the pipe receiver is the 
high reflection loss which is higher than for other 
receiver types, especially the receiver with porous 
absorbers. 

Cavity Receiver A cavity receiver consists of a cavity 
with a small opening (inlet aperture). The concentrated 
solar irradiation is directed into the small opening 
where it impinges on tubes carrying the working 
fluid. Cavity receivers can have the same working fluids 
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Figure 17 

Tube receiver concept and HTF flow direction (Adapted 
from [15]) 

as external tube receivers, but also work with a gas. The 
idea behind the cavity receiver is to minimize the radi¬ 
ation losses. From the radiation entering the inlet aper¬ 
ture, only small amounts are reflected back into the 
atmosphere through the inlet aperture (Fig. 18). 

In the 20 MW e industrial Gas Cooled Solar Tower 
project GAST, which the German Aerospace Center 
(DLR) participated in, air was used as heat transfer 
fluid (HTF). The dual cavity receiver atop the 80 m 
high CESA-1 tower at the Plataforma Solar de Almeria 
(PSA), Spain, had a design thermal power of 28 MW t h 
each, which was provided to the subsequent Brayton 
cycle [15]. 

Another cavity receiver concept is implemented in 
the PS 10 solar tower plant in Spain. The tube receiver is 
basically a forced circulation radiant boiler with low 
ratio of steam at the panels output, in order to ensure 
wet inner walls in the tubes. Special steel alloys have 
been used for its construction so that it can operate 
under significant heat fluxes and possible high temper¬ 
atures [16]. 
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Piping Hot fluid outlet 



Concentrating Receiver Systems (Solar Power Tower). 
Figure 18 

Basic cavity receiver concept (Adapted from [15]) 

Air Receiver There are two types of air receivers, the 
open volumetric and pressurized volumetric receiver. 

Open Volumetric Receiver The class of volumetric 
receivers may be summarized according to two 
criteria [17]. Firstly, with regard to materials used 
there exist metal or ceramic types. Secondly, with 
regard to geometry design there exist a great variety of 
designs which seek to achieve an acceptable 
compromise between good optical properties and 
high heat transfer properties. Volumetric receivers of 
different designs have been developed in the last three 
decades and nowadays several design concepts of 
absorbers have been tested and will be developed 
further in future. The current available types are 
grouped into three categories: Absorbers with channel 
geometry, ceramic or metal wire-mesh absorbers, and 
ceramic foam absorbers [18]. 

For all these absorber types there are two character¬ 
istic dimensions that play an important role in their 
design [18]: 

Open porosity : The open porosity describes how 
much open cross-sectional area the porous structure 
has for the inflow air. The higher the porosity, the 
thinner the walls of the channels, and hence the more 


prone to breaking of the structure becomes due to 
thermal stresses. 

Volume-related heat transfer cross-sectional area : 
This characteristic dimension is calculated via the 
porosity and the distance between the inner walls of 
a channel. Generally, it is desired that the volume- 
related heat transfer cross-sectional area is as large as 
possible such that more irradiation can be absorbed. 
However, it has to be taken into account that the 
thickness of the channel walls cannot be designed too 
thin or the structure would becomes prone to breaking, 
hence there is a limit to increasing the volume-related 
heat transfer cross-sectional area. 

In this receiver type, the absorption of concentrated 
solar radiation takes place in a spatial volume and not, 
as usual, on a surface. Because of the enlargement of the 
heat transfer surface, gases like air which are easy to 
handle can be used as HTF [19]. 

Three main conditions must be posed to the devel¬ 
opment of the volumetric receiver: 

• Receiver material must consist of small substruc¬ 
tures to ensure an acceptable heat transfer 

• Receiver material must stand high temperatures 
and temperature gradients 

• Receiver geometry must support a temperature dis¬ 
tribution within the receiver that increases toward 
the inner region 

The main advantage of volumetric structures on the 
one hand is that the absorption does no longer take 
place on just one level, but within a volume, which 
results in a lower radiation ratio per aperture unit. 
On the other hand, spatial structures alter the develop¬ 
ment of the temperature profile within the receiver, 
leading to a more favorable efficiency [20]. Because of 
the spatial absorption, the temperature maximum is 
shifted from the aperture plane into the receiver core, 
causing a reduction of the emissive losses, which are the 
major source of loss in any receiver. 

The open volumetric receiver technology is best 
described taking the Solar Tower Jiilich (STJ), 
Germany, as an example as it is the first tower using 
this technology: 

The Solar Tower Julich (STJ) is a research and 
demonstration plant for central receiver systems and 
has been built to demonstrate the high temperature 
air receiver technology as a complete system [21]. 
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The receiver of the Solar Tower Julich has been fitted 
with open volumetric absorber type, which is an 
absorber with channel geometry, because it best meets 
the high demand in the application of a solar tower 
power plant that is, withstand high temperatures and 
offering high stability. This technology works with 
a porous absorber material that absorbs concentrated 
radiation inside the volume of the structure and trans¬ 
fers the absorbed heat to a fluid passing through the 
structure. This allows the use of air as heat transfer 
medium despite its low heat transfer coefficient. This 
volumetric receiver technology has been developed 
since the early 1990s in various R8cD projects 
with major contribution of the German Aerospace 
Center (DLR). 

Pressured Volumetric Receiver The pressurized 
volumetric receiver was developed within the 
project REFOS and offers new opportunities for solar 
towers. 

The closed volumetric receiver has a quartz window 
in front of the absorber modules separating them from 
the ambience. An air compressor sucks in ambient air 
and builds up the air pressure to 15 bar, the pressure at 
which the air enters the receiver (see Fig. 19). 

The high operating pressure enables a hot air tem¬ 
perature of approximately 800° C. 

The advantage of the closed to the open volumetric 
receiver lies in the higher efficiency that can be reached 
as combined gas and steam turbine processes can reach 
efficiency beyond 50% compared to only 35% of that of 


a conventional steam turbine process. This type solar 
tower with closed volumetric receiver therefore makes 
possible solar system efficiencies of over 20% [23]. 

Tower 

The choice of tower constructions depends primarily 
on the required height of the tower. Towers are mainly 
constructed of steel or reinforced concrete. Steel towers 
are similar to guy-wire supported television transmis¬ 
sion towers or free-standing microwave relay towers. 
Several central receiver design studies have considered 
guyed towers, but the presence of guy wires and 
their attachments to the tower in concentrated 
solar flux proved unworkable [24]. Concrete towers 
are similar to tall chimneys at conventional fossil 
power plants. 

Factory-made monopoles commonly used for wind 
turbines may also be a possible solution for tower 
constructions. The shape of a tower differs from cylin¬ 
drical to quadratic. 

Providing Firm and Dispatchable Power 

CRS main application is the production of electricity. 
For this reason the heat produced at the solar receiver is 
fed to a heat recovery steam generator. The steam is 
then expanded in a steam turbine producing electricity. 
Three main systems for providing firm and 
dispatchable power with a CSR exist nowadays 
depending on the heat transfer fluid that is used. 
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Concept REFOS [22] 
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In the first system, molten salt is pumped and 
passed through the receiver till the receiver tempera¬ 
ture reaches the operating temperatures, being ready 
for normal operation (see Fig. 20). Once the receiver 
has reached a sufficient temperature and the circuit has 
been adequately heated, the receiver bypass is closed, 
the receiver is loaded, and the salt outlet tank is filled at 
565°C [25]. 

While the salts are circulating through the receiver, 
the heliostat field focuses on the receiver to transmit 
the thermal power to the salt. From the receiver’s out¬ 
put tank, the hot salt descends to the hot salt storage 
tank, where the molten salts are pumped with the 
required flow from the hot salt tank to the steam 
generator system (SGS). 

Molten salt is passed through a steam generator 
where the heat is transferred onto the water-steam 
circuit. When this salt flow has been cooled at the outlet 
from the SGS, it returns to the cold salt storage tank. 
When leaving the steam generator the molten salt still 
has a temperature of at least 290° C and is pumped into 
a cold-salt storage tank. 

Such a system is implemented additionally to the 
Solar Two plant in the Gemasolar central receiver 
power plant in Spain. 

The second system uses water/steam as a HTF in the 
PS 10 receiver system (Fig. 21). 

The solar receiver on top of the tower produces 
saturated steam and circulates it to a conventional 
steam turbine. In the turbine, it expands to produce 
mechanical work and electricity. 


For cloudy periods, the plant has a saturated water 
thermal storage system. During full load operation of 
the plant, part of the steam produced by the receiver is 
used to load the thermal storage system. When energy is 
needed to cover a transient period, energy from satu¬ 
rated water is recovered at 20 bar to run the turbine at a 
50% partial load [16]. Such a system is implemented in 
the PS 10 and PS20 central receiver power plants in 
Spain and in the Sierra SunTower in the USA. 

The third system uses air as a HTF. Figure 22 shows 
the schematic diagram of such a solar tower system. 
The heat is transferred to air, which is sucked through 
the receiver structure. The hot air is transported to the 
storage system or the heat recovery steam generator 
(HRSG). 

In the conventional cycle, steam is produced in the 
boiler and thermal energy is transported to a steam 
turbine. The steam expands in the turbine producing 
mechanical work which is then converted into electrical 
energy by a generator. The exhaust steam from the 
turbine is then condensed in a condenser, and the 
condensate thereafter is pumped to the boiler where it 
receives heat from the solar receiver again, and the cycle 
is repeated. 

Part of the received energy is used for direct electric 
generation with the HRSG and the rest energy is stored 
at the storage system to generate electricity when there 
is no solar radiation during day or at night. The ther¬ 
mal storage unit is integrated into the air cycle, through 
which the operation of the power plant can be held for 
a certain time at constant power, depending on the 
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Schematic diagram of a solar tower power plant (Solar Two) using salt as a HTF [3] 
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Schematic diagram of the PS10 solar tower using water/steam as HTF (Source: Abengoa Solar) 
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Schematic diagram of the test and demonstration power plant Julich using air as a HTF (Source: SIJ) 


storage dimensions. In principle, this thermal storage 
can be designed with an unlimited capacity, securing 
a continuous power plant operation. It is used as 
a buffer that stores energy in times of high irradiation 
and enables operation of the plant after sunset or dur¬ 
ing periods of reduced solar input. 

This third system finds application in the Solar 
Tower Julich located in Germany. 

Characteristics of Different Heat Transfer Fluids 

A heat-transfer fluid medium (HTF) in a central 
receiver absorbs the highly concentrated radiation 
reflected by the heliostats and converts it into thermal 
energy to be used for the subsequent generation of 
superheated steam for turbine operation. Different 
heat transfer can be used. To date, the heat transfer 


media demonstrated include water/steam, molten 
salts, liquid sodium, and air. While the US industry 
uses molten salt as heat transfer and storage device the 
European industrial sector provides air or water/steam, 
preferably as a heat carrier, and uses fixed-beds for 
storage of thermal energy. If pressurized gas or air is 
used at very high temperatures of about 1,000°C or 
more as the heat transfer medium, it can even be used 
to directly replace natural gas in a gas turbine, thus 
making use of the excellent cycle efficiency (60% and 
more) of modern gas and steam combined cycles [26]. 

Salt 

In general, salt can be stored easily because of its prom¬ 
ising heat transport capability. For this purpose it can 
be stored in a hot tank and recirculated to a cold tank. 




































Concentrating Receiver Systems (Solar Power Tower) 


C 


2369 


Molten salt is mainly based on sodium and potassium 
nitrates. A further advantage of molten salt as a HTF 
includes a lower operating pressure and better heat 
transfer than a water/steam receiver. Better heat trans¬ 
fer results in a higher allowable incident solar flux at the 
receiver. That means that a smaller receiver can be 
implemented in the solar tower system. 

On the other hand, the working fluid molten salt 
requires special handling because of its low viscosity 
(similar to water) and due to the problem of solidifi¬ 
cation at temperatures of approximately 220° C. The 
molten salt is kept at a minimum temperature of 
approximately 290°C. Due to the low viscosity, storage 
and transport of the molten salt is difficult [27]. As the 
salt must always be kept in a molten state, special 
heating is required whenever the plant is not in oper¬ 
ation. As a result, additional heating requirements con¬ 
tribute to the operating costs. 

An electric heating system is also necessary in order 
to avoid corrosion. Additionally, its accident sensitivity 
is a further disadvantage. 

Water/Steam 

When using water/steam as a HTF no heat exchanger is 
needed because the steam production results directly in 
the central solar receiver at the top of the tower. Then 
a direct link-up to the steam turbine is achieved. 

Fluctuating solar radiation leads to rapidly fluctu¬ 
ating parameters and phase change and therefore the 
operation control turns out to be complicated. Further, 
the heat transport is not optimal and the thermal 
storage becomes difficult. 

Air 

Air as a HTF is freely available, easy to handle, nontoxic 
and ambient friendly. No hazardous substances are 
used. The air receiver offers the highest level of safety 
and protection from view of environmental impact and 
less costs. 

In contrast to molten salt, air does not require 
freeze protection during times of nonoperation and 
allows working with very high process temperatures 
and no limitation of temperature. This enables the 
operation with high steam parameters and therefore 
high efficiencies in combination with modern conven¬ 
tional steam power technology. 


On the other hand, air does not provide good heat 
transfer, has a lower heat capacity, and the storage 
capability is not optimal. 

Solar central receiver power plants based on air¬ 
cooled receivers are a promising technology offering 
both operational advantages due to the simplicity of 
the hot air loop and the potential for high conversion 
efficiencies. 

Others 

Different gases too, like C0 2 , helium, organic fluids, 
and sodium may be used as HTF. 

The use of sodium was realized only once. Only 
once sodium was used as a HTF in the Small Solar 
Power System (SSPS) plant in Spain. The SSPS plant 
was the first central receiver plant in Europe and was 
built on the Plataforma Solar de Almeria (PSA) in 
Almeria, southern Spain, by the International Energy 
Agency (IEA). This solar tower producing 0.5 MW 
started operation in 1981. This CRS was destroyed in 
the year 1986 by a sodium fire. Due to this experience, 
sodium as a HTF was never used again in solar tower 
power plants. 

Increase of Operation Hours of CRS by Storage 
and Hybridization 

A solar thermal power station consists of a conven¬ 
tional block-unit power station and a solar component 
which replaces the combustion chamber of 
a conventional power station. Such power stations 
reach annual nominal loads of up to 3,000 h in loca¬ 
tions of high irradiation (e.g., North Africa). 

In order to increase the operation hours of a central 
receiver plant, storage or hybridization may be 
implemented. By both systems, an increase of the 
capacity factor of the conventional power block of the 
solar tower power plant can be achieved. 

Through the integration of solar thermal storage or 
supplemental fossil or biomass firing, solar tower 
power plants produce dispatchable electricity to 
match peak demands at any time. 

Storage 

Storage represents one approach for achieving a high 
capacity factor for a central receiver system. Finding 
efficient ways of storing energy is one option to deal 
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with the lack of year-round sunshine. By using thermal 
storages nominal loads of 4,000 h can be reached. For 
solar technologies it is necessary to include an energy 
storage system due to the variation of the solar irradi¬ 
ation onto the earth’s surface with time (e.g., night and 
day, seasonal changes, diurnal variation, and weather). 
Using storage units increases secure and permanent 
energy supply, and as a result the solar fraction is 
enlarged when the energy is provided by solar 
irradiation. 

The current heat storage types for CRS may be 
sensible heat storage system, latent-heat storage system, 
steam accumulator, and thermochemical storage 
system. 

The term “sensible heat” describes the heat which is 
absorbed or released by a material as a result of 
a change in temperature, whereupon the material 
does not undergo a change of aggregate state. Two 
sensible heat storage systems can be distinguished: 
indirect storage and direct storage [28]. 

An example of indirect storage is a regenerator. 
Regenerators are the “natural” technology choice for 
heat storage in air-cooled central solar power plants. 
The technology offers considerable design freedom that 
needs to be exploited to come to well-adapted solutions 
for market-scale implementations. 

Heat storage regenerators represent a self-evident 
choice for the combination with air-cooled solar cen¬ 
tral receiver systems: The solar-heated air-stream 
passes along a solid storage medium taking up the 
heat to be stored. Reversing the flow direction allows 
to supply the steam generator at times of insufficient 
solar radiation. In principle, this technology is appli¬ 
cable to both atmospheric and pressurized systems 
[29]. The STJ solar tower in Jtilich, Germany, uses 
a regenerator as a storage system. 

In direct storage systems, the HFT which is heated 
by a receiver is used directly as a storage medium. The 
solar tower power plant Solar Two, for example, uses 
a 2-tank direct storage system consisting of a hot-salt 
and a cold-salt storage tank. 

The storage fluid consists of a eutectic salt mix of 
sodium nitrate (NaN0 3 ) and potassium nitrate 
(KN0 3 ) in the proportion 60% NaN0 3 + 40% KN0 3 , 
with a total weight of 1,500 t. Eutectic means that 
mixed in this specific ratio, the salt mixture has the 
lowest possible melting temperature that can be 


achieved, which in this case is 221°C. It is important 
to keep the salt temperature above the solidification 
temperature permanently, even if the plant is not in 
operation [28]. 

Latent-heat storage systems utilize the material 
property of absorbing or releasing heat energy during 
a phase change. In principle, there are three possible 
phase changes: solid-solid, solid-liquid, and liquid- 
vapor. 

Only during the phase change from solid to liquid, 
storage capacity and storage volume are in a realistic 
ratio to one another. For latent-heat storage systems 
used in medium and high temperature applications, 
materials are chosen whose melting temperatures lie 
within the temperature range of the process. Based on 
the mass or the volume of the storage medium, it is 
possible to store considerably more energy with such 
a phase change material than, for example, with con¬ 
crete. An advantage of latent-heat is that the tempera¬ 
ture stays constant during melting and solidification, 
which is important for stable power plant operation. 
Latent-heat storage systems can have a compact design 
and are, for example, under development for dish 
power stations. Moreover, latent-heat storage systems 
are also under development for parabolic trough and 
solar tower power plants [28]. 

Steam accumulators are pressure vessels in which 
a charging system feeds steam in the hot water (well 
distributed). Through the input of heat of condensa¬ 
tion, the temperature of the hot water in the storage 
vessel rises. Steam accumulators utilize the ecologically 
most worthwhile and cheapest HTF available. 
The necessary pressure vessels, however, dominate the 
investment costs. The PS 10 solar tower in Seville, 
Spain, uses a steam accumulator as a storage system. 

In thermochemical energy storage systems, revers¬ 
ible reactions are used to store energy, which are endo¬ 
thermic when charging and exothermic when 
discharging. The useful specific reaction enthalpy of 
such reactions is merely one order of magnitude 
smaller than that in the combustion of fossil fuels and 
hence considerably larger than that in the storage of 
sensible or latent heat. Moreover, after separating the 
reaction products the energy can practically be stored 
loss-free over a longer period of time [28]. 

In regions with very high radiation, solar thermal 
power plants with heat storage facilities can reach 
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a maximum of 3,000-4,000 nominal load hours per 
year. Modern storage technology using molten salt 
enables solar tower power plants to bridge more than 
6 h of nonsolar periods depending on the chosen stor¬ 
age capacity. 

Hybridization 

One major option for accelerating the market intro¬ 
duction of solar thermal power technology is the con¬ 
cept of solar-fossil hybrid power plants. Their 
advantage compared to solar-only systems lies in low 
additional investment costs due to an adaptable solar 
share, high operation time, dispatchable power pro¬ 
duction, and reduced technical and economical risks 
[30] . Dispatchability implies that power production can 
be shifted to the period when it is needed. At times of 
low or no solar irradiation, a hybrid operation of the 
power station would decrease the specific electricity 
generation costs and increase the plant availability for 
the potential customer. Hybridization of a power tower 
with a fossil fuel holds further several additional dis¬ 
tinct advantages, including power ride-through for 
solar intermittency, increased capacity factor, and 
potentially enhanced acceptance among utilities [31]. 

In order to demonstrate the hybridization of a solar 
tower system, an open central receiver system with air 
as HTF is considered. 

For the hybridization, one main option is to place 
a gas turbine or a burner in the solar cycle before the 
heat recovery steam generator (HRSG) [32]. As has 


been pointed out, in [30] burning fuel might be natural 
gas or biogas and after the HRSG the exhaust gas is 
recirculated to the receiver or can be passed to a stack. 
In order to achieve round-the-clock, 100% renewable 
energy CRS can be hybridized with biomass energy. 

The idea of hybridization is to combine fuel such as 
biogas or gas with low-carbon solar power during day¬ 
light to generate uninterrupted electricity. The power 
plant can then shift seamlessly between using the sun as 
fuel and a conventional or another renewable fuel [33]. 

Figure 23 shows the realization for such a hybrid 
system with a gas turbine. 

At days of low solar radiation, the gas turbine or the 
burner adds partly or entirely heat to achieve nominal 
load of the boiler [34]. 

Through hybridization, for example, by combustion 
of biofuels, the production of electricity can be increased 
up to 8,600 h [33] . It is expected that such hybrid power 
plants will have a high potential for the market introduc¬ 
tion in the next decade. Due to the high electricity 
demand, especially in countries of the sunbelt, it is 
expected that the hybrid system has a considerable market 
potential and will facilitate the market penetration. 

Another hybridization option is to combine solar 
energy with a pressurized solar receiver with a gas tur¬ 
bine at high temperatures. As outlined in [35] using 
solar energy into the gas turbine of combined cycle 
systems (CCS) is a very promising way as it is used to 
preheat the compressor discharge air before it enters 
the combustor or the gas turbine. Solar air preheating 
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Schematic diagram of the Julich demonstration plant with a gas turbine (Source: SIJ) 
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offers superior performance, as the solar energy 
absorbed in the heated air is directly converted with 
the high efficiency of the CCS [36] . It can be realized by 
molten salt receivers up to 560°C [37] or with pressur¬ 
ized volumetric receivers [38]. Figure 24 shows the 
concept of a solar-hybrid gas turbine system, which 
was investigated by the DLR. 

Concentrated solar radiation enters the receiver 
which is heated up. Air from the gas turbine is passing 
through the closed pressurized receiver and is heated 
up by forces convection [39]. Then the hot air is passed 
through a gas turbine which expands it. The exhaust gas 
is then passed through a steam generator and afterward 
leaves the system through a chimney. In the steam gen¬ 
erator, steam that drives a steam-cycle process is 
produced. 

Different projects run in this research field. EU, for 
example, funded in the last decade research on hybrid 
solar technology in the modifying of the combustion 
chamber of a gas turbine to allow dual use of solar heat 
and fossil fuel (Solgate) and in the developing of an 
efficient solar-hybrid micro turbine (Solhyco). 

The project objective of Solgate was the develop¬ 
ment of a solar-hybrid power system with direct solar 
heating of a gas turbine’s pressurized air. A pressurized 
receiver module for 1,000°C was developed and built. 
The high temperatures were achieved using ceramic 
absorbers in combination with active cooling measures 
on the receiver window. To reduce receiver cost, a low 
temperature tubular receiver module was designed and 
built, with emphasis on low-pressure drop. The test 
setup consisted of three modules and a gas turbine 
system with modified combustion system, air ducts 
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Scheme of solar-hybrid gas turbine system (Source: DLR) 


and control system coupled to a generator. The com¬ 
ponents were integrated to a complete solar-hybrid 
power system and installed in the PSA solar tower 
facility in Almeria, Spain. Within the Solgate project, 
for the first time a gas turbine was operated with solar 
energy from a volumetric pressurized receiver [40]. 

According to the approach taken by Solgate, the gas 
turbine receives hot air from the outlet of the high 
temperature receiver module. This hot air enters the 
modified gas turbine combustor through a cooling 
bypass system [16]. The development of a gas turbine 
able to operate at higher temperatures allows the use of 
this solar technology without changes. 

The main aim of the Solhyco project was to develop 
a highly efficient solar-hybrid micro-turbine (SHM) 
system for power and heat generation with dual solar 
power and fuel input. A reliable and cost-effective 
receiver system is being designed to reduce the gener¬ 
ation cost. And a new combustion system was devel¬ 
oped to allow the use of biofuels. 

In the last years another interesting project (Solugas) 
aims in the further development of solar-hybrid gas 
turbine. For this demonstration project, a solar tower 
was planed to be designed and implemented to house 
the receiver, hot gas piping and gas turbine package 
[41]. The hot, pressurized air from the solar receiver is 
directly fed into the combustion chamber of a gas tur¬ 
bine where natural gas is added to further heat the air to 
the turbine firing temperature design point. 

Examples of CRS Plants 

Typical optical concentration factor ranges in CRS plants 
from 200 to 1,000 and plant sizes of 10-200 MW are 
chosen because of economy-of-scale constraints, even 
though advanced integration schemes are claiming eco¬ 
nomic sense for smaller units as well [42]. 

Research Facilities 

Solar One The Solar One plant near Barstow, Cali¬ 
fornia was a 10 MW e water-steam solar power tower 
facility. It operated from 1982 to 1988 and ultimately 
achieved 96% availability during hours of sunshine 
[43]. During its power production years Solar One 
delivered over 37,000 MWh to the utility grid [44]. 

Figure 25 shows an aerial view of the Solar One 
solar tower power plant at operation. 
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Solar One power plant aerial (Source: Sandia) 


The heliostat field included 11,318 heliostats posi¬ 
tioned 360° around the tower. The total reflective area 
of each heliostat was 40 m 2 . 

Solar One used a water/steam solar receiver coupled 
directly to the turbine-generator. The receiver system 
used the reflected sunlight to heat water directly, creat¬ 
ing superheated steam. 

A thermocline oil/rock system provided thermal 
storage [45]. This storage had the disadvantage of hav¬ 
ing high thermodynamic losses. Additionally, as men¬ 
tioned in [45], the coupling of the solar receiver with 
the turbine caused the turbine to drop off-line each 
time a cloud came by. These two disadvantages initia¬ 
tive the US Department of Energy to start supporting 
a new project: Solar Two. 

Solar Two Solar Two (Fig. 26) employed a molten- 
salt heat transfer system. Solar Two operated from 1996 
to 1999 and helped validate nitrate salt technology, 
reduce the technical and economic risks of power 
towers, and stimulate the commercialization of solar 
power tower technology [43]. 

During operation, cold molten salt is pumped from 
a cold salt tank through a receiver. With the high solar 


concentration reaching the receiver the salt is heated up 
to 565°C. Then it is stored in a hot salt tank and used to 
produce steam to power a steam turbine. 

The molten salt is kept to a minimal temperature of 
290° C at all times to avoid solidification. The figure 
below shows the power plant of Solar Two. 

The plant demonstrated an energy storage design 
with a two-tank molten-salt system. With the aid of 
heat storage tanks, the capacity factor of power plants 
can be raised from near 25% without storage operation 
to potentially 65% of the year operation in solar- 
storage operation [46]. 

The Solar Two power plant can generate up to 
10 MW e . Its storage tank has a capacity of 3 h and the 
capacity of the storage system is approximately 
115 MWh th . The efficiency of charging, storing, and 
discharging of this storage system is approximately 97%. 

The heliostat field used the Solar One heliostats of 
72,500 m 2 as well as 108 new heliostats of 95 m 2 each. 
The additional heliostats provided a flux distribution 
representative of a commercial receiver, eliminated the 
excessive morning start times common to Solar One, 
and provided additional energy for charging the ther¬ 
mal storage system. 
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Figure 26 

Solar Two receiver and tower, Dagett, CA (Source: Sandia) 


National Solar Thermal Test Facility The National 
Solar Thermal Test Facility at Sandia National Labora¬ 
tories, is located in Albuquerque, New Mexico. 

Figure 27 shows an aerial view of Sandia National 
Laboratories 5-MW t Central Receiver Test Facility, 
including 222 heliostats. The facility is used for testing 
of advanced Concentrating Solar Power systems for 
a variety of applications. 

CESA-1 CESA 1 is a test plant for solar power plants 
components as well as a flexible testing facility for large 
pieces of advances materials, as demonstrated since 
1988 in various test campaigns. Figure 28 shows 
a CESA-1 Tower distant view. 

The heliostat field focuses on the receiver at the top 
the CESA-1 tower. 

It consists of a field of 300 heliostat mirrors, able 
to generate up to 7 MW t on the receiver with 


energy densities of up to 3.5 MW/m 2 [47]. The main 
test piece is the receiver which is installed at any 
of the several levels of the 80-m tower prepared as test 
beds. 

Themis The 2.5 MW Themis prototype of solar 
tower and molten salt power plant technology was 
designed and erected in France in the early 1980s. 
This solar plant, as shown in Fig. 29, was successfully 
operated until 1986 [48]. 

In the last years, a completely new control system as 
well as a heliostat prototype has been implemented and 
tested on site. 

CSIRO A heliostat field of more than 800 m 2 reflector 
area has been installed in 2006 at the CSIRO solar tower 
in Newcastle, Australia (Fig. 30). 

In late 2010, a second heliostat field was commis¬ 
sioned adjacent to the existing field. It includes 450 
heliostats and has a thermal capacity of 1.2 MW t and 
will be used to demonstrate a 200 kW Brayton cycle gas 
turbine [49]. 

Solar Research Facility Unit The Solar Research 
Facility of the Weizmann Institute of Science (WIS), 
shown in Fig. 31, is located in Israel. 

Its major feature is a solar power tower containing a 
heliostat field of 64 large, multifaceted mirrors. Each 
heliostat has an aperture area of 56 m 2 . As can be seen 
in the figure, it has several levels where different 
receiver systems can be tested. 

Solar Tower Plant Jiilich (STJ) The Solar Tower 
Plant of Julich aims on the one hand in the production 
of electricity and in the other hand on research and 
development. The main principle of the STJ is analyzed 
in section “Solar Tower Plant Julich (STJ).” 

Regarding research, it is focused on two main 
branches: One is ensuring quality of existing technolo¬ 
gies by developing new test procedures and setting up 
a framework of norms. The second is the technical 
improvement of components and the development of 
new technical features. Examples of technical questions 
under research by the Solar-Institut Julich and the 
German Aerospace Center are [50]: 

• Improvements of solar-specific components, such 

as heliostat, heat storage, and absorber 
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National solar thermal test facility, Albuquerque, New Mexiko (Source: Sandia) 
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CESA-1, Plataforma Solar de Almeria, Spain (Source: Sandia) 
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The Themis solar tower power plant in the Eastern Pyrenees of southern France, showing the field sloping from right to left 
(Source: Centrale solaire Themis) 
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CSIRO test facility in Australia (Source: CSIRO) 


Optimization of operation and control. 
Adaptations of conventional components espe¬ 
cially for CRS, for example, heat recovery boilers 
specially designed for the heat input from 


solar and fossil fuels or highly efficient steam 
turbines. 

Hybridization of solar tower power plants with 
fossil fuels. 
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Concentrating Receiver Systems (Solar Power Tower). Figure 31 

Solar tower at the Solar Research Facility in Rehovot, Israel (Source: Weizmann Institute of Science (WIS)) 


Commercial 

Sierra SunTower Sierra SunTower, which is shown in 
Fig. 32, supplies 5 MW of clean, renewable energy to the 
grid. The full-scale power plant, the only commercial 
CSP tower in the USA, supplies electricity to Southern 
California Edison (SCE) and power up to 4,000 homes. 
It is operational since 2009 [51]. 

A field of 24,000 mirrors reflects solar heat to 
a thermal receiver mounted atop a central power tower. 
Each small heliostat has an aperture area of about 
1.14 m 2 . The Sierra plant has two different types of 
direct-steam, natural-circulation solar receivers in 
operation - a dual-cavity receiver and an external 
receiver [52]. The focused heat boils water within 
the thermal receiver and produces steam at 440°C and 
60 bar. The plant pipes the steam from each thermal 
receiver and aggregates it at the turbine, powering a 
power generator. The steam then reverts back to water 
through cooling, and the process repeats [53]. 

The project created over 250 construction jobs and 21 
permanent jobs. The 5 MW output of Sierra SunTower 
will reduce C0 2 emissions by 7,000 t/year [54]. 

PS 10 A land with a high number of installed solar 
power plants is Spain. The PS 10 solar tower was the 
first commercial solar tower in Europe (Fig. 33) and is 


located in Sanlucar la Mayor, 15 km west of the city of 
Seville. It is promoted by Solucar Energia, S.A., an 
Abengoa Group company. The project makes use 
of glass-metal heliostats, a pressurized water thermal 
storage system, and a saturated steam receiver and 
turbine. These technologies have been developed by 
European companies, and already tested and qualified 
at the solar test facility located at the Plataforma Solar 
de Almeria. 

The PS 10 plant has a solar field composed of 
624,120 m 2 heliostats with a mobile curved reflective 
surface that concentrate solar radiation on a receiver at 
the top of a 100 m tower [55]. The receiver, which 
produces 40 bar 250°C saturated steam from thermal 
energy supplied by the concentrated solar radiation 
flux, has a cavity design to reduce radiation and con¬ 
vection losses. 

It is formed by four vertical panels of 5.40 m wide 
by 12 m high, each one making up an overall heat 
exchange surface of about 260 m 2 . These panels are 
arranged into a semi-cylinder of 7 m radius. During 
operation at full load, the receiver receives a thermal 
power of about 55 MW of concentrated solar radiation 
with peaks of 650 kW/m 2 [16]. 

The steam is sent to the turbine, where it expands, 
producing mechanical work and electricity. The turbo 
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Concentrating Receiver Systems (Solar Power Tower). Figure 32 

eSolar tower power plant (Source: eSolar) 



Concentrating Receiver Systems (Solar Power Tower). Figure 33 

Bird's eye view of the PS10 (Source: NREL) 


generator output goes to a water-cooled 0.06-bar pres¬ 
surized condenser. The condenser output is preheated by 
0.8 and 16 bar turbine extractions. The output of first 
preheater is sent to a deaerator fed with steam from 
another turbine extraction. A third and last preheater is 


fed with steam coming from the receiver. This preheater 
increases the water temperature to 245° C. This flow is 
mixed with the flow of water returning from the drum, 
raising the temperature of water fed to the receiver to 
247°C [53]. 
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For cloud transients, the plant has a 20-MWh ther¬ 
mal capacity saturated water thermal storage in form of 
four storage tanks. During full-load plant operation, 
part of the steam produced by the receiver is employed 
to load the thermal storage system. When energy is 
needed to cover a transient period, the energy is recov¬ 
ered from the saturated water at 20 bar to run the 
turbine at 50% load. 

PS20 PS20 solar power tower plant is also located at 
the Solucar Platform, near Seville (Spain) near to the 
PS10 plant (Fig. 34). 

With a power capacity of 20 MW, the solar power 
plant will produce enough clean energy to supply 
10,000 homes, and will avoid the emission of approx¬ 
imately 12,000 t of C0 2 into the atmosphere. 

The plant with a Heliostat Solar-Field Aperture 
Area of 150,000 m 2 has the capacity to generate more 
than 40 GWh of energy each year. 

PS20 consists of a solar field made up of 1,255 
heliostats designed by Abengoa Solar [56]. Each helio¬ 
stat, with a surface area of 120 m 2 , reflects the solar 
radiation onto the receiver, located on the top of a 
165 m high tower, producing steam which is converted 
into electricity generation by a steam turbine [53]. 


The storage capacity is 1 h and a wet cooling is used. 
These enhancements include a higher-efficiency 
receiver, various improvements in the control and 
operational systems, and a better thermal energy stor¬ 
age system compared to PS 10. 

Gemasolar Gemasolar, as shown in Fig. 35, is the 
first commercial-scale plant in the world to apply cen¬ 
tral tower receiver and molten salt heat storage 
technology. 

The tower power plant is located in Fuentes de 
Andalucia, Seville, Spain and the expected net electrical 
annual production is 110 GWh. The 17 MW e plant will 
supply power to 25,000 homes and reduce atmospheric 
C0 2 emissions by more than 30,000 t a year. 

The surrounding heliostat field concentrates the 
solar radiation at the receiver placed at the top of a 
tower located in the center of the solar field. As men¬ 
tioned in [57] each heliostat with a total reflective sur¬ 
face of 115.7 m 2 has a galvanized steel structure with 35 
mirror elements. The heliostat field consists of 2,650 
heliostats and takes an area of 185 ha [58]. The tower 
has a height of 140 m and the total mirror surface is 
about 300,000 m 2 . The estimated annual operation 
hours will be 6,500 h with an amount of 15% 
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Aerial view of PS10 (at the bock) and PS20 (at the front) solar tower power plants in Sevilla, Spain (Source: Abengoa Solar) 
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Figure 35 

Gemasolar, tower view. Stand 2010 (Source: Gemasolar) 


hybridized with burning of natural gas. The reached 
capacity factor may exceed 70%. 

Through this receiver circulates a mixture of molten 
salts, which is heated by this radiation input and stored 
inside a tempered tank [57]. From there, hot salts are 
pumped to the steam generator system, where super¬ 
heated steam is produced to drive a conventional 
turbine-generator set, producing electricity that is fed 
to the net. 

The molten salt storage tank permits independent 
electrical generation for up to 15 h without any solar 
feed. The turbine can be operated at normal capacity 
when there is loss of or a reduction in solar radiation 
due to clouds. 

The salts are impelled from a “cold tank” to 
a receiver located at the top of a tower where they are 


heated to 565°C. The hot salts then descend to the heat 
exchanger and generate water steam [52]. 

Solar Tower Plant Jiilich (STJ) Germany’s first solar 
tower power plant for experimental and demonstration 
purposes has been constructed and is now in operation 
in the town of Juelich. The central receiver plant, with 
an open volumetric receiver, supplies the grid with 
a nominal power of 1.5 MW e . It is operational since 
December 2008 and started the production of electric¬ 
ity in spring of 2009. 

The construction of the facility was funded by the 
BMU and the Ministry of Economics of the federal 
states of NRW and Bavaria. Kraftanlagen Miinchen 
built the facility as prime contractor. The subsequent 
operation was carried out by the local utility 
Stadtwerke Jiilich. Scientific project partners are the 
Solar-Institut Jiilich (SIJ) and the German Aerospace 
Centre (DLR). Since Summer 2011 the STJ belongs to 
the DLR. 

The objective of the solar power tower project in 
Jiilich is to demonstrate the entire system. Figure 36 
shows a bird’s eye view of the STJ. The concentrator 
system consists of about 2,150 sun-tracking heliostats 
of about 8 m 2 reflective surface each. They follow the 
path of the sun and concentrate the solar radiation on 
a receiver that is installed at the top of a 60 m tower. 
The absorber is made of porous ceramic elements 
through which incoming ambient air flows. 

The applied open volumetric receiver technology is 
used to heat up ambient air to high temperatures of 
around 700° C and this heat is delivered for steam 
generation in a boiler of a conventional steam turbine 
cycle. The steam generated there drives a steam turbine, 
which produces power via a generator. 

The porous absorber located at the top of the tower 
traps the highly concentrated solar radiation inside the 
structure allowing the heat to be transferred to air very 
effectively. The receiver is made up from identical sub- 
receivers, in which the absorber modules can easily be 
exchanged in case of failure or to test innovative mod¬ 
ules [12]. 

The advantages of this technology are simplicity 
and scalability, the ability to include a thermal storage, 
the low thermal capacity (quick start-up), and 
a high efficiency potential due to high achievable 
temperatures. 
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Concentrating Receiver Systems (Solar Power Tower). Figure 36 

Bird's eye view of the STJ (Source: SIJ) 


Under Construction or Under Development 

A lot of solar tower power plants are under construc¬ 
tion or under development in the world, mainly in the 
USA. 

Ivanpah Solar Electric Generating Station (ISEGS) 
will be located in Primm, NV, California. The steam 
turbine will have a capacity of 370.0 MW. The tower 
will have a height of 140 m and the heliostat field 
will consist of 214,000 heliostats, each with a mirror 
area of 14.08 m 2 . As HTF water/steam will be used 
and a solar receiver steam generator will be 
implemented [56]. 

Crescent Dunes Solar Energy Project (Tonopah) 
will be installed in Tonopah, Nevada and start opera¬ 
tion in 2013. The receiver will be cylindrical and molten 
salt will be heated up by concentrated solar radiation. 
The steam turbine has a nominal power of 110 MW. 

As mentioned in [56] Gaskell Sun Tower (Gaskell), 
a 245 MW solar tower plant will be located in Lancaster, 
California and will start production in 2012. 

Rice Solar Energy Project (RSEP) is another CRS 
project in the USA. It will be installed in the Mojave 
Desert and shall use molten salt as a HTF. The total 
power capacity is planned to be 150 MW and it will 


have a total heliostat area of more than 1 million m 2 . 
The system will include a storage system and will start 
producing power by the year 2013. 

Additionally, two solar tower plants of 400 MW are 
under development in Nevada and three plants of 
600 MW in California. Further, some solar tower 
power plants are announced in Spain and in the rest of 
the world. 

Secondary Uses of CRS 

There exist many ways in which solar concentrated 
sunlight from a central receiver system may be used 
for the delivering of clean sustainable energy services. 

To these services belong solar chemical technolo¬ 
gies, desalination, and absorption cooling. 

Solar chemical technologies include the solar ther¬ 
mal production of fuels and the use of the solar thermal 
heat for chemical processes. 

Chemical processes include the melting of metals 
and the processing of cement and the recycling of 
hazardous solid waste materials. Recycling of hazard¬ 
ous wastes enables the reduction of exhaust gas streams 
and avoids additional generation of greenhouse gases 
and other pollutants. 
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Solar energy can be converted into storable and trans¬ 
portable chemical fuels such as hydrogen and methanol. 

Today’s production of hydrogen, its distribution, 
storage, and utilization are dependent upon conven¬ 
tional technologies, which are not commercially feasi¬ 
ble at this time. At the moment, more than 90% of 
hydrogen is produced using heat from fossil-fuels, 
mainly natural gas [59]. Today, the production of 
hydrogen by using of different renewable energies is 
investigated. If hydrogen is generated from concen¬ 
trated solar energy, it is a completely clean technology 
with no hazardous wastes. This is the vision outlined in 
the European Commission’s “European hydrogen and 
fuel cell roadmap,” which runs up to 2050. Solar towers 
are most appropriate for future large-scale production 
of solar fuels, because they can achieve the necessary 
high temperatures (>1,000°C) due to their high con¬ 
centration ratio. 

One solution of the direct concentrated solar pro¬ 
duction of hydrogen is accomplished at the 
“Plataforma Solar de Almeria”(PSA) in Southern 
Spain. The German Aerospace Center is also working 
on iron-based thermochemical cycle processes. Here, 
the reactive material - based on iron oxide - is rotatory 
oxidized and reduced again. Such a ceramic cylinder, 
which features many parallel channels and bears resem¬ 
blance to an auto exhaust catalyst, is heated in a solar 
receiver-reactor with concentrated solar radiation to 
about 800° C [60]. The water steam reacts with the 
coating, whereupon oxygen is bounded and hydrogen 
is released. In a second reaction step, the water vapor is 
turned off and the receiver-reactor is heated up to 
about 1,200°C. Thereby, the oxygen on the reactive 
coating escapes and the cycle begins again. 

This procedure has had, after initial pilot tests of the 
Institute of Technical Thermodynamics at the DLR in 
Germany, its successful dress rehearsal in November 
2008 on the Spanish Plataforma Solar de Almeria. 

Solar fuels such as hydrogen can be used in several 
ways; “upgrading” fossil fuels burned to generate heat, 
fed into turbines or engines to produce electricity or 
motion, or used to generate electricity in fuel cells and 
batteries. By storing energy in a fuel like hydrogen, it 
can be retrieved when needed, and is available even 
when the sun is not shining. 

One further chemical application of CRS is the 
reforming process. Research on solar steam reforming 


of natural gas has been conducted in the EU projects 
SCR, SOLASYS and the recent study SOLREF, all in 
which the German Aerospace Center participated. Fur¬ 
ther, solar concentrated radiation from a heliostat field 
may be used for the production of methanol or synthe¬ 
sis gas in solar reactors. Additionally, solid waste mate¬ 
rial may be treated by the use of concentrated sunlight. 
Waste materials containing carbonaceous compounds 
can be converted by solar concentrated energy into 
syngas and hydrocarbons that can be further processed 
into other valuable synthetic chemicals [61]. 

The solar tower technology can be used for desali¬ 
nation as well. There are different ways for the exploi¬ 
tation of the heliostat field for producing clean water 
(Fig. 37). 

Desalination systems can be powered from solar 
tower energy systems, either by producing the electric¬ 
ity for the supply of existing or planned RO systems or 
by using the produced heat for the distillation evapo¬ 
rators (MED) [63]. 

Figure 37 shows three different desalination systems 
combined with a CSP. The first desalination system, in 
combination with a solar tower (see Fig. 37, left), uses 
only the heat from the solar cycle for the supply of the 
MED system with heat and the electricity is provided by 
the grid. The MED process usually operates as a once 
through system without a large mass of brine 
recirculating around the plant. This design reduces 
both pumping requirements and scaling tendencies [64] . 

In the second system, the solar tower provides only 
the necessary electricity for the reverse osmosis (RO) 
unit (see Fig. 37, middle). The RO plant uses electricity 
generated by a solar thermal power block to work 
the pumps. Theoretically, the only energy requirement 
is to pump the feed water at a pressure above the 
osmotic pressure. In practice, higher pressures must 
be used, typically 50-80 atm., in order to have a suffi¬ 
cient amount of water pass through a unit area of 
membrane [65]. 

Finally, the last MED system (see Fig. 37, right) 
includes the conventional cycle for the supply of the 
desalination system with heat and electricity. 

These three systems may find application in islands 
and coastal areas which face water shortage problems. 
Cogeneration of electricity and fresh water would 
probably work best with higher temperature levels pro¬ 
vided by a solar tower system. 
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Concentrating Receiver Systems (Solar Power Tower). Figure 37 

Desalination systems combined with solar tower plant (Source: [62]) 


Desalination plants have to be operated continu¬ 
ously. Therefore, a solar tower power plant should be in 
operation day and night. This is only possible with an 
additional storage unit or with the hybridization of the 
plant [66]. The choice of which system is to apply for 
a particular case should depend on the requirements of 
each commodity and the characteristics of the load and 
water demand [67]. 

Well-proofed thermally driven cooling technologies 
that can use the heat provided by a solar tower for cold 
production are absorption and adsorption chillers. 
Absorption cooling uses a liquid sorption material or 
a solid material. Solar thermal energy from a CRS may 
change the refrigerant into a vapor. A solar tower may 
also be combined with an adsorption chiller in order to 
provide heat for cooling. 

An adsorption chiller uses a solid sorbent material, 
such as silica gel or zeolith, to adsorb the cooling 
medium water on its surface, which is a process that 
releases latent heat. Adsorption chillers consist of two 
chambers filled with sorbent, an evaporator, and 
a condenser (Fig. 38). 

The two chambers work simultaneously, 
accomplishing their individual tasks. In the first cham¬ 
ber, the water, bounded to the silica gel, is driven out 
under heat supply, for example, provided by the heat 
from the steam cycle of a solar tower plant. Water is 
liquefied in the condenser and the heat is removed by 
the cooling water. The condensed water is injected into 


the evaporator and evaporated. The water vapor is then 
passed to the other chamber, where the sorbent adsorbs 
it. Heat resulting from the adsorption processes in 
dissipated to the cooling water of the second chamber. 
The evaporator works at a low pressure and the refrig¬ 
erant is evaporated as it is absorbing the evaporation 
heat from the chilled water loop. This leads to the 
production of useful chilled water [28]. 

Economical Considerations 

Investment Costs 

Of the current CSP technologies, central receiver 
(power tower) systems have been recognized as having 
the highest potential for producing baseload power 
with a capacity factor above 75% at a lower levelized 
cost of energy. 

The solar field, electric power block, and receiver 
encompass in general, as depicted also by [68], approx¬ 
imately three fourth of the total direct costs. 

The heliostat field (materials plus labor) represents 
the largest single capital investment in a power tower 
plant, and thus represents the greatest potential for cost 
reductions [31]. The heliostat field of a solar tower 
power plant makes up roughly 50% of the investment 
costs. For installing the heliostat field, expected costs 
range from 180 to 250 €/m 2 for small production runs 
in the USA, and from 140 to 220 €/m 2 in Europe [69]. 
As a result, it is important to press on reducing the cost 
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Concentrating Receiver Systems (Solar Power Tower). Figure 38 

Adsorption chiller principle (Source: Solair) 


of heliostats as far as accomplishable in order to 
improve and strengthen the economic viability of the 
solar power tower technology [6]. 

Operation and maintenance costs include plant 
operation, fuel expenses in the case of hybridization 
or backup, feed and cooling water, and field mainte¬ 
nance costs. Investment and operating costs need wider 
scale proof in commercial operation. 

LEC 

The project’s analysis of different solar-hybrid gas 
turbine systems found that for a 16 MW (electrical) 
installation operating at 800°C, the investment cost 
would be 1,440 €/kW, with an electricity generation 
cost of 0.057 €/kWh at 16% annual solar share. With 
an operating temperature of 1,000°C, a solar share 
above 50% was found to be achievable, although the 
electricity generation cost would then rise to €0.086/ 
kWh [16]. 

As estimated in [70], hybrid systems offered at the 
beginning of this century estimated costs of electricity 
production from the solar portion of $0.08-$0.15/ 
kWh, whereas solar-only plants were expected to have 
costs in the range of $0.15-$0.20/kWh. 


The other technologies analyzed are currently planned 
in significantly smaller pilot scale of up to 15 MW e . The 
LEC is significantly higher for these small systems ranging 
from 19 to 28 cents/kWh. Assuming that several of the 
smaller systems are built at the same site to achieve a 
power level of 50 MW and take benefit of a similar 
O&M effort as the larger plants, LEC estimates of all of 
the systems also range between 15 and 20 cents/kWh. In 
[71] similar values were obtained. The systems achieve a 
solar capacity factor of up to 30% under these conditions. 

Estimates range from €0.135-€0.218/kWh 
depending on design and location and it is expected 
that these costs will fall relatively quickly [72]. 

To nearly similar results comes an estimation 
presented in [59] . The cost of generating electricity from 
concentrated solar power currently ranges from approxi¬ 
mately 15 €cents/kWh (USD 0.19) at high solar irradia¬ 
tion (DNI) sites up to approximately 23 €cents/kWh 
(USD 0.29) at sites with low average solar resource. 

Future Cost Development 

In combination with highly efficient combined cycle 
systems or recuperated gas turbines, significant cost 
reductions for solar electric power generation can be 
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achieved (predicted LEC of 0.069 €/kWh at 50% solar 
share, specific investment cost of 1,410 €/kW). 

Scenario studies suggest that CSP and therefore also 
CRS could achieve cost competitiveness with coal and 
gas between 2030 and 2045 [73]. 

The German Advisory Council on Global Change 
of the government (Wissenschaftlicher Beirat der 
Bundesregierung fur Globale Umweltveranderungen, 
WBGU) presented a large study in 2007 [74] and 
Table 1 shows the foreseen development of CRS from 
this study. 

Regarding to the results of the WBGU study, cost of 
CRS may be around 10 cents by 2020 and drop under 
0.10 €/kWh by 2050. 

With increased plant sizes, better component pro¬ 
duction capacities, and more suppliers and improve¬ 
ments from R8d3, costs are expected to fall between 
10 and 14 €cents/kWh (USD 0.15-0.20) by 2020 [59]. 

Besides the estimation of further price drops, 
the gap with generation costs from conventional 
fuels is expected to decrease rapidly due to increased 
prices of conventional fuels at world markets. 
The competitiveness with mid-load, for example, gas- 
fired plants, might be achieved by the year 2020. 

In the future, central receiver plant projects will 
benefit from similar cost reductions to those expected 
from parabolic trough plants. The expected evolution 
of total electricity costs, according to the World Bank, is 
that they will drop to 8-7 €cents/kWh in the medium 
term (100 MW e Rankine-cycle plant or 100 MW e 
ISCC, both with storage) and to 5 €cents/kWh in the 
long-term period (200 MW e Rankine-cycle plant with 
storage) for high insulation sites with an annual DNI of 
more than 2,700 kWh/m 2 [69]. 

Increases in installed capacity along with ongoing 
technology enhancements will result in lower LCOEs 

Concentrating Receiver Systems (Solar Power Tower). 


Table 1 Foreseen development of CRS according to [74] 


2020 

2050 

Efficiency 

(electric) 

20-25% 

25-30% 

Costs (€/kWh) 

0.07-0.14 

Ca. 0.06 

Power level per 
unit 

Many 10-100 MW 

Many 10-100 MW 


and useful operational information that can be used to 
guide future R&D activities [72]. 

Regarding secondary applications of CRS, for 
example, the projected costs of hydrogen produced by 
solar tower and electrolysis range from 15 to 20 US 
cents/kWh, or USD 5.90-7.90/kg H 2 (assuming solar 
thermal electricity costs of 8 US cents/kWh e ). Both the 
US Department of Energy and the European Commis¬ 
sion have a clear vision of the future hydrogen econ¬ 
omy, with firm targets for hydrogen production costs. 
The US target for 2017 is USD 3/gge (gasoline gallon 
equivalent; 1 gge is about 1 kg H 2 ), and the EU target 
for 2020 is €3.50/kg [75]. 

Future Directions 

Many studies dealing with the composition of future 
energy mixes come to the conclusion that solar concen¬ 
trated thermal power will be increasingly important. 

In 2008, the German Ministry for the Environment, 
Protection of Nature and Nuclear Safety (BMU) 
published a guiding study on the “Continued develop¬ 
ment of the strategy for the expansion of renewable 
energies.” This study focuses on the German energy 
market and on how to reach the government’s goal to 
reduce greenhouse gases to 20% of the 1990 level by 
2050. Solar thermal power plants like solar tower 
are considered as one significant contributor to this 
aim [76]. 

A study for Greenpeace in [77] comes to the con¬ 
clusion that in the next 5 years (2015), in the conser¬ 
vative model, the annual commissioning of solar 
thermal power plants may reach a capacity of 
566 MW and in the ambitious scenario 6,814 MW/ 
year. In these installed capacity, solar towers play a 
significant role. 

A further study prepared by Greenpeace Interna¬ 
tional and the European Solar Thermal Power Industry 
Association comes to the result that by 2015 the total 
installed capacity of solar thermal power plants will 
have passed 5,000 MW. By 2020, additional capacity 
would be rising at a level of almost 4,500 MW each year 
where solar tower plants are included. At the end of the 
scenario period, the total installed capacity around the 
world will have reached 21,540 MW. This Greenpeace - 
ESTIA scenario considers that in 2040, CSP plants 
could total 630 GW e . The scenario of the study 
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presented in [69] also shows how much electricity 
would be produced by solar thermal power plants. 
This is based on the assumption that 1 MW of capacity 
produces 2,500 MW hours of electricity per annum. By 
2020, solar thermal power will have achieved an annual 
output of more than 54 TWh [69]. 

According to the IEA [78], before 2030 some 4,700 
GW e of total power capacity is expected to be built 
worldwide, either as additional capacity or in replace¬ 
ment of existing capacity. 

Cheap, safe, and environmentally friendly electric¬ 
ity from concentrating solar power systems could meet 
about 15% of European power needs by 2050. This was 
confirmed by a study prepared by the German Aero¬ 
space Center on behalf of the German Federal Environ¬ 
ment Ministry [79]. 


The Desertec concept (Fig. 39), by considering the 
results of this study envisions solar power stations in 
North Africa and the Middle East providing electricity 
to those regions and to Europe in a way that is sustain¬ 
able and cost-effective. 

A variety of international companies participate in 
this Desertec Initiative. They have set themselves the 
goal of 2012 to the smallest detail to clarify whether the 
project is feasible. A long-term objective is to be able to 
meet a considerable part of the increasing electricity 
demand of MENA countries and, in addition to that, to 
cover about 15% of Europe’s energy demand with clean 
like solar thermal power from deserts by the year 2050. 

The potential for improvements is probably great 
for solar tower technologies. With two dimensions of 
concentration, such a technology can reach higher 
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The Desertec concept (Source: Desertec Foundation) 
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temperature and offer higher conversion efficiencies. 
Concentrating solar power technologies would become 
competitive in the world’s solar belt when a concerted 
effort for building about 5 GW e worldwide has been 
made. 

Improvements include all major components of 
a solar tower: heliostat, receiver, storage, and conven¬ 
tional cycle components. 

Regarding the size of the heliostats, different 
approaches will continue: whether in large scale 
(around 100-120 m 2 ), medium size (30-50 m 2 ), small 
scale (1-10 m 2 ), or micro scale (less than 1 m 2 ). An 
increase of the size of a heliostat yields to a decrease of 
the control requirements. Small heliostats enable lower 
foundation and installation costs as well as low cost of 
the tracking system. On the other hand, such 
a development toward small scale should consider pos¬ 
sible higher operation and maintenance costs resulting 
for many heliostat components. 

Also the chemical composition of new steel prod¬ 
ucts with different percentage of carbon and stainless 
steel may lead to better material for tower and heliostat 
optimization design. 

At the receiver development, it is sure that the 
future development will be toward higher operating 
temperatures in order to achieve better thermal effi¬ 
ciencies. Main alternatives for solar tower receivers 
currently under development include the use of the 
following HTF: superheated steam, molten salt, and 
air. Due to the higher operating temperatures, mate¬ 
rials with better thermal and mechanical properties will 
be tested in order to be applicable in the future receiver 
layouts. 

In a high flux receiver, radiated materials of the 
absorber are exposed to extreme working conditions. 
These include high temperature, hot spots, and high 
temperature gradients. Material selection through lab¬ 
oratory and field studies is a critical issue to guarantee 
receiver integrity during their whole design life. Possi¬ 
ble degradation expected in CRS occurs related to the 
thermal cycle, high temperature operation, and/or 
severe temperature changes and those due to the HTF 
exposure. The primary consequence of the thermal 
cycling is the occurring of degradation due to low 
cycle fatigue and thermal fatigue. In addition, thermal 
aging during the receiver life time modifies microstruc¬ 
ture. Such modifications have direct impact on the 


mechanical properties of the receiver. In the future, 
an increase of scientific work and of investment in 
these optimization fields is expected. 

Molten Salt is used as a HTF in a solar tower. 
Further research in new chemical composition of the 
salts like different qualities of mix binary and tertiary 
salts may enable higher working temperature. 

High-temperature tower concepts also include 
atmospheric air as the heat transfer fluid (tested in 
Germany with the Jiilich solar tower project) with 
solid material storage. Solar-to-electricity efficien¬ 
cies of up to about 25% can be delivered by such 
towers [80]. 

Multi-tower configuration appears to present 
a further promising alternative. Such a concept consists 
of installing more than one tower for the same steam 
turbine and for the same heliostat field. This results in 
an increase of tower efficiency by the reduced distance 
between heliostats and towers [81]. However, such 
systems may be correlated to higher pressure drops 
and thermal losses of the piping system. 

Another future step will be the path toward con¬ 
struction of large CSR units. Scaling reduces unit 
investment cost, unit operation, and maintenance 
costs, and increases performance [82]. 

Currently, there exist projects, mainly in the USA, 
for large-scale solar thermal tower power plants. Such 
large units have unique requirements and a special 
feature is the long-term test run, which includes some 
functional tests but also a 4-seasons-test run. This 
testing is not typical for conventional power plants 
and requires new models to formulate warranty condi¬ 
tions. During this phase, electrical power is produced 
but the contractor is still on site in order to witness the 
test run and to reach final acceptance of the plant [83]. 

Even if the early deployment of solar tower systems 
now has higher costs of immediate emission reductions 
than other options, this deployment must be under¬ 
taken if the cost reductions it drives are key to future 
large-scale deployment. As mentioned in [84], the early 
deployment of renewable energy technologies like CRS 
is a cost-effective measure for long-term climate change 
mitigation, even if it looks too costly when only short¬ 
term reductions are considered. 

The increase in annually installed power plant 
capacity means entry into a period of distinctly more 
cost-efficient mass production of components [85]. 
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Future aims will be to improve the manufacturabil¬ 
ity, focusing on mass production, assembly, and cost 
reduction, ensuring that the heliostat has optical, 
mechanical, and thermal properties, equal to or better 
than already existing structure designs. 

Advanced technologies, mass production, econo¬ 
mies of scale, and improved operation will together 
enable a reduction in the cost of solar electricity to 
a level competitive with other power sources within 
the next 10-15 years. This will, as outlined in [69], 
reduce dependency on fossil fuels and avoid the risk 
of future electricity cost escalation. 

CRS industry has to realize the cost reduction 
potentials in component manufacturing (mirrors and 
absorbers), collector and plant assembly, system inte¬ 
gration and design. 

According to the ECOSTAR study, cost reductions 
of 25-35% due to technical innovations and scaling up 
to 50 MW e are feasible for most of the technologies. 
These figures do not include effects of volume produc¬ 
tion or scaling of the power size of the plants beyond 
50 MW unit size, which would result in further cost 
reductions [82]. 

Solar tower power plants need to be built in areas of 
high direct solar radiation, which generally translates 
into arid, desert areas where water is a scarce resource. 
In [86] , it was verified that a typical power tower power 
block that employs wet cooling requires approximately 
2,500 L of water to produce 1 MWh of solar electricity. 
Although plants in the near future will probably be able 
to secure the necessary water rights for wet cooling, the 
issue of water use will likely require CRS to transition 
to dry cooling. Thus, it is possible that dry cooling 
technology of the power block will be a necessity for 
power towers to become a major provider of solar 
electricity in the future. In these fields, in the future, 
the demand for investments and research activities will 
rise for many companies. 

Major future goals of the future development of 
CRS are to try to find the right balance between high 
thermal efficiency, low pressure drop, high durability, 
and low cost of the components of a solar tower. 
Improved operating procedures will be developed to 
provide more reliable operation and to raise the capac¬ 
ity factor of a solar tower plant. Use of efficient Rankine 
cycles and construction of new steam generator, which 
are extra developed for solar application, enable the 


rising of the efficiency of a CRS. Given a fixed plant 
size and capacity factor, the net annual solar-electric 
efficiency sets the required heliostat field area. As the 
efficiency increases, the collector area and cost decrease 
in proportion [56]. 

Future solar-only solar tower plants have good 
long-term perspective for high conversion efficiencies 
and for use of very efficient energy storage systems by 
utilization of high temperatures in order to enlarge the 
solar capacity or solar share. 

Storage Systems are a second key factor for cost 
reduction of solar power plants. 

Development needs are very much linked to the 
specific requirements of the systems in terms of the 
used heat transfer medium and the required tempera¬ 
ture. In general, storage development needs several 
scale-up steps generally linked to an extended 
development time before a market acceptance can be 
reached [82]. 

Another significant key issue is the ability to be 
combined with hybrid systems by firing of conven¬ 
tional fuels or even biofuels like biogas. In addition, 
they have the potential to be applied to other high- 
temperature heat processes like process heat, solar 
chemical processes, and desalination. 

As modified in [70], already at the beginning of 
this century, implementation of hybridized systems 
is one of the paths to break the nontechnological finan¬ 
cial barriers to deploy solar electric technologies, 
which enable the reduction of the initial 
investment. Hybrid solar-fossil fuel plants making use 
of special finance schemes at favorable sites can, 
according to [69], already deliver competitively priced 
electricity. 

The commercial confidence in this technology is 
growing as more operational plants are being built 
and consequently it will improve in the near future. 
With a concentration factor of up to 1,000 suns, power 
towers have much greater development potential in 
terms of cost-effectiveness [87]. In this new century, 
concentrated solar technologies like CRS will play 
a significant role in providing sustainable energy and 
heat for different applications. But always economical 
and environmental considerations will play a key factor 
when the appropriate technology, which covers the 
permanent rising demands for energy and water avail¬ 
ability, has to be chosen. 
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Glossary 

Deconstruction Systematic dismantling of a structure 
for reuse and recycling. 

Embodied energy Energy employed in the manufac¬ 
ture of a product. Sometimes referred to as 
“emergy.” 

Recycled aggregate (RA) Construction and demoli¬ 
tion waste containing a range of different 
inorganicnonmetallic constituents processed for 
use as aggregate. 

Recycled concrete aggregate (RCA) aggregate derived 
from processing end- of-life concrete. 

Reclaimed asphalt pavement (RAP) Also known as 
recycled asphalt planings, is an asphalt material 
reclaimed by planing or milling asphalt from road 
surfaces. The material can be recycled in hot or cold 
mix asphalt. 
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Definition of the Subject and Its Importance 

Construction and demolition waste comprises a wide 
variety of materials including concrete, metals, timber, 
ceramics, soil, plaster, asphalt, and polymers which 
arise either during construction, renovation, or demo¬ 
lition activities. 

National arisings of construction and demolition 
waste typically make up a large proportion of most 
countries’ total waste - typically between 15% and 30%, 
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but sometimes higher. As a result, it is often identified as 
an area in which great potential exists for enhancing 
sustainability through reduction, reuse, and recycling. 
The past 20 years have seen significant growth in the 
recycling and reuse of construction and demolition 
waste, partly driven by legislation, taxation, and increas¬ 
ing scarcity of landfill space. As such activities grow, 
the importance of adopting the most sustainable 
approaches to managing this waste has started to be 
recognized as the key to moving toward a wholly sus¬ 
tainable construction industry. 

Introduction 

The construction process involves manipulating mate¬ 
rials to form structures, coupled with the excavation 
and removal of soil and other materials. As a result, it 
will frequently produce material which is surplus to 
requirement. Moreover, once a structure reaches the 
end of its useful life, it must eventually be demolished, 
creating a quantity of waste. 

The magnitude of arisings of construction and 
demolition is often disproportionately large in com¬ 
parison to the extent to which the sector contributes 
toward economic growth, and there is a strong argu¬ 
ment for concluding that this situation is unsustainable 
in both economic and environmental terms. The rec¬ 
ognition of this has grown over the last 30 years, with 
developments both in terms of legislation and policy to 
reduce construction and demolition waste and also in 
terms of the development of procedures and technol¬ 
ogy which make recycling of end-of-life construction 
materials more viable. 

This chapter examines the nature of construction 
and demolition waste and the mechanisms which cause 
it to arise. It then explores some of the options available 
(both established and developing) for reducing the 
amounts of waste deposited in landfill before 
discussing some of the environmental hazards that 
must be addressed when processing and using such 
materials. 

Construction and Demolition Waste 

Estimates of quantities of construction and demolition 
waste arising globally are, by their nature, likely to be 
imprecise. One attempt to do this arrives at a value of 
around 1.42 billion tonnes of material [ 1] . This is likely 


to be an underestimate, given that the total estimated 
amount of construction and demolition waste deriving 
from a limited range of countries in Table 1 yields 
a value close to this. However, the actual total is likely 
to be of such an order of magnitude. 

The manner in which this is distributed around the 
world is partly dependent on economic activity, but 
while there is certainly some correlation between eco¬ 
nomic activity and the rate of production of construc¬ 
tion and demolition waste, other factors also have 
a strong influence. This is illustrated in Fig. 1, which 
plots construction and demolition arisings versus the 
economic activity of the construction sector for 27 
European Union member states (plus Norway) and 
indicates a far-from-strong correlation. Nonetheless, 
it is generally the case that the bulk of construction 
and demolition waste arises in the more economically 
developed world [15]. 

More insight into economic influences can be 
obtained by plotting the proportion of construction 
and demolition waste as a percentage of total waste 
against the percentage contribution of the construction 
sector to gross domestic product for member states of 
the European Union. This is shown in Fig. 2, which 
indicates that many member states produce construc¬ 
tion and demolition waste at disproportionately higher 
rates than their construction sector contributes toward 
GDP. Such an outcome hints at a highly unsustainable 
situation. 

It is also worth noting the countries where the 
percentage of construction and demolition waste is 
proportionately lower than the sector contribution to 
GDP: Bulgaria, Estonia, Latvia, Lithuania, Poland, 
Romania, Slovakia - the seven member states with the 
lowest GDP in the studied group. The implication of 
this is significant - it is not so much the case that a high 
GDP equates to high levels of construction and demo¬ 
lition waste. Rather, high GDP offers the opportunity 
to be wasteful, or possibly more accurately, low GDP 
restricts it. 

Thus, it is evident that high arisings of construction 
and demolition waste are by no means an inevitable 
side effect of an active economy. For instance, Sweden, 
whose GDP per capita is in the top six of EU member 
states, makes a construction and demolition waste 
contribution to total waste which is nearly at parity 
with its construction sector’s contribution to GDP. 
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Construction and Demolition Wastes. Table 1 Estimated annual arisings of construction and demolition waste from 
a range of countries 


Country 

Annual arisings, tonnes 

Year 

Reference 

Australia 

12,345,178 

2006 

[2] 

Austria 

31,321,626 

2006 

[3] 

Belgium 

13,089,649 

2006 

[3] 

Brazil 

62,143,200 

2004 

[4] 

Bulgaria 

1,023,303 

2006 

[3] 

Canada 

2,816,528 

2002 

[5] 

China 

2,185,445 

2004 

[6] 

Cyprus 

306,846 

2006 

[3] 

Czech Republic 

8,379,851 

2006 

[3] 

Denmark 

5,802,367 

2006 

[3] 

Estonia 

717,105 

2006 

[3] 

Finland 

23,145,713 

2006 

[3] 

France 

358,878,312 

2006 

[3] 

Germany 

196,536,165 

2006 

[3] 

Greece 

6,829,161 

2006 

[3] 

Hungary 

3,045,335 

2006 

[3] 

India 

13,154,400 

2005 

[7] 

Ireland 

16,599,467 

2006 

[3] 

Italy 

52,315,620 

2006 

[3] 

Japan 

680,400 

2006 

[2] 

Latvia 

19,340 

2006 

[3] 

Lithuania 

348,967 

2006 

[3] 

Luxembourg 

6,774,547 

2006 

[3] 

Malaysia 3 

<1,807,525 

1998,2001 

[8,9] 

Malta 

2,492,522 

2006 

[3] 

Netherlands 

56,609,820 

2006 

[3] 

New Zealand 

572,400 

1997 

[10] 

Norway 

1,247,936 

2006 

[3] 

Poland 

14,141,031 

2006 

[3] 

Portugal 

3,607,232 

2006 

[3] 

Romania 

33,870 

2006 

[3] 

Slovakia 

916,227 

2006 

[3] 

Slovenia 

994,887 

2006 

[3] 

Spain 

47,323,392 

2006 

[3] 

Sweden 

8,943,362 

2006 

[3] 
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Construction and Demolition Wastes. Table 1 (Continued) 


Country 

Annual arisings, tonnes 

Year 

Reference 

Taiwan 6 

2,400,000 

2001 

mi 

Thailand 

997,920 

2002-2005 

[12] 

UK 

109,545,988 

2006 

[3] 

USA 

123,000,000 

1996 

[13] 

Vietnam 

773,515 

1997-1999 

[14] 

TOTAL 

1,192,058,626 




Construction and demolition waste + industrial waste 
Concrete only 



Construction and Demolition Wastes. Figure 1 

Mass of construction and demolition waste per capita versus economic activity in construction for 27 European Union 
member states plus Norway (Derived from Eurostat data for 2006 [3]) 


This efficient performance has partly been attributed 
to the implementation of taxation and legislation 
in Sweden, including a landfill tax and the introduction 
of a stringent Environmental Code whose aim was 
to promote sustainable development. However, it 
should be stressed that social and cultural aspects may 
also play a role - the UK operates a landfill tax at 
a comparable level, and yet performs much less 
impressively. 

Production of waste on the scale seen for the con¬ 
struction sector clearly presents a significant obstacle in 


moving toward sustainable development. Materials 
leaving construction sites and sites of demolished 
structures for landfill were once primary raw materials, 
and extraction of further raw material is required to 
replace material lost from the system. This, in turn, 
requires additional energy and resources for 
processing, manufacture and transport, which gener¬ 
ates pollutants and further waste. Furthermore, the 
disposal of waste in landfill or through combustion 
reduces the quality of land resources and presents addi¬ 
tional pollution problems. 
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Construction and Demolition Wastes. Figure 2 

Construction and demolition waste expressed as a percentage of total waste versus the percentage contribution to 
GDP from the construction sector for 27 European Union member states plus Norway (Derived from Eurostat data for 
2006 [3]). Dashed line is the line of equivalence 
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Construction Waste 

Construction waste is waste derived from the construc¬ 
tion process itself. It has been proposed that this waste 
can be divided into two categories based on the types of 
events that generate it: accidental and predictable 
wastes [16]. 

Accidental waste is produced unintentionally on¬ 
site as a result of accidents and poor management of 
materials. It can include breakages, materials exposed 
to the elements in a way which degrades them and 
spoiled materials arising from inadequate supervision 
and training of operatives. 

Predictable waste is the inevitable product of 
a specific construction project which can be anticipated 
from the construction procedures devised during the 
design and planning stages. This can include excavated 
soil, cutoffs, and concrete residues washed out of mixers. 

Due to the manner in which construction waste 
arises, there is usually more scope for allowing the 
sorting of wastes. The presence of skips for individual 
material types on construction sites is now a common 


site in many countries. In many cases, skips are pro¬ 
vided by recycling companies. 

Demolition Waste 

By its nature, the vast majority of construction and 
demolition waste derives from demolition. The demo¬ 
lition of a structure is typically a relatively destructive 
process, regardless of what technique is employed, 
meaning that the resulting rubble consists of 
a complex mixture of a range of different materials. 
This presents problems when reuse or recycling of 
demolition waste is an objective, although, as will be 
discussed, sorting of the material is possible, albeit with 
some compromise in terms of the efficiency of isolation 
of individual material components. 

The problems related to conventional demolition 
have led to attempts to promote the “deconstruction” 
of structures - the careful, systematic disassembly of 
structures with as little damage as possible [17]. Partial 
deconstruction of buildings is encountered relatively 
frequently, but is usually limited to high-value 
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materials such as copper. For such an approach to be 
successful on a much wider scale, design of structures 
with future disassembly in mind is necessary. Thus, 
while it is likely that the deconstruction is likely to 
become increasingly common, it is likely to be some 
time before the process becomes commonplace. 

Another source of waste other than from construc¬ 
tion and demolition is during renovation of structures. 
Since renovation involves the removal of existing mate¬ 
rial and replacement with new materials, the nature of 
the resulting material shares characteristics of both 
demolition and construction waste. However, because 
the removal of material is likely to be conducted in 
a very controlled manner, in many cases the opportu¬ 
nity for sorting of waste is high. 

Composition of Construction and Demolition 
Waste 

The materials that make up construction and demoli¬ 
tion waste can be divided up into many different clas¬ 
sifications based on composition, physical properties, 
and original use. Here, three classifications are used - 
“metallic,” “organic,” and “mineral” - since the waste 
management options available for different materials 
within each classification are often, but not always, 


similar. These classifications are discussed below. Esti¬ 
mates and measured quantities of each type of material 
classification from selected literature are shown in 
Table 2. 

It should be stressed that the composition of con¬ 
struction and demolition waste will vary from location 
to location depending on the nature of the building 
stock in a region and local traditions and trends 
with respect to the use of materials for construction. 
This also means that the nature of construction and 
demolition waste will change with time. An analysis of 
the changing nature of materials used in construction 
in the US identified essentially parallel increases in the 
use of most materials over the last 100 years. However, 
the rate of growth in the use of plastics and aluminum, 
which only started being used for construction in the 
twentieth century, has been higher than all other mate¬ 
rials [26]. 

The Metallic Fraction 

Metallic materials encountered in construction and 
demolition waste depend on the nature of the source 
and, in the case of demolition waste, the age of the 
structure. They are likely to include ferrous metals, 
copper, lead, aluminum, and zinc [18]. Table 2 shows 


Construction and Demolition Wastes. Table 2 Composition of construction and demolition waste from a range of 
locations 


Data type 

Location 

Material type (mass%) 

Metallic Organic 

Mineral 

Others 

Measured 

Switzerland, 1991 [18] 

0.2 

11 

56 

32.8 


Switzerland, 1993 [18] 

3.1 

24.7 

72.1 

0.1 


USA, 1995 (construction, residential) [12] 

2 

44 

39 

15 


USA, 1997 (demolition, residential) [12] 

2 

42 

24 

32 


USA, 1997 (demolition, nonresidential) [12] 

5 

16 

79 

0 


Austria, 2000 [19] 

0.2 

2.8 

79 

18 


Taiwan, 2001 [20] 

5.5 

13.4 

81.1 

- 


San Francisco, USA, 2004 [21] 

15.5 

20.6 

33.4 

30.5 

Estimated 

UK, 2000 [22] 

7 

20.3 

71.8 

0.9 


USA, 2003 [23] 

1-5 

21-35 

46-70 

- 


Florida, USA, 2007 [24] 

2.6 

21.6 

65.9 

9.9 


Massachusetts, USA, 2008 [25] 

5 

36 

30 

29 
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that the metallic fraction normally makes up the 
smallest proportion of construction and demolition 
waste - usually below 10% by mass. 

The Organic Fraction 

Here “organic” refers to the chemistry of the materials, 
and thus includes any materials whose major constitu¬ 
ents are organic compounds. 

Organic materials likely to occur in construction 
and demolition waste include timber, paper, organic 
polymers, and bituminous roofing. 

During the construction process, much of the 
organic fraction is likely to be packaging waste in the 
form of plastic and paper packaging and timber pallets. 
Table 2 shows that, while the proportion of organic 
fraction varies from source to source, in the majority 
of cases, it is usually present in construction and demo¬ 
lition waste at between 20% and 30% by mass. 

The use of plastics in construction is dominated by 
polyvinyl chloride (PVC), which is used in pipes and 
ducts, window frames, and flooring. Additionally, 
high- and low-density polyethylene (HDPE and 
LDPE), polypropylene, polystyrene, polyurethanes, 
and acrylonitrile butadiene styrene are all encountered 
in construction products (Table 3). 


Construction and Demolition Wastes. Table 3 Polymers 
typically encountered in construction and demolition waste 


Polymer 

Acronym 

Application 

Polyvinyl chloride 

PVC 

Pipes/ducts 



Window frames 



Flooring 

High-density 

polyethylene 

HDPE 

Pipes/ducts 

Low-density 

LDPE 

Pipes/ducts 

polyethylene 


Sheeting 



Packaging 

Polypropylene 

PP 

Pipes/ducts 

Polystyrene 

PS 

Insulation 



Packaging 

Polyurethane 

PU 

Insulation 

Acrylonitrile 
butadiene styrene 

ABS 

Pipes/ducts 


The Mineral Fraction 

The mineral fraction includes concrete, fired-clay 
ceramics, soil, gypsum plaster, glass, and masonry - 
the nonmetallic, inorganic constituents of construction 
and demolition waste. 

In terms of mineralogy, the mineral fraction 
commonly contains large quantities of quartz, along 
with calcite, dolomite, and phyllosilicates such as 
chlorite, muscovite, and illite [27]. As a result, the 
main elemental constituents, other than oxygen, are 
calcium, silicon, and aluminum. Figure 3 shows the 
chemical composition of a selection of mineral frac¬ 
tions from a UK-wide sampling of demolition waste 
plotted in the form of a Ca0-Si0 2 -Al 2 0 3 ternary dia¬ 
gram. The material is generally limited to a narrow 
compositional “corridor” that runs from the CaO- 
rich corner of the plot to the Si0 2 -Al 2 0 3 axis. These 
limits to composition are not particularly surprising - 
aggregates and stone masonry will typically occupy the 
top Si0 2 -rich corner of the diagram, while the compo¬ 
sition of clay soil, slate, and most brick will be limited 
to a region on the Si0 2 -Al 2 0 3 axis, richer in Si0 2 than 
A1 2 0 3 . Limestone aggregate and masonry occupy the 
CaO-rich corner of the system, while Portland cement, 
deriving from a raw material feedstock of limestone 
and clay (or similar), occupies a position between the 
Si0 2 and CaO-rich regions on the diagram. 

One group of materials that sits between the 
organic and mineral categories is that of asphalt deriv¬ 
ing from highways. These materials will typically con¬ 
sist largely of inorganic mineral constituents bound 
together with a smaller volume of organic bitumen. 
However, where mixed waste is being handled, because 
the inorganic and organic components are extremely 
difficult to separate, and because their density usually 
lies closer to the range of the mineral fraction, it is 
normally the case that these materials end up as part 
of this fraction. 

Managing Construction and Demolition Waste 

Over the last 30 years, the application of integrated 
waste management strategies to wastes deriving from 
most industrial activities has become the favored 
approach. Such strategies adopt a collection of man¬ 
agement options whereby waste arisings from a given 
activity are dealt with comprehensively and in 
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Construction and Demolition Wastes. Figure 3 

Ca0-Si0 2 -Al 2 0 3 ternary diagram for demolition waste sampled from sites in the UK [28] 


a manner which satisfies a series of objectives, includ¬ 
ing safety and sustainability. 

Where sustainability is an objective, it is common 
for an integrated waste management strategy to strive 
to select options at as high a level as possible within 
a “waste hierarchy” The detail of waste hierarchies 
varies from source to source, but the general form 
taken is usually that shown in Fig. 4, with reduction 
of waste being the most desirable management option 
and disposal being the least. In between these options 
are reuse: the return of material back to its previous 
application with minimal reprocessing; recycling, 
where return is only possible with reprocessing; 
composting; and energy recovery, which normally 
refers to waste-to-energy combustion, but may also 
include the production of fuels through pyrolysis or 
anaerobic digestion. 

Recycling can take either a closed- or open-loop 
form, with the former referring to the return of mate¬ 
rial back to its original mode of use and the latter 
referring to recycling into an alternative application. 


Reduction 

Re-Use 

Recyling and Composting 
Energy Recovery 
Disposal 

Construction and Demolition Wastes. Figure 4 

The waste hierarchy 

The fact that construction and particularly demolition 
waste frequently arises in mixed form often prevents 
closed-loop recycling on the grounds that separation of 
many constituents is impossible, and so return to orig¬ 
inal use is no longer an option. 
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Open-loop recycling tends to be viewed as being a 
slightly less desirable route, since there is sometimes, but 
not always, a compromise in quality of the resulting 
product leading to a loss of value. Recycling to lower- 
value products is sometimes referred to as “down¬ 
cycling” Another potential problem with open-loop 
recycling is that the “recyclability” of the resulting prod¬ 
uct may be progressively compromised at each cycle, 
which clearly reduces the sustainability of the process. 

The Delft ladder is an attempt at applying the waste 
hierarchy specifically to demolition waste [29]. This is 
a hierarchy which attempts to prioritize different 
options for end-of-life structures and the structural 
elements and components which make up structures. 
The hierarchy is as follows: 

1. Prevention: devise building systems that allow dis¬ 
mantling and reassembling for full recycling 

2. Object renovation: renovating a structure for con¬ 
tinued use 

3. Element reuse: remove elements and refurbish for 
reuse 

4. Material reuse (recycling): waste materials 
processed for return to production process 

5. Useful new application: material, element, or 
component used in a different situation 

6. Immobilization with useful application: poten¬ 
tially harmful material rendered benign when 
used as a raw material for a new component 

7. Immobilization: potentially harmful material ren¬ 
dered benign and sent to landfill 

8. Incineration with energy recovery: heat from 
combustion of waste used to heat water and gen¬ 
erate electricity 

9. Incineration 

10. Landfill 

As shall be seen, this chapter takes a slightly mod¬ 
ified approach to that proposed by the Delft ladder. 
Nonetheless, the basic philosophy of this and the more 
generic waste hierarchy are adopted in subsequent 
sections. 

Various options available for using construction 
and demolition waste are discussed below, using the 
waste hierarchy as a means of structuring the discus¬ 
sion. The options examined include both established 
technologies and technologies whose feasibility has 
been demonstrated, but whose large-scale adoption 


has not yet been realized. The options examined are 
summarized in Fig. 5. 

Reduction 

During construction, reduction of both accidental and 
predictable construction wastes requires different 
approaches. The nature of predictable waste means 
that there is potentially scope at the planning stages of 
a project to “design-out” many occurrences of waste. 
This can include planning to allow offcuts and other 
inevitable arisings to be used elsewhere, ordering sizes 
which minimize offcuts, providing suitable storage 
facilities on-site for materials, and coordinating deliv¬ 
eries to reduce storage times. 

Accidental waste, being caused by accidents, is best 
reduced through ensuring good health and safety prac¬ 
tices on-site and employing good materials acceptance 
procedures to minimize the extent to which damaged 
materials enter a site. 

Demolition waste is, ultimately, impossible to reduce 
in the sense that the eventual demolition of a structure 
will yield a fixed volume of material. One interpretation 
of what waste reduction means in this context is simply 
the avoidance of demolition via the refurbishment of 
existing structures for new purposes. Such activities have 
been established practice in many parts of the world for 
some time, but have become increasingly popular. 

Some caution is required in terms of recommending 
refurbishment over other options in the waste hierarchy. 
A recent life-cycle assessment study found that refur¬ 
bishment of a particular structure (a bank headquarters) 
had larger levels of associated carbon dioxide emissions 
compared to demolition and construction of a new 
building [30]. The main reason for this was that 
projected emissions associated with service and main¬ 
tenance of the refurbished structure were higher. This 
does not necessarily mean that refurbishment is not 
a sustainable option, but does stress the importance of 
establishing the appropriateness of an existing structure 
and making sure that refurbishment addresses issues 
relating to energy efficiency as thoroughly as possible. 

Reuse 

Reuse refers to the removal and possible refurbishment 
of structural elements or components from buildings 
for use in a similar application. Generally, the 
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Construction and Demolition Wastes. Figure 5 

Options for the various constituents of construction and demolition waste 


reusability of an article will depend on the ease with 
which it can be removed, and the extent to which 
service has affected its aesthetic and engineering qual¬ 
ities. Currently the main markets for reclaimed mate¬ 
rials are fired-clay bricks (usually from structures in 
which lime mortar was used) and timber flooring. Both 
types of article are modular, thus allowing removal of 
individual items with relative ease, and have agreeable 
aesthetic qualities, sometimes improved by age. 

A UK study has appraised various construction 
systems in terms of their reusability [17]. The results 
of the appraisal are summarized in Table 4. 

Recycling 

The nature of recycling of wastes deriving either from 
construction, renovation, or demolition is very depen¬ 
dent on the form in which the material is obtained. 
Where the material has been separated into individual 
material types, there is usually more scope for 


recycling, and closed-loop recycling is likely to be one 
of these options. However, where waste is mixed, it is 
much more likely that lower-value recycling options 
will be the only ones available. 

The following section examines the most obvious 
options for recycling, but makes no claim to be com¬ 
prehensive. Where materials have been separated, the 
options for recycling, whether wastes derive from con¬ 
struction or demolition, are in most cases the same. 
However, the section will start by examining the 
processing of mixed demolition waste for recycling, 
the main product of which is recycled aggregate. 

Processing Mixed Demolition Waste for Recycling 

The processing of concrete or mixed demolition waste 
requires two basic operations - size reduction and 
screening - to obtain the required grading of particles. 
However, depending on the nature of the source mate¬ 
rial, there is likely to be a need to remove undesirable 
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Construction and Demolition Wastes. Table 4 Results of evaluation of various construction systems for their suitability 
for subsequent reuse [17] 


Suitability for reuse 

Category 

Lower 

Higher 

Wall finishes 

• Plaster on substrate 

• Ceramic wall tiles 

• Metal wall sheeting 

• Synthetic flexible wall tiles 

• Wallpaper 

• Mirror finishes 

• Timber boarding system 

• Textile fixed to timber support 

• Waterproof wall panels 

• Polished plaster panels 

Internal 

partitioning 

systems 

• Metal stud walling system with plaster skim 
coat 

• Metal stud walling system with removable wallpaper 

• Glass partitioning system 

• Solid straw internal partition with plaster finish 

Ceiling 

finishes 

• Plasterboard screwed to timber joists with 
plaster skim coat 

• Plasterboard ceiling on support system 

• Resin-bonded glass wool tiles on support system 

• Suspended metal tiles on support system 

• Timber decorative boarding fixed to timber joists 

Floor finishes 

• Ceramic tiles fixed with adhesive/grout 

• Wool carpet fixed with adhesive 

• Terrazzo floor fixed with adhesive 

• High-density fiberboard with loose laid carpet tiles 

• Loose laid carpet tiles 

• Wool carpet with carpet grippers 

• Solid wooden floor 

• Floating laminate flooring 

• Terrazzo floor on vinyl release system 

• Loose laid rubber tiles 

External wall 
systems 

• Render finish on solid cavity block 
construction 

• Brick and block cavity wall system with 
cement mortar 

• Planar structural glazing 

• Stone cladding or terracotta rainscreen 

• Curtain walling - metal sections with solid/glass infill 
panels 

• Metal cladding 

• Brick and block cavity wall system with lime mortar 

• Timber cladding 

Roofing 

finishes 

• Single-ply membrane fixed with adhesive 

• Asphalt applied as hot fluid 

• Plastic finish applied as liquid 

• Single-ply membrane fixed mechanically 

• Thatch roofing 

• Lead/copper/aluminum roofing 

• Clay tiles 

• Bitumen impregnated fiberboard 

Structural 

elements 

• Permanent formwork walling 

• Prefabricated concrete wall and floor panels 

• Concrete beam and insulation block floor 

• Concrete beam and block floor with screed 
finish 

• Structural insulated wall/roof 

• Panels with external render and internal 
plasterboard on plaster dabs 

• Studs, insulation, and panels with external timber 
boarding and internal screw-fixed plasterboard 

• Structural insulated wall/roof panels with external 
timber boarding and internal plasterboard on 
battens 

• Timber floor joists on joist hangers with timber 
boarding and plasterboard 

• 2Concrete beam and block floor with dry finish 

• Exposed steel roof construction with Z-purlins and 
C-sections 


constituents to ensure that the product of processing 
satisfies the required standards for end-use. Addition¬ 
ally, there may be constituents present whose market 
value makes separation desirable. 


Equipment for processing demolition materials is 
available in both mobile and static forms. In many 
cases, mobile equipment is favored on the grounds 
that processing can be carried out on-site, removing 
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the need for transportation to a processing center, 
a particularly useful arrangement when the processed 
material is to be used on the site of origin. The variable 
processing requirements for demolition waste mean 
that equipment on the market is often modular, 
allowing a range of plant units to be selected and 
arranged in sequence to tailor a processing scheme to 
a given material source and product type. Examples of 
three such sequences are shown in Fig. 6. 

Equipment for crushing and screening will usually 
be identical to that used for quarrying activities, 
although some of the plant used to remove undesirable 
components is often designed specifically for demoli¬ 
tion waste processing. 

Modular processing systems require the movement 
of material from one process to another, and this 
is normally achieved through the use of belt and 
radial conveyors and apron, plate, and reciprocating 
feeders. 

In most cases, demolition waste will undergo some 
degree of pre-sorting prior to processing. This nor¬ 
mally involves the identification and removal of large 
components either for disposal or recycling. Such com¬ 
ponents may include steel sections and reinforcement 
bars, plastic components and oversized pieces of 
timber. 

The material entering the process sequence is then 
usually crushed using impact, jaw (see Fig. 7), or cone 
crushers [18]. In some cases, a screen will be located 
immediately prior to crushing to ensure oversize mate¬ 
rial does not jam or damage machinery. It is often the 
case that a concrete crusher mounted on an excavator 
will be located directly after this screen to break down 
the material to crushable size and to remove steel 
reinforcement, etc. 

After crushing, the resulting material is screened, 
often using a sequence of screens which might include 
grizzly screens, finger screens, trommel screens, and 
vibrating screens. A screen sequence can be achieved 
either through placing screens in series or by stacking 
screens in decks with different size fractions being 
directed into different process streams. By selecting 
appropriate screen configurations, sorting for shape 
as well as size is possible, which potentially has benefits. 
For instance, long, narrow objects such as metal pipes 
and rods can be removed by including bar screens or 
similar in the screening sequence. 


Either before or after screening, the removal of 
various nonmineral constituents can be conducted. 
This may involve processes which differentiate between 
organic or metallic constituents and mineral constitu¬ 
ents based on their physical properties. However, 
it is not uncommon for hand sorting to be carried 
out, where operatives identify and remove metals, plas¬ 
tics, timber, and paper from material passing on 
a conveyor. 

The removal of ferrous metals can be achieved 
through the use of magnets (either permanent or elec¬ 
tromagnets) which can be suspended over conveyors or 
mounted in revolving drums or belt configurations. 

Where significant quantities of non-ferrous and, 
hence, nonmagnetic metal is present, eddy current 
devices can be used. Eddy current devices expose 
metal particles to a changing magnetic field which 
induces eddies of electrical current within them, in 
turn inducing magnetic fields which oppose the origi¬ 
nal magnetic field and cause the metal particles to be 
repelled [32]. During the processing of construction 
and demolition waste, the repulsive force is used to 
drive metallic particles off a conveyor. 

The density of the various constituents of construc¬ 
tion and demolition waste is also a physical character¬ 
istic which can be exploited in the separation process. 
Because most polymers and timber have specific grav¬ 
ities below one, the use of water as a medium for 
separation is an option. Separation in this way can 
involve settling tanks or revolving, water-filled drums 
[33]. Air classification can also be used, which involves 
passing the material over a vertical stream of air which 
will remove less dense constituents. Air classification 
has the additional benefit of removing fines having 
terminal velocities lower than the velocity of the air 
stream. Because of this, however, it is usually necessary 
for the exhaust from air classification to subsequently 
pass through pollution control devices capable of 
removing particulates [18]. 

Other devices with origins in minerals processing 
that separate on the basis of differences in density, and 
which can potentially be used on construction and 
demolition waste, include jigs and shaking tables [34]. 

The removal of fines can also be achieved through 
washing, which can be carried out using barrel or log 
washers, in the same way as for conventional 
aggregates. 
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Construction and Demolition Wastes. Figure 6 

Three configurations for recycling of mixed demolition waste, (a) Basic process [31 ]; (b) wet process with hand sorting [19]; 
(c) dry process with air classification, hand sorting, and length sorting [18] 
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Construction and Demolition Wastes. Figure 7 

Mobile jaw crusher being used to crush demolition waste (Photograph courtesy of tayside contracts) 


Recycling Mineral Waste The use of mineral frac¬ 
tions derived from the processing of construction and 
demolition waste as aggregate is an obvious option. 
Both coarse and fine aggregates can be produced, with 
applications as granular fill, aggregate for hydraulically 
bound and unbound roadbase and pipe-bedding mate¬ 
rials. The material can also be used as an aggregate in 
concrete. 

Usually, a distinction is made between aggregate 
obtained from the reprocessing of what is essentially 
waste containing only concrete and waste containing 
a range of different constituents of the mineral fraction. 
The former is usually referred to as recycled concrete 
aggregate (RCA), while the latter is recycled aggregate 
(RA). This distinction has most significance where the 
recycled material is used in concrete. 

RCA and RA in Concrete The use of RCA and RA in 
concrete is an approach that has been extensively 
explored, and it is generally agreed that, with 
adequate quality control and the use of suitable mix 
design methods, good quality material can be 
produced. 

Particles of RCA typically have a thin layer of hard¬ 
ened mortar at their surface, retained during the 
crushing process. This layer usually has a strong influ¬ 
ence on the subsequent performance of the material in 
concrete. Because the porosity of the layer is usually 


higher than that of the aggregate within, water absorp¬ 
tion of the recycled material is higher than that of the 
aggregate in its virgin state. The extent to which water 
absorption is increased is partially influenced by the 
nature of the concrete from which the aggregate was 
obtained, although the effect is slight [35]. 

Higher water absorption has the effect of reducing 
workability of fresh concrete, and this means that to 
achieve a given workability, a higher level of mixed 
water is required [36]. 

The presence of mortar also influences the com¬ 
pressive strength of the hardened concrete, with 
recycled aggregate reducing compressive strength rela¬ 
tive to a natural aggregate mix with the same water- 
cement ratio. This effect is more pronounced when 
RCA is used as a fine aggregate, resulting from the 
higher surface to volume ratio giving the material 
a higher mortar content. Where recycled aggregate is 
used as a partial replacement for virgin aggregate, the 
effect is minor at low levels of replacement, but 
becomes more pronounced at higher levels [37]. 

Drying shrinkage and creep are also typically more 
pronounced in concrete mixes containing RCA [38] as 
are durability properties relating to the permeation 
properties of concrete [39]. 

The solution to the reduction in strength and com¬ 
promise of other properties is usually simply to reduce 
the water-cement ratio of a concrete mix. However, the 
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higher levels of water required to maintain workability 
mean that potentially uneconomic quantities of 
cement may be required through conventional mix 
proportioning. This makes the use of superplasticizing 
admixtures an attractive option. Appropriate modifi¬ 
cation of mix proportions has been shown to produce 
concrete whose structural performance in reinforced 
concrete elements in terms of shear capacity and flex¬ 
ural behavior has been demonstrated to be comparable 
to concrete of equivalent strength made with natural 
aggregate [40, 41]. 

The performance of recycled aggregate in concrete is 
very much dependent on what constituent materials are 
present, although similar effects are typically encoun¬ 
tered in terms of water absorption. The main concern 
with RA is its greater potential for the contamination 
with undesirable components, as discussed below. 

Good quality control in the production of RCA and 
RA is necessary, mainly to avoid problems relating to 
the presence of detrimental substances. Contamination 
with chlorides is one of the major concerns. 

Chlorides may arise in RA and RCA as a result of 
contamination during service from chloride-bearing 
environments such as those experienced by coastal 
and highway structures. In addition, chlorides may 
have been introduced into concrete at the construction 
stage through contaminated materials, the use of sea¬ 
water as a mix constituent, or via the use of calcium 
chloride as an accelerating agent. Chlorides are capable 
of significantly accelerating the rate of corrosion of steel 
reinforcement. As a result, chloride levels in concrete 
are strictly limited in concrete standards. 

Additional potential problems presented by the use 
of RA in concrete include the presence of sulfates, 
aluminum metal, and materials capable of undergoing 
alkali-silica reaction (ASR). 

In the case of sulfates, contamination may arise from 
gypsum plaster from the original structure, or possibly 
as a result of ingress from an external source. Soluble 
sulfate has the potential to cause delayed ettringite for¬ 
mation, where concrete containing relatively high levels 
of concrete aggregate and cured at elevated temperatures 
expands and cracks as a result of the decomposition and 
subsequent reformation of ettringite. 

Aluminum will, in the alkaline conditions created 
by hydrating Portland cement, react to evolve hydrogen 
gas [42]. When evolved within concrete, bubbles form, 


which can lead to significant expansion of the fresh 
material and compromise strength. Additionally, the 
presence of hydrogen gas presents an explosion hazard. 

The presence of glass in RA may be capable of 
undergoing harmful ASR in contact with hydrating 
cement, which causes expansion and cracking of con¬ 
crete [43]. 

While the implications of chloride contamination 
are serious, screening of aggregate is relatively straight¬ 
forward, since chlorides can be readily determined 
through chemical analysis. Analysis for sulfates is also 
easily conducted, but much of the problems of delayed 
ettringite formation from recycled aggregate stems 
from localized agglomerations of gypsum plaster. 
Such features are potentially missed by chemical anal¬ 
ysis of bulk samples. 

Aluminum metal can be identified through powder 
x-ray diffraction, although analysis tends to be quali¬ 
tative rather than quantitative and low levels of alumi¬ 
num, which may not be detected by this technique, are 
still capable of damaging concrete. Most ASR- 
susceptible materials elude detection by chemical and 
mineralogical analysis methods, meaning that screen¬ 
ing for these components requires the use of acceler¬ 
ated ASR test methods. These issues are one of the main 
reasons that the growth in the use of recycled concrete 
aggregate in concrete has exceeded that of recycled 
aggregate. 

It should be pointed out, however, that the presence 
of aluminum particles is likely to be limited in 
processed construction and demolition waste, due to 
the fact that ductile metals will tend not to fragment to 
any great extent during processing, and thus will end 
up in the oversize fraction. Where handpicking and 
eddy current devices are employed, the risk of alumi¬ 
num being present is still lower. Furthermore, a recent 
study of the propensity for recycled aggregate from 
a range of sources to undergo ASR found that the 
materials presented very little risk [44]. 

Pozzolanic Components in the Mineral Fraction One 
interesting potential future option for the recycling of 
the mineral fraction is in exploiting the pozzolanic 
properties of certain constituents. Certain silica-rich 
materials undergo pozzolanic reactions with lime in 
the presence of water to form cementitious products 
which provide compressive strength. This is exploited 
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in concrete production by combining industrial by¬ 
products such as fly ash from coal-fired power 
generation with Portland cement. In construction and 
demolition waste, both fired-clay ceramics and glass 
possess pozzolanic characteristics [45, 46], and the 
mineralogy of many fired-clay ceramics is frequently 
very similar to that of fly ash. 

To exploit this chemical characteristic, it is neces¬ 
sary to grind the materials to very fine particle sizes, or 
at least to utilize the fines derived from processing. 
However, the main barrier to adopting such an 
approach is the mixed nature of the mineral fraction. 
Research examining the possibility of thermally 
processing mixed mineral demolition waste to render 
more of the mixture cementitious [28], while partially 
successful, found that variation in performance of the 
various materials made the viability of such an 
approach questionable. The most promising route 
toward utilizing materials in this way is via the decon¬ 
struction of structures. 

RCA and RA in Asphalt RCA and RA can also 
potentially be used as aggregate in asphalt mixtures, 
although research in this area has largely concentrated 
on RCA. In asphaltic mixtures, the higher porosity of 
the RCA particle surfaces is thought to absorb asphalt, 
which has the effect of reducing the effective asphalt 
content and increasing air voids [47]. 


The increase in air voids usually has the effect of 
reducing stiffness [48]. The decline in stiffness has 
implications in terms of load-bearing capacity during 
service. Deterioration in integrity of asphaltic mixtures 
resulting from contact with water is more marked with 
increasing RCA content, although this only becomes 
problematic at relatively high RCA levels. Creep rates 
and rutting depth also increase with RCA content. 

Unlike concrete, there is limited scope for adjusting 
asphalt proportions to accommodate RCA - reduction 
of bitumen content can improve properties such as resil¬ 
ient modulus, but has undesirable implications for the 
behavior of the material during placing and compacting. 

It should be stressed that performance is very much 
dependent on the source material. For instance, one 
study has observed wholly acceptable behavior from 
asphalt mixtures containing only RA [49]. 

Reclaimed Asphalt Pavement The recycling of 
reclaimed asphalt pavement or recycled asphalt 
planings (RAP) is now a well-established practice in 
many areas of the world. Due to the fact that asphalt 
surfaces are isolated from other structures, the problem 
of contamination is much less of an issue. As a result, 
the process of reclamation and recycling is relatively 
straightforward in comparison to other end-of-life 
construction materials. Reclamation is conducted by 
planing or milling asphalt from a road surface (Fig. 8). 



Construction and Demolition Wastes. Figure 8 

Planing of asphalt (Photograph courtesy of tayside contracts) 
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Recycling then involves reducing the particle size of the 
RAP and introducing it into the manufacture of hot or 
cold mix asphalt. 

Recycling in hot mix asphalt involves introducing 
a proportion of RAP into a conventional hot mix 
manufacturing process. This involves combining bitu¬ 
men with aggregate which has been heated using 
a burner. Recycling as part of the hot mix process can 
either be done at a manufacturing plant or in situ using 
mobile plant in which the planing, crushing, and hot 
mix production processes are carried out sequentially. 

During service of an asphalt road surface, bitumen 
undergoes oxidation reactions which lead to an 
increase in the proportion of larger molecules of hydro¬ 
carbons. This degradation leads to a reduction in the 
viscosity of mixes containing RAP [50]. To overcome 
this problem, the introduction of new bitumen and 
sometimes a “recycling agent” during hot mix recycling 
is employed to “rejuvenate” the mix [51]. The recycling 
agent is normally either a hydrocarbon wax or an inor¬ 
ganic zeolite and has the effect of reducing the required 
mixing temperature, which would otherwise need to be 
high to sufficiently reduce the viscosity of the bitumen 
present in RAP to allow adequate mixing and compac¬ 
tion. Such high temperatures are both costly in terms of 
energy consumption and present a safety risk. 

Most studies which have characterized hot mix 
asphalt mixtures containing RAP have found that stiff¬ 
ness (and, hence, the ability of the material to spread 
load) is increased as a result of the presence of stiffer 
oxidized bitumen [52]. However, where milling pro¬ 
cesses lead to significant reduction in particle size, 
a decline in stiffness is possible with increasing RAP 
content [53] . Typically, the inclusion of RAP in hot mix 
asphalt has the effect of compromising performance 
under cyclical loading, although it has been suggested 
that durability properties are enhanced. 

Technical requirements for the use of RAP in 
asphalt have typically placed stricter limits on materials 
in the pavement surface layer, compared to base and 
binder courses. This has certainly been the case for the 
Specification for Highway Works in the UK and the 
requirements of State Highway Administrations in 
the USA [53]. However, there is a growing recognition 
that this approach is excessively cautious, and the UK 
requirement now places emphasis on demonstrating 
that performance is appropriate at high RAP levels [54] . 


Another concern with hot mix recycling is that the 
presence of RAP leads to the emission of greater quan¬ 
tities of fumes during production and placing [55]. 
Furthermore, there is evidence that the toxicity of the 
hydrocarbons released is higher where RAP is used [56] . 

Even with the use of recycling agents, the required 
temperatures for mixing and compacting limit levels of 
RAP to around 50% [57]. Cold mix recycling permits 
higher levels of usage. 

Cold mix production involves mixing aggregate 
with an emulsion of bitumen droplets dispersed in 
water and stabilized with an emulsifying agent. The 
viscosity of such emulsions is sufficiently low to allow 
mixing and placing at ambient temperatures (Fig. 9), 
thus offering a route for utilizing RAP at levels up to 
100% of the total aggregate. Once placed, the emulsion 
breaks and the pavement stiffens. 

Another way of overcoming problems of high 
viscosity in asphalt mixtures through the use of cold 
mix technology is to employ foamed bitumen. Foam 
is created by injecting fine, cold water droplets into 
hot bitumen causing their conversion to steam 
and consequent expansion. The foamed bitumens 
viscosity is temporarily reduced, making mixing 
with RAP possible at ambient temperatures [58]. 
After a few seconds, the foam breaks down, but the 
presence of moisture maintains a low viscosity, and this 
allows the mix to also be compacted at ambient 
temperatures. 

Fresh Concrete Recycling options also exist for excess 
fresh concrete from construction. Residue remaining in 
ready-mixed concrete trucks has, in the past, been 
disposed of by washing out the mixer and disposing 
of the resulting material by settling out the aggregates 
and discharging the remaining sludge (“wash water”) 
which consists of water and particles of cement and 
fines into sewers. This approach has gradually evolved 
toward less wasteful procedures as a result of the 
increasing economic burden of landfill disposal and 
restrictions in many parts of the world on wastewater 
disposal. The most obvious way in which the mixer 
residue can be recycled is by recycling both the settled 
aggregates and the clarified water (which has been 
allowed to undergo sedimentation) in concrete 
production and disposing of the sedimented fines. 
However, a more sustainable solution is to recycle the 
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Construction and Demolition Wastes. Figure 9 

Asphalt paver laying cold mix asphalt (Photograph courtesy of tayside contracts) 


wash water without sedimentation. The affect of this 
approach on concrete performance has been 
demonstrated to be minimal [59]. 

An alternative means of achieving the same objec¬ 
tive is through the use of stabilizer/activator systems for 
halting and restarting the cement hydration process 
[60]. This technique involves concrete returned to the 
ready-mixed concrete plant being dosed with 
a retarding admixture referred to as a “stabilizer.” The 
retarder is usually a phosphate or phosphonic acid. An 
appropriate dosage is used to ensure a sufficient level of 
retardation, and the stabilized mix is either stored in 
the truck or in separate containers. When concrete 
production resumes, an accelerating admixture (the 
“activator”) is added at a dosage sufficient to counter¬ 
act the effect of the stabilizer. It is then combined with 
fresh concrete to produce mixes containing up to 35% 
recycled concrete. The impact on hardened concrete 
performance is minimal. 

Soil Conditioning The use of mineral constituents of 
construction and demolition waste as a means of 
ameliorating soils is an application which has been 
given some consideration in recent years. The mineral 
fraction of mixed construction and demolition waste 
typically contains relatively high levels of soluble 
calcium, sulfate, and potassium [61], which are 
valuable nutrients to all plants [62]. Additionally, in 


“sodic” soils, where sodium accumulation compromises 
soil structure in arid environments, calcium can be used 
to mobilize sodium ions fixed in clays, allowing them to 
be washed away [63]. Also, calcium additions can be 
used to tackle problems such as subsoil acidity 
syndrome [64] and crust formation [65] in soils. 

Typically the application of calcium to soils to real¬ 
ize these benefits is achieved using gypsum, meaning 
that the use of waste plaster from either construction or 
demolition activities has also been considered for such 
a role [66]. 

Gypsum Regardless of the unsustainability of disposing 
of any construction and demolition waste component to 
landfill, there is a very strong environmental argument 
for removing gypsum waste from waste streams destined 
for such a fate. This is because leaching of sulfate can 
present environmental problems, and the presence of 
gypsum promotes the formation of hydrogen sulfide 
gas which has an unpleasant odor and can lead to 
health problems [67, 68]. 

Closed-loop recycling of gypsum in the form of 
plasterboard (drywall) is an established practice in 
many parts of the world [69-71]. Reprocessing for 
recycling involves grinding up the recovered material 
using grinding apparatus, usually in combination with 
trommel screens which act to remove the paper¬ 
backing material which would otherwise render the 
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material unusable [71]. In some cases, magnetic sepa¬ 
ration is used to remove metallic contaminants, such as 
nails. New plasterboard can tolerate levels of around 
25% by mass of recovered gypsum [70]. The separated 
paper is also suitable for recycling. 

The tolerance of the process to contaminants is 
extremely low (less than 3% contamination can be 
accommodated) [69]. As a result, removal of plaster¬ 
board prior to demolition or, at the very least, well- 
managed pre-sorting of demolition rubble prior to 
processing for other purposes is necessary. In the case 
of plasterboard deriving from construction, 
uncontaminated material can be obtained by locating 
skips at construction sites into which waste board can 
be deposited for collection by reprocessors [69, 71]. 
Additionally, contamination with moisture renders 
plasterboard potentially unrecyclable, although pro¬ 
cesses have been developed more recently which have 
overcome this problem [70]. 

Recovered material can also be used as the gypsum 
component of Portland cement [71]. 

Glass Infrastructure for the recycling of glass is well 
established globally, and recycling of flat glass from 
structures is, in theory, a relatively straightforward 
process. Furthermore, glass recycling has favorable 
sustainability credentials, since its return to the glass 
furnace reduces the overall energy requirement of the 
manufacturing process. The saving in energy is 
dependent on the production rate, but at high 
production rates, a saving of around 19% is possible 
by using 50% recycled glass (or “cullet”) [72]. 

In reality, the physical characteristics of glass make 
it indistinguishable from the bulk of the mineral frac¬ 
tion of demolition waste, making separation after 
demolition impossible. This means that glazing must 
be removed from structures prior to demolition. Pres¬ 
ently, therefore, the majority of glass arising from con¬ 
struction for recycling tends to come from the 
replacement of glazing as a result of maintenance and 
refurbishment activities. Additionally, problems with 
contamination of glass with putty and other materials 
have acted as an obstacle to recycling. Nonetheless, 
some progress is being made, with recycling rates 
increasing, particularly for the manufacture of con¬ 
tainer glass and glass fiber, whose cullet quality criteria 
are slightly more tolerant [73]. 


The possibility of using glass as aggregate has also 
been explored. When employed as an unbound mate¬ 
rial, for instance as a pipe-bedding material, the use of 
glass is relatively straightforward. The use of glass as 
a bound material in asphalt mixtures is an established 
practice, albeit normally using material from the 
recycling of container glass. Although successful use 
of glass in asphalt pavement surfaces has been demon¬ 
strated at aggregate replacement levels of up to 15%, 
low bonding strengths and reduced friction mean that 
use in non-wearing courses is favored [74] . Glass aggre¬ 
gate in concrete functions well physically, but can 
potentially undergo expansive alkali-silica reaction 
(ASR) [75]. However, the inclusion of materials such 
as ground granulated blast furnace slag in concrete 
mixes containing glass appears to control expansion 
effectively [43]. 

Recycling Organic Waste 

Timber There are many potential means of recycling 
waste timber in the manufacture of construction 
materials. These include particleboard (or chipboard), 
wet-processed fiberboard, dry-processed fiberboard 
(MDF), oriented strand board (OSB), and cement- 
bonded particleboard [76]. Currently, only 
particleboard manufacture employs significant 
quantities of recycled timber. 

All of the products involve timber-derived material 
held together by some form of binder. In every case, 
reprocessing of the timber is required, firstly through 
size reduction. The least amount of size reduction is 
required in the manufacture of OSB, where timber 
strands up to 75 mm in length are cut and then sprayed 
with a layer of adhesive. The strands are then laid into 
mats, usually of three layers in which the grain of the 
strands may be aligned. The mats are then pressed 
between platens to form the boards and heat applied 
to cure the resin. 

Smaller particle sizes are used for particleboard and 
cement-bonded particleboard. The former is bound 
with an organic resin, while the latter is bound with 
Portland cement. Like OSB, particle board is 
manufactured by first producing a mat of particles 
which have been sprayed with resin and then pressing 
the mat between heated platens, passing the mat 
through heated rollers or extrusion of particles through 
a heated die. Cement-bonded particle board is 


c 




2410 


c 


Construction and Demolition Wastes 


produced by combining Portland cement and wood 
particles in ratios of approximately 75:25 by mass. 

The manufacture of fiberboard requires further 
processing prior to manufacture, in the form of expo¬ 
sure to steam at high pressure. This allows a subsequent 
mechanical defibration process to separate the fibers. 

Wet-processed fiberboard is produced by forming 
a mat of wetted fibers which is then pressed between 
heated platens. No application of resin is required, as 
the heat softens the lignin on the surface of the fibers, 
binding them. In the manufacture of MDF, resin is 
sprayed onto dry fibers, which are then formed into 
boards in a similar manner to particleboard. 

Uses for chipped timber include animal bedding, 
surfacing material for bridleways, footpaths, and chil¬ 
dren’s playgrounds. Additionally, chipped timber 
mulches have been found to be highly effective, with 
good capabilities in suppressing the growth of weeds, 
maintaining surface soil temperatures and retaining 
soil moisture [77]. 

However, concern has been expressed regarding the 
use of treated timber in many of these applications. 
Timber treatments are principally used as a means of 
preserving wood from biological degradation. Timber 
treatments include waterborne preservatives such as 
copper-chromium-arsenic (CCA) compounds, tar oil 
preservatives (such as creosote), copper and zinc 
organics delivered via organic solvents, as well as paints 
or varnishes [76]. It has been found that arsenic deriv¬ 
ing from CCA treatments can leach from mulches (see 
section Environmental and Sustainability Issues). Fur¬ 
thermore, a risk assessment carried out in the UK 
concluded that the use of treated wood chippings for 
animal bedding, mulches, and compost was, in most 
cases, inappropriate [77]. In many countries, limits are 
now set for levels of preservatives in board materials, 
and this is likely to improve mattes in the future. 

Composting The composting of timber and other 
biodegradable components of construction and 
demolition waste is also an available option. 
Composting involves the aerobic digestion of 
biodegradable waste by microorganisms to produce 
humus for agricultural purposes. The success of 
composting is dependent on factors such as moisture 
content, oxygen levels, temperature, pH, and the 
texture and biochemical composition of the waste. 


The key factor relating to chemical composition is 
the carbon-nitrogen (C/N) ratio, which should ideally 
be around 30 [78]. Because the main chemical constit¬ 
uents of timber are based around carbon, oxygen, and 
hydrogen, the C/N ratio of timber is typically around 
700. Thus, composting involving timber is best 
conducted in combination with wastes richer in 
nitrogen, such as agricultural manure or green plant 
wastes [79]. 

Biochemical composition is also relevant in the case 
of timber, as a major constituent of timber is lignin, 
which is usually broken down at a relatively slow rate by 
microorganisms. 

The issue of texture is also important. Texture in 
this context refers to the structure of the waste, with 
a high surface area and open structure allowing access 
to microorganisms and retention of oxygen and mois¬ 
ture. To obtain a suitable texture, timber requires 
chipping prior to composting. 

As before, the presence of CCA preservatives is 
likely to compromise the extent to which certain 
sources of wood can be used in this way. However, it 
should be stressed that the fungal and microbial 
decomposition of timber during composting appears 
to have the effect of reducing the plant toxicity of 
timber (albeit in timber containing no CCA) [80]. 
The presence of CCA seems to have little effect on the 
populations of microorganisms present during 
composting, and there is little evidence of release of 
the metals used in CCA [81]. However, the longer-term 
consequences of their presence are likely to be detri¬ 
mental and should be avoided. 

Recycling Plastics Recycling plastics can be conducted 
in two ways - mechanical recycling and feedstock 
recycling. Mechanical recycling refers to the reforming 
of plastics into new articles. When the plastics involved 
are thermoplastics, mechanical recycling normally 
involves the application of heat to produce a melt 
which is then extruded. Elevation of temperatures can 
potentially cause the polymers to undergo thermal 
degradation leading to deterioration of mechanical 
and durability properties. 

Plastics derived from construction and demolition 
waste are currently most commonly encountered in 
mixed form. Recycling of such material presents prob¬ 
lems, mainly as a result of the immiscibility of different 
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polymers. This immiscibility leads to a lack of homo¬ 
geneity in the melt which ultimately yields poor 
mechanical properties. 

Despite this, some success has been achieved in the 
extrusion of articles using blends of incompatible poly¬ 
mers at temperatures at which one polymer melts and 
the other remains solid, but above the glass transition 
temperature. Extrusion causes the solid polymer to be 
drawn out into fibers held within a matrix of the molten 
polymer. This approach has been successfully used to 
manufacture items such as ties for rail tracks. However, 
the approach is limited to a specific pair of polymers 
(HDPE and PS) and would likely be unsuitable for the 
PVC-rich plastics fraction deriving from construction 
and demolition waste [82]. Other similar techniques 
have been developed for the manufacture of board, 
pallets, stadium seating, benches, cable drums, perfo¬ 
rated paving blocks, and underground chambers for 
water distribution applications [83]. 

Laboratory studies have examined the possibility of 
overcoming the issue of immiscibility. One approach is 
through the use of compatibilizers - compounds which 
are capable of producing a more uniform melt. While 
compatibilization is well established for binary poly¬ 
mer blends, the situation is more complex for the 
combinations of multiple polymer blends. However, 
some success has been achieved with mixtures of poly¬ 
olefins (PE and PP) and mixtures of polyethylenes and 
PS [84, 85]. Examples of compatibilization of multiple 
polymer blends containing PVC are not documented, 
although the improved properties of PVC/LDPE blends 


observed after the addition of a chlorinated polyethyl¬ 
ene compatibilizer indicate one potential route [86]. 

Another means of improving the properties of 
materials derived from mixed plastic melts is by milling 
the waste into a powder prior to melting and extrusion. 
Many polymers used in construction are above their 
glass transition temperature under ambient conditions 
and are consequently sufficiently tough to resist milling 
processes (Table 5). For this reason, successful reduc¬ 
tion of waste plastics to very small particle sizes is best 
achieved by milling at low temperatures. At laboratory 
scale, the injection of liquid C0 2 into the chamber of 
a ball mill has been demonstrated to produce PE-PP 
mixtures which display evidence of greater compatibil¬ 
ity and possible copolymerization [88] . Similarly, cryo¬ 
genic impact milling - in which liquid N 2 is used to 
reduce the temperature to below — 150°C - has been 
shown to produce enhanced tensile strength in mix¬ 
tures of post-consumer polymer waste comprising PE, 
PP, and PS compared to the unmilled mixture [89]. 
Comparable results can be achieved through the extru¬ 
sion of polymer mixtures at cryogenic temperatures 
[90]. It should be stressed that the energy required to 
produce liquid C0 2 and N 2 is likely to significantly 
compromise the viability of such techniques in both 
economic and sustainability terms. 

The problems associated with using mixed polymer 
wastes have lead to much research and development 
activity focusing on the separation of such wastes. One 
of the characteristics of the polymers which can be 
exploited to effect separation is density. Separation of 
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Polymer 

Density, kg/m 3 

Typical glass transition 
temperature, °C 

Charge from triboelectric 
effect, pC after contact 
with gold or brass [ ] 

PVC 

1,400 

80 

-1.6 

HDPE 

960 

25 

0 (Brass) 

LDPE 

920 

-5 

0 (Brass) 

PP 

900 

-20 

0 

PS 

1, 050 (High impact PS) 

100 

0 

PU 

1,120 

-30 

+0.1 

ABS 

1,060 

110 

+0.1 
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high- and low-density polymers is possible using tech¬ 
nologies such as air tables, cylinder cyclones, and flo¬ 
tation [91, 92]. All these techniques permit the 
separation density to be controlled to permit specific 
fractions to be isolated. 

In the case of air tabling, plastics waste is passed over 
a vibrating porous deck through which air is passed - 
the velocity of the air controls the separation density. 
The technique has been found to be most efficient when 
the difference between the densities of the components 
to be separated exceeds around 400 kg/m 3 . Table 5 
shows that air tabling is most appropriate for the 
separation of PVC from other polymers deriving from 
construction and demolition wastes. 

Cylinder cyclones create a cyclone of liquid with an 
air core within a cylinder such that material less dense 
than liquid floats on the cyclone surface, while denser 
material is forced out through exit port in the cylinder’s 
side. The separation density in this case is controlled by 
adjusting the density of the liquid medium. The liquid 
is normally water, and its density can be increased by 
the addition of fine mineral particles. 

The use of flotation in liquid is also an effective 
means of separating certain polymers from others. The 
use of water is the most straightforward technique, and 
Table 5 shows that the densities of polymers commonly 
encountered in construction and demolition waste fall 
either side of that of water. Furthermore, solutions can 
be tailored to a required density through the addition 
of salts or organic solvents. 

Exploitation of density is, however, not usually 
sufficient to fully isolate individual polymer compo¬ 
nents in construction and demolition waste streams, 
since there is little difference in the densities of many 
polymers. A flotation technique that does not rely on 
differences in density is froth flotation [93]. This 
involves introducing mixed polymer waste in particle 
form into a cell containing a solution of frothing agent 
through which air is passed in a manner which allows 
froth formation. The solution also contains a collector 
compound which is preferentially adsorbed onto the 
surface of certain polymers, rendering them 
hydrophobic. This preferential absorption can be pro¬ 
moted by adjusting the solution pH to within a specific 
range. Air bubbles attach themselves to the surfaces of 
polymer particles which have been rendered hydropho¬ 
bic, causing them to float to the surface of the cell. 


Of particular relevance to construction and demolition 
waste is the successful separation of ABS and high 
impact PS (two polymers with very similar densities) 
using this technique [94]. 

Triboelectric separation is another technique suited 
to separating plastics of similar density. It exploits the 
triboelectric effect, where particles of different mate¬ 
rials acquire charges of differing polarity and magni¬ 
tude after being brought into contact with another 
material, usually with friction acting between the two. 
Electrostatic charges acquired by polymers are given in 
Table 5, indicating that, again, PVC is distinct from the 
other materials. By allowing the particles to then fall 
through space containing an electric field created by 
establishing a high potential difference between two 
electrodes, the differently charged particles are directed 
by electrostatic deflection either toward one or other of 
the electrodes. Contact between polymer particles and 
other materials to induce charging can be achieved in 
a number of ways including the use of cyclones, fluid¬ 
ized beds, and rotating drums [95]. 

Most of the commonly encountered polymers in 
waste streams have very distinct infrared spectra. 
Since the 1990s, there have been significant advances 
in the development of infrared spectrometers capable 
of rapidly distinguishing between different types of 
plastics. This has, in turn, lead to the appearance of 
a wide range of plastics separation systems on the 
market. Such devices are able to identify the constitu¬ 
ent polymer in individual plastic articles passing 
a spectrometer via a conveyor. Articles can be accepted 
or rejected usually through the application of an air 
blast. Such systems are highly effective when dealing 
with packaging waste in municipal waste streams. 
However, application to polymer mixtures deriving 
from construction and demolition waste, where indi¬ 
vidual articles are potentially bulky and thick, is much 
less viable. 

Separation of polymers opens up new possibilities 
for recycling, although the mechanical recycling 
of recovered polymers is not without problems. 
This is principally the result of peroxidation, which 
results from reactions involving organic hydroperoxide 
compounds at ambient temperatures initiated by sun¬ 
light in external exposure conditions [83]. Peroxida¬ 
tion is partly responsible for deterioration of polymer 
properties during service. However, it presents an 



Construction and Demolition Wastes 


C 


2413 


additional problem during recycling, since partially 
peroxidated polymers are particularly prone to 
undergo thermal degradation when exposed to ele¬ 
vated temperatures. 

PVC will normally contain stabilizing additives 
which prevent degradation during melting and extru¬ 
sion. The stabilizing effect of these additives persists 
during mechanical recycling, and it has been demon¬ 
strated that PVC can be mechanically recycled up to 
five times between periods of accelerated aging to pro¬ 
mote peroxidation [96]. Furthermore, additional sta¬ 
bilizers can be added prior to recycling to ensure 
stability. However, the situation is less straightforward 
where PVC from a range of sources is being recycled. 
Problems can arise from the presents of contaminants 
in recovered PVC. For instance, it has been shown that 
very small quantities of PE accelerate the degradation 
of PVC during mechanical recycling [97] . This problem 
can be resolved by combining recycled PVC with virgin 
material. 

Combining sources of PVC containing different 
additives can cause problems. For instance, the combi¬ 
nation of lead-based additives with sulfur-bearing 
additives causes discoloration [98]. Of most concern, 
however, is the formation of transformation products 
from stabilizers during the processing of PVC. These 
products are usually metal chlorides, which are poten¬ 
tially highly toxic. The issue is of greater importance 
where recycling is involved, since these products will 
accumulate with each cycle of reprocessing [83]. 

The problems associated with PVC tend to mean 
that the material is usually recycled in products where 
release of metals does not pose a significant risk. 
Recently, research has shown that PVC waste can be 
utilized in the manufacture of lightweight aggregate for 
concrete [99]. 

The recycling of plastics in asphalt mixtures has 
been demonstrated to be feasible, with some benefits 
in terms of modification of properties. Most research 
into this area has concentrated on polyethylene, which 
has been shown to enhance stability with respect to 
degradation resulting from exposure to water [100]. 
The inclusion of PVC in hot mix asphalt has been 
shown to improve flexural strength and wear resistance 
[101]. Incorporation of PVC in asphalt for waterproof¬ 
ing improves mechanical performance as well as resis¬ 
tance to chemical attack [102]. 


Recycling Metallic Waste A study of the economics 
of recycling materials has concluded that only the 
recycling of ferrous metals and copper from construc¬ 
tion and demolition waste are currently viable econom¬ 
ically [103]. This is partly because these two metals are 
easily removed from this stream. 

Ferrous metals are likely to largely take the form of 
steel structural components such as sections and rein¬ 
forcement bars and are, hence, easily identified and 
removed. Moreover, they are magnetic. Copper is 
mainly present as piping and cable, which is readily 
recovered from construction sites and will usually be 
stripped from a structure prior to demolition. It is, 
however, important to remember that this conclusion 
is potentially dependent on the techniques and tech¬ 
nology used for separation and the market value of 
metals, with future increases in price potentially 
converting construction and demolition waste into 
resources for more extensive metals recovery. 
For instance, construction and demolition waste 
is one of the largest zinc-bearing waste streams in 
existence [104]. 

The benefits of recycling metals from a sustainabil¬ 
ity perspective are significant, since recycling usually 
requires significantly less energy than production of 
virgin metal from ore. This is shown in Table 6, which 
indicates embodied energy values associated with the 
production of recycled steel, copper, aluminum, lead, 
and zinc are considerably less than the virgin metal. 
While it should be stressed that embodied energy 
values will be dependent on local conditions such as 
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Embodied energy of virgin and recycled metals commonly 
encountered in construction and demolition waste [105] 



Embodied energy, MJ/kg 


Metal 

Virgin 

Recycled 

Saving 

Steel 

35.3 

9.5 

73% 

Copper 

70.0 

17.5 

(High-grade scrap) 

75% 

Aluminum 

218.0 

28.0 

87% 

Lead 

49.0 

10.0 

81% 

Zinc 

72.0 

9.0 

88% 
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transport distances, ore purity, and local manufactur¬ 
ing practice, the overall result is likely to be similar 
regardless of location. 

Energy Recovery 

The organic fraction of construction and demolition 
waste is combustible, and the generation of heat and 
power from the combustion of wood, plastics, and 
paper is an obvious means by which energy can be 
recovered [20]. However, problems associated with 
this approach have led to the development and growth 
of other means of energy recovery. 

Combustion Combustion can be carried out in com¬ 
bined heat and power incineration plants where heat is 
used both to generate electricity through a steam tur¬ 
bine and also to heat water for local homes and 
industries. 

The calorific value of plastic is relatively high, with 
higher heating values for construction plastics varying 
between 18.0 and 46.6 MJ/kg (Table 7) [106]. This 
compares favorably with many conventional oil- 
derived fuels such as fuel oil whose higher heating 
value is around 46 MJ/kg. However, it should be 
noted that the higher heating value of PVC, which is 
usually present to the largest extent in construction and 
demolition waste, is much lower than many other poly¬ 
mers. The higher heating value of timber lies between 


Construction and Demolition Wastes. Table 7 Higher 
heating values of a range of polymers [1 06] 


Polymer 

Higher heating value, 
MJ/kg 

Polyvinyl chloride 

18.0 

High-density polyethylene 

46.6 

Low-density polyethylene 

46.6 

Polypropylene, isotactic, or 

46.6 

syndiotactic 


Polypropylene, isotactic 

46.5 

Polystyrene, isotactic 

41.5 

Polystyrene, atactic 

41.6 

Polyurethanes 

23.7 

Acrylonitrile butadiene styrene 

40.0 


19 and 28 MJ/kg of dry wood (Table 8), although this is 
compromised by the presence of water [107]. 

Separation of the organic fraction from the 
noncombustible materials is the only viable option, 
since the specific heat capacities of many 
noncombustible construction and demolition waste 
constituents is relatively high, meaning that efficiency 
would be compromised considerably by using mixed 
waste. 

In both cases, the combustion of these materials is 
not without detrimental consequences. The combustion 
process will contribute toward climate change via the 
emission of carbon dioxide, although it has been dem¬ 
onstrated that where combustion is carried out using 
only plastics (and taking into account avoided emis¬ 
sions from conventional power generation and heating, 
emissions from treatment of waste and emissions from 
conventional incineration fuel), net C0 2 emissions can 
be negative where efficiencies are high and electricity 
generation takes precedence over water heating. 

Plastics used in construction can contain additives 
comprising trace metals with detrimental effects on the 
environment. For instance, PVC window frames have 
historically contained lead stabilizers [108], although 
their use is being phased out in many countries. Addi¬ 
tionally, the combustion of plastic waste containing 
PVC has been partly blamed for the formation of 
dioxins and furans in the exhaust gases of incinerators 
[109]. These substances are condensed onto particu¬ 
lates which are collected by the pollution control equip¬ 
ment of incinerators and which must be disposed of 
with great care. 

However, probably the main objection to combus¬ 
tion as a waste management option is simply that there 
are arguably more sustainable uses that plastics and 
timber can be put to, which do not involve their 
destruction. 

Some of the problems of recovering energy from 
organic wastes are potentially avoided by thermally 
processing the material to produce solid, liquid, or 
gaseous fuels. This can be done either via biological 
processes, in the form of anaerobic digestion, or using 
pyrolysis-based techniques. 

Pyrolysis Pyrolysis is the thermal decomposition of 
organic substances in the absence of oxygen. Tempera¬ 
tures between 350°C and 900°C are required to effect 
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Construction and Demolition Wastes. Table 8 Higher heating values of a range of timber species in an oven-dry 
condition [107] 


Species 

Higher heating value, MJ/kg 

Species 

Higher heating value, MJ/kg 

Bigleaf maple 

19.5-19.6 

Red oak 

21.8 

Black cottonwood 

20.5 

Redwood 

21.4 

Cypress 

24.8 

Sitka spruce 

18.8 

Douglas-fir 

20.5-21.4 

Western hemlock 

18.6-20.1 

Hickory 

21.8 

Western red cedar 

22.6 

Lodgepole pine 

20.0 

White ash 

22.4 

Oregon ash 

19.1 

White birch 

21.7 

Oregon white oak 

18.9 

White cedar 

21.1 

Pitch pine 

28.4 

White fir 

18.6-19.3 

Ponderosa pine 

21.2-21.3 

White oak 

22.1 

Poplar 

22.4 

White pine 

22.4 

Red alder 

18.6 

Yellow pine 

24.1 


decomposition [110], although temperatures at the 
lower end of this range are more typical. The products 
of pyrolysis of polymers are smaller molecules (in both 
gas and liquid form) deriving from the decomposition 
of polymer chains and a quantity of char - solid 
carbon-rich residue. Table 9 shows the products of 
pyrolysis of a range of individual polymers used in 
construction. 

The gases formed are largely mixtures of carbon 
dioxide (C0 2 ), carbon monoxide (CO), methane 
(CH 4 ), ethane (C 2 H 6 ), ethene (C 2 H 4 ), propane 
(C 3 H 8 ), and propene (C 3 H 6 ), which require further 
refinement to produce gas which can be employed as 
fuel. As a result, it is the liquid or solid fractions which 
are more typically used as fuels. 

When pyrolysis is carried out for recovery of energy, 
it is desirable for the majority of liquid hydrocarbon 
products to consist of compounds in the gasoline 
range - molecules containing between 5 and 12 carbon 
atoms. A bias toward such a range can often be 
achieved through the use of catalysts, which also permit 
reductions in pyrolysis temperatures and times [110]. 
Additionally, where mixtures of polymers are present 
(which is the most likely scenario in the case of plastics 
deriving from construction and demolition waste), 
a significant amount of chemical interaction between 


Construction and Demolition Wastes. Table 9 Products 
of pyrolysis of polymers commonly used in construction [1 1 0] 


Pyrolysis products 

Low temperature High temperature 


Polymer (<500°C) (>500°C) 


Polyethylene 

• Waxes 

• Gases 

• Paraffin oil 

• Light oils 

• a-olefins 

Polypropylene 

• Soft paraffin 

• Gases 


• Olefins 

• Light oils 

Polyvinyl 

• HCI (<300°C) 

• Toluene 

chloride 

• Benzene 


Polystyrene 

• Styrene 

• Styrene 


• Styrene 

• Styrene 


oligomers 

oligomers 


polymers leads to the formation of lower molecular 
weight liquid fractions, albeit at lower yields [111]. 

In the case of processed demolition waste, it is often 
the case that plastics and timber waste will be present 
together due to their similar density. Co-pyrolysis of 
timber and plastics is an option in this case, although it 
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has both advantages and disadvantages. Co-pyrolysis of 
timber with mixed plastics typically has a negative 
influence on the yield of liquid hydrocarbons relative 
to the products obtained from plastics alone and 
appears to promote a shift toward gases richer in C0 2 
and CO [112, 113]. However, the presence of plastics 
alongside timber provides the advantage of reducing 
the oxygen content of the resulting compounds relative 
to the products of wood pyrolysis alone [113]. This is 
beneficial, since the presence of oxygen reduces the 
heating value of the liquid fraction and renders it 
corrosive. 

Examination of chemical interactions between cel¬ 
lulose in wood and individual polymers indicates that 
there is little reaction between most polymers and 
wood during pyrolysis [114]. The main exception to 
this is PVC, whose presence reduces the temperatures at 
which cellulose decomposes, and promotes the forma¬ 
tion of char, possibly as a result of acid catalysis by HC1 
released during pyrolysis. 

The high carbon content of char means that it is 
possible to employ it as a solid fuel in power generation 

[115] . Where pyrolysis is carried with the express aim 
of producing char for fuel, the process is referred to as 
“carbonization.” The successful carbonization of the 
combustible fraction of construction and demolition 
wastes has been demonstrated and the resulting char 
possessed a higher heating value of up to around 
23 MJ/kg. When compared with wastes from other 
sources, this was only exceeded by materials derived 
from shredded bulky wastes from a source of municipal 
waste and wastes from the food-processing industry 

[116] . 

Another related technique to extract fuels from 
organic wastes is gasification which involves heating 
wastes in the presence of oxygen and steam. The pro¬ 
cess starts with pyrolysis, but the presence of oxygen 
causes the hydrocarbons and some of the char to 
undergo partial combustion to form CO and C0 2 . 
Further CO and C0 2 are formed by the gasification 
reaction of the char, during which carbon reacts with 
water, also causing the formation of hydrogen gas (H 2 ). 
Methanation follows, during which carbon in the char 
reacts with hydrogen to form methane (CH 4 ). The 
carbon monoxide subsequently undergoes water-gas 
shift with water to form more hydrogen and C0 2 . 
Reforming reactions involve a reaction between CH 4 


and both water and C0 2 to yield further H 2 and CO. 
Thus, a gas is produced which is rich in H 2 and CO 

[117]. This synthetic gas, or “syngas,” can be burnt as 
fuel, with higher efficiency than combustion of the raw 
waste, or used as a raw material in the manufacture of 
other petrochemical products. From a sustainability 
perspective, the latter option is likely to be viewed in 
more favorable terms. Additional products in the form 
of char and tar (higher molecular weight hydrocar¬ 
bons) are also formed. 

Co-gasification of plastics and wood (conducted in 
combination with coal, which assists in stabilizing the 
process) has been examined, with the conclusion that 
where utilization as fuel is the overall aim, high tem¬ 
peratures are required, compromising efficiency. 

The formation of HC1 during both pyrolysis and 
gasification of mixed plastic waste containing PVC is 
potentially a problem, since pollution control require¬ 
ments will be heavier. However, a gasification process 
has been proposed in which separation of HC1 is 
conducted and revenue generated from its sale for use 
as a raw material in other chemical manufacturing 
processes [118]. 

Anaerobic Digestion Anaerobic digestion, like 
composting, relies on microorganisms to break down 
wastes into gas, liquid, and solid products. Unlike 
composting, anaerobic digestion is carried out in the 
absence of gaseous oxygen, which is achieved by 
allowing digestion to occur within a sealed tank. The 
microorganisms involved still require oxygen, and so in 
this environment, this must be obtained from com¬ 
pounds present in the waste. 

The digestion process produces gas comprising 
C0 2 , ammonia (NH 3 ), hydrogen sulfide (H 2 S), and 
a relatively large quantity of CH 4 , the latter giving this 
product value as fuel. It is frequently the case that the 
gas is treated prior to use to remove H 2 S, which would 
otherwise contribute toward acid rain. Additionally, 
liquid and solid digestates are produced, both of 
which can be employed as fertilizer. 

This option is clearly only applicable to organic 
construction and demolition wastes, and the lack of 
biodegradability of construction polymers excludes 
them from such a technique [119]. While there has 
been growth in the use of biodegradable plastics in 
many areas, this has not been the case in construction, 
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where durability is an essential requirement, and usu¬ 
ally one of the main reasons for using plastics in the 
first place. 

Anaerobic digestion of timber alone is not an 
option - the process requires nutrients in the form of 
nitrogen and potassium, and these are present in very 
low concentrations. However, the option of co¬ 
digestion exists, where multiple-source wastes are 
blended to achieve higher yields of methane than 
would be obtained with single sources [120]. 

Two of the most common feedstocks for anaerobic 
digestion are municipal biosolids and manure from 
livestock. Both of these wastes have previously under¬ 
gone digestion and, consequently, contain a relatively 
little carbon, yielding low methane levels. As for aero¬ 
bic composting, an optimum C/N ratio of between 
20 and 30 is necessary for optimum digestion, which 
can be achieved by co-digesting biosolids or manure 
with materials with higher carbon content, such as 
timber. 

The high levels of lignin present in timber mean 
that a large proportion remains undigested. The poten¬ 
tial to use this solid digestate as fertilizer may be 
compromised by the presence of metals, particularly 
those deriving from preservatives. Recently, a two-stage 
anaerobic digestion process starting with a stage in 
which the feedstock is acidified and liquefied to remove 
a large proportion of these metals has been proposed 
[ 121 ]- 

Environmental and Sustainability Issues 

Environmental Hazards of Construction and 
Demolition Waste 

Despite the environmental benefits of recycling con¬ 
struction and demolition waste, the composition of 
such material can potentially present hazards to both 
human health and the environment. Key concerns 
include general dust, asbestos, and the leaching of 
constituents into the surrounding environment. 

Dust Construction and demolition activities can 
have the effect of raising atmospheric particulate levels 
which threaten public health [122], and links between 
airborne demolition particulates and exacerbation of 
conditions such as asthma have been established [123]. 
Additionally, levels of fungi particles in the air increase 


during demolition activities, potentially increasing the 
rate of occurrence of lung infections [124]. It can 
therefore be concluded that processing of construction 
and demolition waste can potentially have similar 
implications. In practice, however, it has been found 
that control of particulates is wholly feasible through 
the application of water using bowsers, sprays, and 
wheel washers and through the use of sweepers, 
hoods, extractor fans, and balanced air systems where 
processing is carried out indoors [125]. Demolition 
waste processing facilities employing air classification 
will often control particulate emissions downstream 
using pollution control devices such as cyclones [18]. 

Asbestos The use of asbestos in structures began on 
a large scale in the late nineteenth century. The material 
was used in construction in cements, plasters, boards, 
and reinforcement in plastic components, such as 
flooring tiles, principally for its good resistance to 
heat and chemical attack. Relatively soon after asbestos 
use started on a large scale, respiratory health problems 
in people working with the material began to be 
identified. Inhalation of fibers of asbestos leads to 
asbestosis as well as various forms of cancer. The 
brown and blue forms of asbestos are the most hazard¬ 
ous, although all forms are now known to be harmful 
to health [126]. 

Asbestos in construction is now prohibited in most 
parts of the world. However, many structures built or 
refurbished before the 1980s can potentially contain the 
material in various forms. Demolition of structures 
containing asbestos clearly presents a risk of releasing 
this substance into the environment. While atmo¬ 
spheric release of fibers can be controlled by spraying 
with water [127], the dispersal of asbestos into the 
wider environment is clearly not desirable. Thus, in 
many countries, it is a legal requirement that asbestos 
is removed from structures prior to demolition. 

Asbestos cannot be readily isolated from other 
components in the mineral fraction of demolition 
waste. Moreover, removal of asbestos after size reduc¬ 
tion would potentially permit quantities of fibers to be 
released. For this reason, the only practical means of 
guaranteeing the absence of asbestos is to ensure thor¬ 
ough removal prior to demolition, and this, in turn, 
requires a comprehensive survey of the fabric of the 
structure to identify occurrences of this material. 
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Leaching Most research carried out into the leaching 
of chemical constituents from construction and demo¬ 
lition waste has concentrated on leaching occurring in 
landfill facilities. However, much of the general find¬ 
ings of this work can be applied to situations in which 
the material is employed in an application, with the 
qualification that the chemical conditions within land¬ 
fill may differ considerably from those elsewhere. 

While the majority of inorganic ions typically 
found to leach from construction and demolition 
waste are not toxic, their concentrations in leachate 
can be of such a magnitude as to exceed regulatory 
drinking water limits [128]. Cations released in large 
quantities can include calcium, potassium, sodium, 
magnesium, manganese, and iron, while anions are 
mainly sulfate, chloride, carbonate, and hydroxide 
[128-130]. 

The types of trace metal found to leach from con¬ 
struction and demolition waste are very much depen¬ 
dent on the types of material present. Trace metals 
found to leach from construction and demolition 
waste have included arsenic, barium, cadmium, chro¬ 
mium, copper, lead, nickel, and zinc [128-130]. Of 
these, arsenic, chromium, and lead are of greatest con¬ 
cern due to their toxicity [131]. 

Much of the arsenic and chromium derives from 
waste timber which has been treated with CCA wood 
preservative [128]. Of these two elements, the greatest 
hazard is usually considered to be presented by arsenic 
on the grounds that it is more toxic. Moreover, chro¬ 
mium in treated timber will typically be in the trivalent 
(Cr(III)) form, which is significantly less toxic than the 
hexavalent (Cr(VI)) form [132, 133]. However, 
leaching of chromium in high pH environments 
could potentially oxidize chromium to its hexavalent 
state [134]. 

The source of manganese has been speculated as 
also being from timber [ 135] , while one possible source 
of lead is lead-based paint [136]. 

The leaching of organic compounds has been exam¬ 
ined in research aimed at establishing potential risks 
presented by fines from processed construction and 
demolition waste [137]. The conclusion drawn from 
this work is that, while organic compounds are 
detected in leachate from such wastes (deriving from 
sources such as asphalt, paints, varnishes, and fuel), 
they are normally found well below regulatory limits. 


Wastewater Where wet processes are used during the 
processing of construction and demolition waste, 
appropriate measures need to be taken to ensure that 
waste water is disposed of correctly and in a form which 
is compatible with local environmental requirements. 
Most commonly, the use of settlement tanks and filters 
is employed to ensure particulate levels in waste water 
are reduced to acceptable levels. 

Evaluating Impact 

Sustainable management of construction and demoli¬ 
tion wastes using the waste hierarchy as a general guide 
is largely a sound approach. However, it should be 
stressed that there are likely to be exceptions to this. 
A good example of such a grey area is the production of 
RA and RCA. 

The processing of demolition waste to produce RCA 
and RA involves operations not dissimilar to the pro¬ 
duction of virgin crushed rock aggregate. For this rea¬ 
son, the embodied energy of the material is likely to be 
only slightly less than that of virgin aggregate, with any 
energy advantage coming from the potential reduction 
in transportation distances (particularly if the aggregate 
can be used on the site on which it is being processed). 
Since emission rates of pollution from such processing 
are likely to closely correlate with embodied energy, 
these will also be close to that of virgin aggregate. 

Where the sustainability of a waste management 
option is in question, it is clearly of great value to 
conduct an analysis of the environmental impact of 
using construction and demolition waste in 
a particular application against an equivalent product 
derived from virgin material. A useful tool in carrying 
out such an analysis is environmental life-cycle assess¬ 
ment (LCA), which is a systematic methodology for 
compiling an inventory of the inputs (resources and 
energy) and outputs (pollution and waste) associated 
with the unit processes that make up the entire life cycle 
of a material and then assigning quantitative measures 
of environmental impact to each of these [138]. 

This type of analysis can also be extremely useful in 
making larger-scale decisions about waste management 
strategies for construction and demolition waste. A study 
carried out in Sweden examined two scenarios - 
one in which the maximum level of recycling of con¬ 
struction and demolition waste was achieved, and one 
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in which the maximum level of reuse was achieved 

[139] . Recycling in this case included combustion of 
certain materials with the recovery of energy. The ben¬ 
efits of the two different approaches were measured in 
terms of the amount of mineral resources and energy 
saved. The main energy savings from both scenarios 
were in the recycling or reuse of metals and masonry, 
and the combustion or reuse of timber. The most 
significant conservation of mineral resource in both 
cases was gravel. The study also examined projected 
future scenarios, potentially providing useful informa¬ 
tion to those making planning decisions. 

It should be stressed that levels of resource use and 
environmental impact will not be the only factors 
which decide the extent to which an option for utilizing 
construction and demolition waste is sustainable. 
As well as the need for a utilization option to have 
reduced embodied energy and associated pollution 
emissions when compared to a conventional product, 
the processing requirements must have a lower cost and 
the material must have at least an equal market value 

[140] . Moreover, consideration of the renewability 
of the primary resource from which a material is 
derived should also be given consideration. For 
instance, taking into account renewability, the finite 
nature of metal ores in the Earth means that there is 
potentially a stronger argument for recycling metals 
than for recycling timber deriving from well-managed 
sources. 

Future Directions 

The drivers for growth in the recycling and reuse of 
construction and demolition waste include scarcity of 
landfill space, legislation, and taxation. It is likely that 
these incentives toward more sustainable approaches to 
managing this material will increase in the future, with 
additional pressures potentially coming from increas¬ 
ing energy prices and resource scarcity - particularly 
for metals and polymers. 

Another future development is likely to be the plac¬ 
ing of greater emphasis on “producer responsibility” in 
the field of construction and demolition waste. The 
philosophy behind producer responsibility is essen¬ 
tially an extension of the “polluter pays” principle - 
the responsibility for managing waste is placed with the 
sector which produces it. In Europe, one way in which 
this is achieved is by requiring waste producers in 


certain sectors to satisfy quotas for the collection and 
processing of waste for recycling or reuse. This is, in 
fact, frequently carried out by other parties who are 
then able to sell credits to the industry in question 
which contribute toward satisfying the quota. How¬ 
ever, this is by no means the only way in which pro¬ 
ducer responsibility can be implemented. For instance, 
by placing taxation on a product which reflects the cost 
of disposal, an incentive for waste reduction is created 
for the producer. 

These increasing demands for sustainable manage¬ 
ment of construction and demolition waste are likely to 
have two major influences on the future direction of 
development in this area - a drive toward more appro¬ 
priate use of waste materials and increased emphasis on 
deconstruction over demolition. 

Currently, much of the material deriving from the 
processing of demolition waste is being employed at its 
lowest engineering value - as a general fill material 
where the main property being utilized is simply the 
material’s volume. Application of sustainable philoso¬ 
phies and the increasing economic burdens placed on 
those responsible for utilizing construction and demo¬ 
lition waste are both likely to drive management strat¬ 
egies to higher levels of the waste hierarchy. This would 
mean materials being elevated to their most appropri¬ 
ate point on the waste hierarchy where their perfor¬ 
mance as a recycled or reused item is comparable to 
that of a conventional product at comparable or 
lesser cost. 

The key to obtaining the largest amount of value 
from a waste material, in both engineering and eco¬ 
nomic terms, lies in efficient separation. For this rea¬ 
son, it is highly likely that future developments will 
include shifts toward both deconstruction and design 
of structures with deconstruction in mind. It is also 
likely to mean a growth in infrastructure for the collec¬ 
tion and processing of individual construction and 
demolition waste components and further develop¬ 
ment of technologies for separation. 
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Glossary 

After shocks Smaller earthquakes following the largest 
one (the main shock). 

Amplification Increase in the amplitude of earth¬ 
quake wave depending on the nature of ground. 

Amplitude The maximum height of a wave crest or 
depth of a trough. 

Attenuation Decrease in the amplitude of earthquake 
waves. 

Earthquake The vibration of the Earth caused by the 
passage of seismic waves. 

Epicenter The point on ground surface directly above 
the focus (hypocenter) of an earthquake. 

Focus The place at which an earthquake is initiated. 

Frequency Number of vibrations per unit time. 

Fundamental period It is the period of an oscillating 
structure when subjected to shaking. 

Ground motion Movements of ground caused by the 
vibrations of earthquake. 

Intensity A measure of ground shaking obtained from 
damage to man-made structures (Modified 
Mercalli Scale). 

Isoseismal map Contour lines drawn to separate one 
level of seismic intensity from another for a given 
earthquake or maximum intensity expected of 
earthquakes in a given area. 

Magnitude A measure of energy released by an earth¬ 
quake (Richter Scale). 

Main shock The main earthquake event, which may be 
followed by aftershocks. 

Mitigation To lessen or alleviate the damage likely to 
be caused by an earthquake. 

Period The time interval between successive crests or 
troughs of an earthquake wave. 

Resonance Increase in the amplitude of an earthquake 
caused by an overlap between the frequency of 
ground and that of the structure resting on it. 

Retrofit To bring back to stable condition after having 
been damaged by an earthquake. 

Return period Recurring period of an earthquake of 
a given magnitude or intensity. 


Seismic wave An elastic wave generated by natural or 
man-made causes. 

Seismic zoning A map based on the distribution of 
earthquakes, classified according to expected inten¬ 
sity translated into acceleration as the percentage of 
gravity. 

Seismicity The occurrence of earthquakes in space 
and time. 

Seismology The study of earthquakes. 

Tectonic plates Major regions of the Earth separated 
by geological faults along which these regions 
(plates) move relative to each other. 

Definition of the Subject 

Earthquakes are simply the shaking of the ground ini¬ 
tiated by natural or man-made causes. The waves gen¬ 
erated by earthquakes are called seismic waves, and they 
are broadly classified into body waves and surface 
waves. A point from which the earthquake waves first 
emanate is called the earthquake focus. Moreover, the 
point directly above the focus is called the epicenter of 
an earthquake. Natural earthquakes can be caused by 
tectonic, volcanic (related to movement of tectonic 
plates), or collapse-related behavior of rock/soil. 
Man-made shaking, on the other hand, can be initiated 
by a variety of activities such as blasting, underground 
nuclear explosions, movement of heavy vehicles, and so 
forth. 

Natural earthquakes are known to have occurred 
throughout history [1,2]. They have left their traces as 
manifestations of geological processes at work on the 
geological time scale. These traces are generally evident 
in places where people lived, and no such records are 
available for places which were uninhabited. Most of 
these earthquakes are located along tectonic plate 
boundaries, while some are located in the intraplate 
regions of the world [3], as reflected in global hazards 
maps published by the United States Geological Survey. 

Many lessons were learned from the damage caused 
by the Great California earthquake of 1857 and subse¬ 
quent earthquakes associated with movements along 
the San Andeas fault [4] . The great Alaskan earthquake 
of 1964 [5] and a history of earthquakes in Lima, Peru, 
and elsewhere have helped in developing guidelines for 
the design of earthquake-resistant small and higher rise 
buildings, under different ground conditions [6]. 
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The Engineering Research Institute in California 
has recently prepared a report dealing with contribu¬ 
tions of earthquake engineering to protecting commu¬ 
nities and critical infrastructure from multi-hazards. 
Both, the 2009 International Building Code in general 
and the Seismic Design Maps and Tools for engineering 
in particular give guidance to these questions. 

Introduction 

Earthquakes are one of the most destructive environ¬ 
mental hazards. The power of an earthquake is 
expressed according to its intensity or magnitude. The 
effect of both the parameters generally decreases as the 
distance of the affected place increases from the epi¬ 
center of the earthquake. 

The Modified Mercalli Intensity Scale [7], as shown 
in Table 1, ranges from the lowest of 1 (not felt except 
by a few) to a maximum of 12 (total damage, where 
objects are thrown into the air). The intensity is 
a noninstrumental quantity assigned according to 
observed geological effects, damage to structures, and 
the perception of shaking by people and animals at 
a given place. It is not a fixed quantity, but varies 
from place to place. The magnitude of an earthquake, 
on the other hand, as shown in Table 2, is a fixed 
instrumental quantity. It is a measure of the energy 
released by an earthquake [8]. For example, the mag¬ 
nitude of six is approximately equal to the energy 
released by the atomic bomb used at Hiroshima during 
the Second World War. 

Ground Motion Considerations 

The outcome of earthquakes on buildings depends 
generally more on the severity of ground shaking than 
on any other single factor. Two earthquakes of the same 
magnitude, but located in two different areas, may 
produce quite different effects on buildings. The 
depth of an earthquake from its focus is an important 
factor in this regard. Generally, the shallower the earth¬ 
quake, the more severe the effect. The energy of deeper 
earthquakes is dissipated over a larger volume of rock, 
while that of the shallower earthquake is spread over 
a smaller volume of rock. 

Engineers use strong-motion seismographs to 
record how the ground shakes during an earthquake. 
The most important parameters of ground motion 


(free-field motion) obtained from these instruments 
are the peak velocity, the peak acceleration, the fre¬ 
quency or period of waves, and the duration of strong 
motion. The peak velocity determines how fast the 
ground is shaking, whereas the peak acceleration indi¬ 
cates how quickly the velocity is changing, and the 
latter is more often used in the engineering design of 
structures. It may be borne in mind that the force 
generated by an earthquake is a product of the mass 
of a structure and the ground acceleration. 

Ground Condition Considerations 

Earthquake shaking is attenuated or reduced with dis¬ 
tance from the epicenter. Nevertheless, for a fixed dis¬ 
tance, the type of geological material has also a strong 
influence on the nature and magnitude of ground 
motion. It may be emphasized that the high-frequency 
(short-period) earthquake waves, transmitted by 
strong rocks, attenuate more rapidly with distance 
than the low-frequency (long-period) waves transmit¬ 
ted by soils (Fig. 1). Furthermore, earthquake waves 
change their characteristics as they encounter 
a material different from the one through which they 
were initially propagating. For instance, the high- 
frequency waves, while passing through rocks, are 
slowed down as they enter the overlying soil cover, 
but their period is increased instead. A clear distinction 
must, however, be made between attenuation described 
above and amplification. Although the ground motion 
of soils is attenuated more than that of rocks, the 
reverse is true as far as amplification of ground motions 
in soil. For the same epicenter distance, the thicker the 
soil cover, the greater the amplification factor, and the 
longer the duration of ground shaking. Topographic 
variations can also cause amplification of seismic 
waves. Summits of isolated hills and edges of plateaus 
and cliffs are sites of large amplification within a fairly 
wide frequency range. 

Ground and Structure Interaction 

Fundamental periods or frequencies of buildings vary 
according to the height and nature of buildings. When 
such structures are subjected to free-field ground 
motions, they respond differently to different frequen¬ 
cies or periods of ground motion. For the same ampli¬ 
tude of ground motion, it is the resonance of 
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Construction Planning, Environmental Impact of Foundation Studies and Earthquake Issues. Table 1 Abridged 
modified Mercalli Intensity Scale [1] (1956 version) 


Description 

1 . 

Not felt except by very few under-favorable circumstances (1 Rossi-Forel scale) 

II. 

Felt only by a few people at rest, especially on upper floors of buildings 

Delicately suspended objects may swing (l-ll on Rossi-Forel scale) 

III. 

Felt quite noticeably indoors, especially on upper floors of buildings, but many people do not recognize it as an 
earthquake. Standing automobiles may rock. Vibration like passing of truck (3 on Rossi-Forel scale) 

IV. 

During the day felt indoors by many, outdoors by few. At night, some awakened. Dishes, windows, doors 
disturbed; walls make creaking sound. Sensation like heavy truck striking building. Standing automobiles rocked 
noticeably (IV-V or Rossi-Forel scale) 

V. 

Felt by nearly everyone, many awakened. Some dishes, windows, and so on broken; cracked plaster in a few places; 
unstable objects overturned. Disturbance of trees, poles, and other tall objects sometimes noticed. Pendulum 
clocks may stop (V-VI Rossi-Forel scale) 

VI. 

Felt by all, many frightened and run outdoors. Some heavy furniture moved; a few instances of falling plaster and 
damaged chimneys. Damage slight (VI—VII Rossi-Forel scale) 

VII. 

Everybody runs outdoors. Damage negligible in buildings of good design and construction; slight to moderate in 
well-built ordinary structures; considerable in poorly built or badly designed structures; some chimneys broken. 
Noticed by people driving cars (VIII Rossi-Forel scale) 

VIII. 

Damage slight in specially designed structures; considerable in ordinary substantial buildings, with partial collapse; 
great in poorly built structures. Panel walls thrown out of frame structures. Fall of chimneys, factory stacks, 
columns, monuments, and walls. Heavy furniture overturned. Sand and mud ejected in small amounts. Changes in 
well water. People driving cars disturbed (VIII + to IX Rossi-Forel scale) 

IX. 

Damage considerable in specially designed structures; well-designed frame structures thrown out of plumb; great 
in substantial buildings, with partial collapse. Buildings shifted off foundations. Ground cracked conspicuously. 
Underground pipes broken (IX+ Rossi-Forel scale) 

X. 

Some well-built wooden structures destroyed; most masonry and frame structures destroyed with foundations; 
ground badly cracked. Rails bent. Landslides considerable from river banks and steep slopes. Shifted sand and 
mud. Water splashed, slopped over banks (X Rossi-Forel scale) 

XI. 

Few, if any, (masonry) structures remain standing. Bridges destroyed. Broad fissures in ground. Underground 
pipelines completely out of service. Earth slumps and land slips in soft ground. Rails bent greatly 

XII. 

Damage total. Waves seen on ground surface. Lines of sight and level distorted. Objects thrown into the air 


Masonry A, B, C, D. To avoid ambiguity of language, the quality of masonry, brick, or otherwise is specified by the following lettering 
Masonry A. Good workmanship, mortar, and design; reinforced, especially laterally, and bound together by using steel, concrete, etc.; 
designed to resist lateral forces 

Masonry B. Good workmanship and mortar; reinforced, but not designed in detail to resist lateral forces 

Masonry C. Ordinary workmanship and mortar; no extreme weaknesses like failing to tie in at corners, but neither reinforced nor 
designed against horizontal forces 

Masonry D. Weak materials, such as adobe; poor mortar; low standards of workmanship; weak horizontally 


frequencies (or periods) of ground motion and that of 
the building which is the cause of maximum damage. 
Thus, in terms of the frequency content of an earth¬ 
quake wave, low-rise buildings (such as ordinary one- 
or two-story dwellings and similar structures) are more 


affected by high-frequency or short-period ground 
motions, as compared to low-frequency or long-period 
motions of the ground. The reverse is true for the high- 
rise buildings, which are less affected by high-frequency 
or short-period motions of the ground. This change in 
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Construction Planning, Environmental Impact of Foun¬ 
dation Studies and Earthquake Issues. Table 2 Energy 
release of an earthquake. Magnitude (M) - energy (E) rela¬ 
tion can be given the following formula: 

Log E = 11.8 + 1.5 M. 

If M is increased by 1.0, E is magnified by a factor of 10 1 ' 5 , 
i.e., approximately 32. Therefore, the seismic energy of an 
M = 6 earthquake is about 32 times as large as that of an 
earthquake M = 5 earthquake, and is about 1,000 times 


that of an M = 4 earthquake. 

Magnitude and energy of earthquakes (Modified from 
Earthquake Mechanics, [8] 


Magnitude (M) 

Energy 
(tons of TNT) 

Nuclear bomb 
equivalence 

4.0 

15 

0.001 x Hiroshima 

0.0008 x Nagasaki 

4.5 

86 

0.006 x Hiroshima 

0.004 x Nagasaki 

5.0 

478 

0.03 x Hiroshima 

0.02 x Nagasaki 

5.5 

2,629 

0.17 x Hiroshima 

0.13 x Nagasaki 

6.0 

15,057 

1 x Hiroshima 

0.75 x Nagasaki 

6.5 

86,042 

5.7 x Hiroshima 

4.3 x Nagasaki 

7.0 

478,011 

31.7 x Hiroshima 

23.8 x Nagasaki 

7.5 

2,639,063 

175 x Hiroshima 

131 x Nagasaki 

8.0 

15,057,361 

1,000 x Hiroshima 

750 x Nagasaki 

8.5 

860,420,065 

57,514 x Hiroshima 

4,285 x Nagasaki 


TNT (Trinitrotoluene) is a chemical compound 

Dynamite has 60% greater energy than TNT of the same weight 

Hiroshima atom bomb ~ 15,000 t TNT 

Nagasaki atom bomb ~ 20,000 t TNT 

behavior of buildings is mainly attributed to resonance 
of soil-structure interaction, which can cause more 
damage when the ground motion frequency of 


earthquake waves and that of the building structure 
are the same. It is, therefore, advisable to avoid high- 
rise buildings (low-frequency) on low-frequency soils, 
and, similarly avoid low-rise buildings (high-fre¬ 
quency) on high-frequency grounds. 

It must also be remembered that the frequency of 
earthquake waves decreases as the distance of the site in 
question increases. Therefore, it is not surprising to 
note that sometimes high-rise buildings located farther 
from the epicenter suffer more than high-rise buildings 
located closer to the epicenter. This is particularly 
noticeable in situations where the buildings are under¬ 
lain by rock. Low-rise buildings, founded on rock and 
close to the epicenter, are more vulnerable to damage 
than the high-rise buildings situated in the same area. 

It has been shown from the foregoing that the 
damage to structures in earthquake-prone areas is 
largely dependent on ground acceleration; depth of 
earthquake focus; distance of the epicenter from the 
structure; site conditions including the nature of the 
ground, its thickness, and topography of the site; dura¬ 
tion of ground shaking; and last but not the least, the 
type of structure. It must be emphasized that for engi¬ 
neering purposes, it is the intensity or resulting accel¬ 
eration of the ground motion that is important and not 
the magnitude of the earthquake. 

Mitigation and Construction Planning 

Earthquake risk varies from location to location, from 
structure to structure, and from person to person. 
Damage due to earthquakes is particularly great in 
certain locations, and it is very difficult to prevent the 
earthquakes from occurring. Nevertheless, precautions 
must be taken to mitigate the impacts of earthquake in 
the vulnerable area. Earthquakes are often followed by 
aftershocks, which can be more detrimental to the 
main earthquake shock, because the structures may 
have weakened by the main shock. The magnitude of 
the main shock is generally greater than the after¬ 
shocks, the magnitude of which generally decreases 
with time and can be empirically determined as to 
how long the aftershocks will continue. 

Earthquakes are inevitable, but the damage from 
earthquakes is not. Safety and survival in an earthquake 
is very important. It is important to be prepared and 
take necessary precaution to reduce losses; in the event 
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Construction Planning, Environmental Impact of Foundation Studies and Earthquake Issues. Figure 1 

Effect of earthquakes on low- and high-rise buildings founded on soil and rock 


of an earthquake, it is advisable to protect belongings 
and persons from falling objects. The instructions 
published by the United States Geological Survey, 
Department of the Interior, are very helpful in this 
regard. 

If need be, vacate or demolish hazardous old build¬ 
ings, and retrofit by strengthening the newly built 
structure, as well as regulating the construction of 
new buildings by adopting methods that cater to 
ground-structure interaction principles. Instances of 
fire, breaking of pipelines, disruption of electricity 
and water supply, and outbreak of disease are common 
following earthquakes, and among other things, they 
call for rescue, relief, rehabilitation of displaced people, 
relocation, and reconstruction of structures at safer 
places. The migration of displaced people from one 
area to another creates problems of adaptability, and 


issues raised by the native community in accepting 
people from different social backgrounds. The burden 
on the native community can be reduced by the 
construction of additional infrastructure such as hos¬ 
pitals, schools, community centers, market places, and 
so on. 

In an attempt to avoid adverse environmental 
impact to buildings, it is important to prepare 
isoseismic maps of major earthquakes, which may 
have occurred in the area [10, 11]. These maps delin¬ 
eate areas of equal damage or of equal intensity of the 
earthquake in the past. A similar exercise can be initi¬ 
ated to delineate areas in terms of return periods of 
earthquakes of the same intensity. Furthermore, seis¬ 
mic zoning maps can be prepared to define areas which 
are extremely, highly, moderately, and slightly suscep¬ 
tible to the likely damage, which may be caused by an 
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earthquake of a specified return period. Building codes 
should be prepared and enforced to avoid damage to 
high-risk installation as well as buildings in different 
seismic zones. 

Future Directions 

Natural earthquakes are a major threat to society. 
Therefore, monitoring earthquakes is essential for 
providing scientific data to investigate complex 
earthquake phenomena, and to mitigate seismic haz¬ 
ards [12]. 

The earthquake early warning systems offer practi¬ 
cal information for reducing seismic hazards in earth¬ 
quake-prone regions. Attempts will be made here to 
mention about some recent developments followed by 
suggestions for future work, in this direction. 

Over the past few decades, notable progress has 
been made in the understanding of some of the factors 
and processes that govern the occurrence of earth¬ 
quakes. For instance, it is known [13] that the amount 
of damage caused by an earthquake is not only related 
to its magnitude, depth, and geographical location of 
the site, but also factors such as the type and quality of 
affected buildings, as well as the nature of the ground 
on which such buildings are constructed. 

Building codes, relevant to different regions of the 
world, defined by earthquake zoning, earthquake 
recurrence interval, and isoseismic maps help in pro¬ 
viding guidelines for the design, evaluation, and in 
some instances rehabilitation or retrofitting of building 
structures, in a given area. 

The advent of Global Positioning Systems (GPS) 
opened a new avenue of opportunity to seismologists 
for monitoring movements in geographical positions 
of marked stations, and allows better long-term fore¬ 
casts of areas likely to be affected by earthquakes. This 
development together with advances in telemetry and 
invention of earthquake early warning systems linked 
with real-time seismology offer practical information 
for reducing seismic hazards in earthquake-prone 
regions. Such systems are already successfully deployed 
in Mexico, Japan, and Taiwan [14]. However, there is 
no general method for predicting earthquakes in the 
actual time domain, and only probable estimates of 
recurrence of earthquakes of a given magnitude are 
available. Moreover, as described above, there are 


techniques that can detect an earthquake in progress 
and provide notice of seconds to tens of seconds, prior 
to actual ground shaking. 

Real-time seismology refers to a practice in which 
seismic data are collected and analyzed quickly after 
a significant seismic event. The results thus obtained 
can be effectively used for post-earthquake emergency 
response and early warning. Such systems, on 
their own, can generally detect strong shaking at an 
earthquake’s epicenter, and transmit alerts ahead of 
the damaging earthquake waves. The speed of an elec¬ 
tronic warning signal, in time domain, is faster than 
the speed of earthquake waves traveling through the 
earth. 

Unfortunately, there is no worldwide public early 
warning system for earthquakes. It is important to 
use this relatively new science and modern technology 
to rapidly detect the beginning of an earthquake, 
assess the hazard that the earthquake poses, and then 
provide a warning to people if they are in any harmful 
situation. Such early information about past earth¬ 
quake occurrences is particularly useful in the disci¬ 
pline of disaster management. It calls for hazard 
mitigation plans from on-site and regional bases to 
consider post-earthquake emergency response and 
early warning. 

There is a real need for a radically different design of 
seismographic networks employing earthquake early 
warning system. The importance of actively exploring 
the potential of wireless sensor networks cannot 
be overemphasized. However, building a radically 
different design of high-density seismic networks at 
national and international levels is not economical 
using existing seismic technology. Innovative 
approaches must be developed and perseverance is 
needed. 
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Glossary 

Contaminant Any substance which, in sufficient con¬ 
centration, can adversely affect living organisms. 
Contaminated soil The soil containing substances at 
levels where they would not normally be present. 
Ex situ treatment technologies Technologies that 
require excavation or removal of the contaminated 
source media. 

Groundwater The subsurface water that lies beneath 
the water table in soils and geologic formations 
which are fully saturated. 

Groundwater treatment technologies Technologies 
that address the contaminated groundwater 
directly. 

In situ treatment technologies Technologies that 
treat or remove the contaminant without excava¬ 
tion or removal of the source media. 

Nonaqueous phase liquids (NAPLs) Refer to liquids 
that are immiscible with water, which can be further 
classified to less dense (light NAPLs or LNAPLs) 
or more dense (dense NAPLs or DNAPLs) than 
water. 

Soil The three-phase system of the earth, including 
a solid, a gas, and a liquid phase, which might 
range from very soft organic deposits through less 
compressible clays and sand to soft rock. 
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Soil remediation The collective term for various strat¬ 
egies that are used to purify and revitalize soil. 
Source treatment technologies Technologies that 
address only the source of contamination, namely, 
soil, sediment, sludge, solid-matrix wastes, or 
NAPL, but do not address groundwater directly. 

Definition of the Subject and Its Importance 

“Contaminated soil,” “contaminated site,” “contami¬ 
nated land” are different terms for the same environ¬ 
mental problem, which is encountered in millions of 
sites across the world. Soil becomes contaminated when 
a single or a mixture of contaminants reaches the soil 
surface, the vadose zone, or the aquifer. Soil is a limited 
and valuable natural resource, since it controls the 
element and energy cycles within the ecosystems, it is 
the habitat to countless organisms and plants, but, 
mainly, because it accommodates the groundwater, 
which makes up 97% of global freshwater, and the 
most important source of drinking water across the 
world [ 1 ]. However, at the same time, soil is the field 
that accommodates numerous essential socioeconomic 
and inevitably contaminating human activities. As 
a result, the extended production of a high number of 
chemicals and their wide use for domestic, industrial, 
and military reasons, combined with the common 
practice of its improper storage and neglectful disposal, 
has resulted in millions of contaminated sites across the 
world [2]. Soil contamination is an old problem dating 
back to the Industrial Revolution. However, this crucial 
environmental problem failed to receive prompt atten¬ 
tion, and it was only recognized when incidents such as 
the Love Canal site in New York and the Lekkerkerk site 
in the Netherlands were published in 1970s, increasing 
the public concern about this serious environmental 
issue. The extent of the above environmental disorder 
is vividly portrayed if one bears in mind that the esti¬ 
mated cost of the potential remediation actions exceeds 
hundreds of billions of US dollars and the estimated 
number of contaminated sites today exceeds 500,000 in 
the USA, 30,000 in Canada, and 500,000 in Japan, while 
there are no available data for Eastern Europe, Africa, 
South America, and for key countries such as China 
and India [2]. Additionally, in European Union (EU), 
the number of contaminated sites requiring cleanup 
has been estimated as approximately 250,000 and the 


potential polluting activities have been expected to 
have occurred at nearly three million sites [3]. These 
numbers underline the significant lack of data for the 
EU and underscore the apparent differences in 
reporting among different geographical and other enti¬ 
ties, where “apples” do not always mean “apples” 
depending on which country or organization does the 
reporting. It is, thus, imperative that a common lan¬ 
guage of data on environmental damage and solutions 
should be developed and implemented. 

Furthermore, taking into account the large num¬ 
bers of known or estimated contaminated sites across 
the world and the huge budget for soil remediation, the 
focus should be on those contaminated sites that 
potentially pose the highest risks to humans and the 
environment. Therefore, appropriate decision-making 
tools have been developed and applied to prioritize 
contaminated sites, thereby saving valuable resources 
for the most significant of them [4]. 

Contaminated soil remediation is one of the most 
rapidly developing environmental protection subjects. 
However, the high cost that soil remediation requires 
along with the complexity of the problem renders con¬ 
taminated soil remediation a real challenge for envi¬ 
ronmental professionals and society altogether. 

Introduction 

Soil is usually defined as the three-phase system of the 
earth, including a solid, a gas, and a liquid phase, which 
might range from very soft organic deposits through less 
compressible clays and sand to soft rock [5]. Therefore, 
soil contamination is by definition a very complex prob¬ 
lem to solve, since a three-phase medium is contami¬ 
nated by chemicals, commonly as mixtures, which not 
only interact together but also with each phase of the soil 
concurrently. As a result, the way in which this system is 
changing both in place and in time depends on a series of 
different and interacting parameters relating both to the 
soil and the contaminants. Contaminated soil remedia¬ 
tion is usually approached using a phased methodology, 
during which the contaminated soil is investigated, 
the remedial goals are determined and the most appro¬ 
priate remediation technology is selected. The most 
commonly employed remediation technologies include 
solidification/stabilization, enhanced in situ biore¬ 
mediation, monitored natural attenuation, soil vapor 
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extraction, in situ chemical oxidation, multiphase extrac¬ 
tion, incineration, pump-and-treat, and others [6]. 

Today, the leading country in contaminated soil 
remediation is definitely the USA, where regulations 
for soil contamination have been in force as early as the 
1980s. Sites with contaminated soil in the USA have 
been divided into several different categories, the most 
important of which is the Superfund sites, which are 
uncontrolled or abandoned places, where hazardous 
waste is located and can potentially affect local ecosys¬ 
tems or people. Other site categories are the Resource 
Conservation and Recovery Act (RCRA) Corrective 
Action sites, the Underground Storage Tanks (USTs) 
sites, the Department of Energy (DOE) sites, the 
Department of Defense (DOD) sites, various Civilian 
Federal Agencies sites, and State and private parties, 
including brownfields sites [7]. In the EU, there still 
does not exist a dedicated legislation for soil contami¬ 
nation on the grounds that the Soil Directive is still 
pending since 2006. Therefore, soil remediation policy 
in the EU is presently based solely on the “polluter 
pays” principle which is mainly applied via the envi¬ 
ronmental liability Directive (Directive 2004/35/EC) 
[3]. In other words, EU has a long way to go before it 
provides a safer and more sustainable context for its 
citizens with reference to contaminated soil 
remediation. 

This entry aims to summarize and present the key 
and updated information on contaminated soil reme¬ 
diation and is divided into four parts. In the first part, 
the main key concepts, such as the soil properties, the 
main pollutants, and their fate and transport, are ini¬ 
tially developed, while, in the second part, the remedial 
approach that should be followed by scientists and 
engineers is described. In the third part of this entry, 
the main remediation technologies are analyzed, while 
in the fourth and the last part, the future directions of 
contaminated soil remediation are outlined. 

Soil Contamination 

Soil Origin, Constituents, and Properties 

Soil is mainly derived from solid materials of the earth’s 
crust, the so-called rocks, by a series of physical, chem¬ 
ical, and biological mechanisms, collectively called 
weathering. Once the soil is formed, it may either 
remain in place as weathered soil material or be 


transported to new locations by various potential 
agents, such as wind and water or by glacial and 
anthropogenic activity [5, 8]. The soil that remains in 
place is termed residual soil, while the soil that is 
transported is termed alluvial, aeolian, or glacial soil, 
if water, wind, or ice was the transfer agent, respectively 
[9]. Generally speaking, soil consists of solid particles 
with pores in between, where water potentially flows 
replacing air and gradually turning dry soil, where 
pores have no water, to unsaturated soil, where pores 
are filled with air and water, and, finally, to saturated 
soil, where water completely replaces air and pores are 
totally filled with water. Unsaturated soils are typically 
near the ground surface, while saturated soils lie 
beneath the water table. Therefore, regarding its 
saturation state and overall environmental and physi¬ 
cochemical behavior, soil potentially consists of three 
different phases which are the solid, the liquid, and/or 
the gaseous phase, commonly referred to as 
geoenvironment, which are further described below. 

The solid phase component consists of solid parti¬ 
cles with size ranges from colloids (<0.001 mm) to 
boulders (>300 mm), which typically have an inor¬ 
ganic and an organic fraction. Inorganic fraction 
mostly consists of soil minerals and free oxides and 
hydroxides. Soil minerals are usually divided into two 
categories, carbonates and silicates. The most common 
carbonate minerals are calcite and dolomite, and the 
most common silicate minerals are quartz, feldspar, 
and mica in coarse-grained soils and the clay minerals 
kaolinite, illite, and montmorillonite in fine-grained 
soils. Furthermore, another very important constituent 
of the inorganic fraction of soil is free oxides and 
hydroxides, which exist in soil either as coating of soil 
particles, discrete particles, or cementing agents 
between soil particles and organic matter. Finally, the 
organic fraction of the solid phase includes 
nonhumified compounds, such as proteins and carbo¬ 
hydrates, and humified compounds, such as humic and 
fulvic acids [10]. All of these constituents, especially 
when they exist in finer grade (i.e., higher surface 
area) or amorphous state, are the main culprits of 
what is called geochemical attenuation of contami¬ 
nants (explained in section “Fate and Transport of 
Contaminants in the Geoenvironment”) as they 
strongly react with contamination in the 
geoenvironment. 
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The liquid phase component typically comprises 
only groundwater in pristine aquifers, while a mixture 
of groundwater, dissolved chemicals, chemicals in free 
phases, and dissolved gases might be the liquid phase in 
contaminated aquifers. Groundwater may exist in 
pores either as absorbed water on solid phase surfaces, 
as double-layer water close to a clay surface, or as free 
water surrounding the diffuse double layer [5, 10]. Due 
to its chemical properties, groundwater contains 
a variety of inorganic constituents in various concen¬ 
trations, which are typically classified into major con¬ 
stituents, such as calcium and sulfate, into minor 
constituents, such as boron and iron, and into trace 
constituents, such as arsenic and nickel. On the other 
hand, groundwater contains also organic constituents, 
such as humic and fulvic acids, and dissolved gases, 
such as N 2 , 0 2 , C0 2 , CH 4 , H 2 S, and N 2 0 [11]. 

The gaseous phase exists in unsaturated soils or, in 
other words, in the vadose zone, where the liquid phase 
does not occupy the total volume of pores. However, as 
soon as the liquid phase enters the pores, gaseous phase 
is pushed out, while the procedure is reversed when the 
pores are drained out. Gaseous phase quality of 
surfacial soils is similar to the atmospheric air, despite 
the fact that oxygen concentration decreases and 
carbon dioxide increases with soil depth [ 10, 12] . Addi¬ 
tionally, gaseous phase quality might also be affected by 
contaminants vapors, which could be further generated 
through several remediation activities. 

Physical properties of soil (e.g., hydraulic conduc¬ 
tivity, particle size, and surface area), along with 
chemical properties (e.g., mineralogy, pH, and 
oxidation-reduction status) and biological properties 
(e.g., particular bacteria species existence) are signifi¬ 
cant parameters that control both soil contamination 
and remediation. In particular, hydraulic conductivity, 
namely, the ease with which water can flow through the 
soil, controls groundwater flow and contaminant 
plume propagation and is the determining parameter 
in coarse-grained contaminated soils. However, parti¬ 
cle size and surface area are more significant parame¬ 
ters in fine-grained soils. A typical example that is 
commonly employed to outline these concepts is the 
striking surface area of the clay mineral montmorillon- 
ite (800 m 2 /g), 12 g of which would be sufficient to 
cover an entire football field [13]. Thus, particle size 
and surface typically appear as the key parameters 


affecting sorption processes of organic and inorganic 
contaminants and, therefore, geochemical attenuation 
and adopted remediation strategies [14]. Mineralogy 
and pH also play key roles in the fate and transport of 
contaminants in geoenvironment. For instance, it has 
been found that Pb release in soil which has been used 
as firing range can affect its pH and buffering capacity, 
while mineralogy and the presence of organic matter 
and ferromanganese in particular have been reported as 
responsible for Pb retention in several studies [14-19]. 
On the other hand, the oxidation-reduction condi¬ 
tions, usually called redox conditions, is another key 
factor that also affects a series of critical parameters, 
such as chemical speciation, bioavailability, toxicity 
and mobility of metals and metalloids, and the effi¬ 
ciency of remediation processes [20] . A typical example 
is the biodegradation of vinyl chloride (VC), which 
might behave either as electron acceptor or electron 
donor depending on the redox conditions [21] . Finally, 
since contaminant detoxification is sometimes accom¬ 
plished only by very specific microorganisms (e.g., 
Dehalococcoides spp.), the presence of these microor¬ 
ganisms is a prerequisite for a successful soil remedia¬ 
tion effort [22]. 

Major Soil Contaminants and Their Sources 

Soil contaminants can be classified into several differ¬ 
ent ways. First of all, they can be roughly classified as 
organic and inorganic compounds, according to the 
presence or the absence of carbon atoms in their mol¬ 
ecules, respectively. Organic contaminants can be fur¬ 
ther classified by the presence of halogens atoms in 
their molecules as halogenated or nonhalogenated 
compounds, by their carbon backbone shape as ali¬ 
phatic or aromatic, or by their vapor pressure as vola¬ 
tilized (VOCs), semivolatilized (SVOCs), or 
nonvolatilized compounds. Additionally, organic con¬ 
taminants, which are immiscible with water and may 
form a separate phase, are referred to as nonaqueous 
phase liquids (NAPLs) and are further classified 
according to whether they are less dense (light NAPLs 
or LNAPLs) or more dense (dense NAPLs or DNAPLs) 
than water. Most chlorinated VOCs, such as 
tetrachloroethene (PCE) and trichloroethene (TCE), 
are DNAPLs, and most nonchlorinated VOCs, such 
as benzene, are LNAPLs. Less complicated is the 
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classification of inorganic pollutants, which are usually 
classified as metals and metalloids and radionuclides. 
The most significant categories of soil contaminants are 
(a) petroleum hydrocarbons, (b) halogenated organic 
compounds, (c) metals and metalloids, (d) radionu¬ 
clides, and (e) explosives. The most significant contam¬ 
inants of these categories and their typical sources are 
presented in Table 1 and further discussed below. 

However, although the most commonly detected 
contaminants were classified above as pure materials, 
in real life, they typically occur as mixtures, since many 
chemicals used in commerce and industry are mixtures 
and disposal sites receive a large variety of chemicals 
that can leach into soil. For example, gasoline contains 
numerous hydrocarbons as well as a range of organic 
and inorganic additives and waste generated from 
cleaning operations containing the cleaning agents 
and the chemical removed during cleaning, while 
waste from nuclear weapons manufacturing activity 
contains mixtures of several chemicals, such as radio¬ 
nuclides, metals, solvents, and organic chelating agents 
[23]. Therefore, soil contamination is usually 
a complicated phenomenon where several contami¬ 
nants coexist under a wide variety of conditions. 

Soil contamination might be caused either by point 
sources, such as damaged storage tanks and defectively 
designed waste disposal sites, or nonpoint sources, 
such as transport activity, intensive agricultural activi¬ 
ties, organized or nonorganized military activities, and 
extreme natural phenomena, such as floods and 


hurricanes. In particular, sites where petroleum hydro¬ 
carbons, halogenated organic compounds, and metals 
and metalloids may be found include burn pits, chem¬ 
ical manufacturing plants or disposal areas, disposal 
wells and leach fields, electroplating/metal finishing 
shops, firefighting training areas, hangars/aircraft 
maintenance areas, landfills and burial pits, leaking 
collection and system sanitary lines, etc. Additionally, 
metals and metalloids may also be found in artillery 
and small-arms impact areas, battery disposal areas, 
and sandblasting areas. Sites where radionuclides are 
usually found include radioactive and mixed waste 
disposal areas, while the most common sites where 
explosives are usually found include artillery/impact 
areas, contaminated marine sediments, disposal wells, 
leach fields, landfills, burial pits, and TNT washout 
lagoons [24]. 

Petroleum Hydrocarbons Petroleum hydrocarbons 
are aliphatic or aromatic, linear or branched, saturated 
or unsaturated, simple or complex natural compounds, 
which are composed solely of carbon and hydrogen 
atoms and are mainly used as combustible fuels [25]. 
Among the most environmentally significant petro¬ 
leum hydrocarbons are the monoaromatic benzene, 
toluene, ethylbenzene, and xylene, collectively called 
BTEX, and the polycyclic aromatic hydrocarbons 
(PAHs). Gasoline additive such as methyl tertiary 
butyl ether (MTBE), which are used to make gasoline 
burn cleaner, are common cocontaminants with BTEX 


Contaminated Soil, Remediation of. Table 1 Main soil contaminants 


Type of contaminants 

Typical contaminants 

Typical sources 

Petroleum 

hydrocarbons 

BTEX, alkanes, PAHs 

Aircraft areas, leaking storage tanks, vehicle maintenance areas 

Halogenated organic 
compounds 

TCE, PCE, PCBs, Aldrin, 
Dieldrin, DDT 

Electroplating/metal finishing shops, solvent degreasing areas 

Metals and metalloids 

Pb, Cr, As, Cd, Ni, Zn, Hg, 
Se, Cu 

Shooting ranges, battery disposal areas, electroplating/metal 
finishing shops 

Radionuclides 

Uranium, Tritium, 
Plutonium 

Radioactive and mixed waste disposal areas, areas affected by 
nuclear power plant accidents 

Explosives 

TNT, DNT, RDX, 
Nitroaromatics 

Shooting ranges, TNT washout lagoons 
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in a large number of contaminated sites. BTEX might 
be found in soil due releases associated with fuels but 
also due effluents associated with industrial activities, 
wood processing, and manufacturing of pesticides, 
detergents, chemicals, paints, and varnishes [26]. Typ¬ 
ically, the presence of BTEX in soil is accompanied by 
the presence of additives such as MTBE, whose high 
solubility hinders the remediation activity. PAHs are 
produced either by incomplete combustion and pyrol¬ 
ysis, natural or anthropogenic, of organic matter and 
easily adsorb onto the organic matter of soil. Several 
different physical, chemical, and biological remedia¬ 
tion technologies have been used for soil contaminated 
with petroleum hydrocarbons so far, some of which are 
aerobic or anaerobic Enhanced In Situ Bioremediation, 
Monitored Natural Attenuation, in situ chemical oxi¬ 
dation (section “Chemical Treatment”), Pump-and- 
Treat, Phytoremediation, Thermal Treatment, Air 
Sparging), and Incineration [27, 28]. 

Halogenated Organic Compounds Another group 
of very common soil contaminants are the halogenated 
organic compounds, which are mainly anthropogenic 
and have been used in a wide variety of applications, 
such as solvents, degreasing agents, biocides, pharma¬ 
ceuticals, plasticizers, hydraulic and heat transfer 
fluids, intermediates for chemical synthesis and in 
many other activities. The majority of them are chlori¬ 
nated, although brominated, fluorinated, and iodin- 
ated compounds are also used in various industrial 
activities [29]. Among them, chloroethenes, mainly 
PCE and TCE, are some of the most commonly 
detected pollutants in the contaminated sites [30]. 
Chloroethenes belong to DNAPL group and the tech¬ 
nologies selected for chloroethene-contaminated soil 
used to involve mainly extraction technologies such 
as Soil Vapor Extraction, Pump-and-Treat, or Air 
Sparging. However, there has currently been a clear 
trend in favor of in situ destruction mechanisms 
employing either biotic, such as Enhanced In Situ 
Bioremediation, Monitored Natural Attenuation, or 
abiotic processes, such as in situ chemical oxidation 
(section “Chemical Treatment”) or Thermal 
Treatment [31]. 

Metals and Metalloids Metals and metalloids, such 
as lead, chromium, cadmium, nickel, zinc, copper, 


mercury, silver, and arsenic, are another group of con¬ 
taminants that is usually detected in contaminated soils 
[12]. Typical sources of soil contamination include 
land disposal of solid wastes, liquid sewage or sewage 
sludge, leachate from mine tailings and other mining 
wastes, deep-well disposal of liquid wastes, seepage 
from industrial waste lagoons, or from other spills 
and leaks from industrial metal processing facilities 
[32]. However, since all soils naturally contain trace 
levels of metals and metalloids, only the difference 
between their ambient and their background concen¬ 
trations could be a reliable criterion for diagnosing soil 
contamination by those contaminants. As opposed to 
organic contaminants, metals and metalloids cannot be 
degraded and completely destroyed. Nevertheless, con¬ 
taminated soil might be remediated by contaminant 
transformation in less mobile and/or less toxic forms 
by changing its speciation. The most applicable reme¬ 
diation technologies for soil contaminated with metals 
and metalloids include Solidification/Stabilization, 
Permeable Reactive Barriers, Soil Flushing, 
Phytoremediation, and Soil Washing [33]. 

Radionuclides Radionuclides, such as uranium-238, 
thorium-228, and plutonium-238, are elements with 
unstable nuclei that spontaneously alter their composi¬ 
tion through radioactive decay in a series of successive 
nuclear reactions, ultimately leading to a stable configu¬ 
ration. Radionuclides can be found in soil either due to 
natural processes or due to man-made activities, such as 
nuclear weapon tests and nuclear power plant accident 
[12, 34]. Radionuclides are a widely spread group of 
contaminants across the world, since in the USA, only 
they were detected in 120 contaminated sites [20]. The 
most applicable remediation technology for soil con¬ 
taminated with radionuclides is the Solidification/Sta¬ 
bilization, while other remediation methods, such as 
the enhanced in situ Bioremediation and Electrokinetic 
Separation, have started to be used as well [33, 35]. 

Explosives Finally explosives, such as TNT (2,4,6- 
trinitrotoluene), RDX (l,3,5-trinitro-l,3,5-triazine), 
and HMX (l,3,5,7-tetranitro-l,3,5,7-tetrazocane), are 
mainly nitrogen-containing organic compounds with 
the potential of self-oxidation to small gaseous mole¬ 
cules (N 2 , H 2 0, and C0 2 ). It has been estimated that 
there are hundreds of explosives-contaminated sites 
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within the USA, and the number significantly increases 
in Europe and Asia. Several different technologies have 
been used so far to remediate explosives-contaminated 
soil, but the most widely applied one is Incineration. 
However, the high cost of incineration and the 
wide environmental concerns that this technology 
brought about have triggered the development 
of more sustainable alternatives, such as in situ chem¬ 
ical oxidation (section “Chemical Treatment”) and 
Phytoremediation [36]. 

Fate and Transport of Contaminants in the 
Geoenvironment 

As soon as a single pollutant or, most commonly, 
a mixture of pollutants encounters soil, it tends to 
travel via the unsaturated pores of the vadose zone 
toward the aquifer, creating a plume which is 
transported with groundwater. During this journey, 
a number of physical, chemical, and biological pro¬ 
cesses, collectively called natural attenuation, might 
take place which can lead to dilution, immobilization, 
or degradation of a soil contaminant, alter its chemical 
and biological identity, and usually decrease its envi¬ 
ronmental footprint. These processes, which are 
further discussed below, are generally grouped into 
three categories: (a) the transport processes, (b) the 
geochemical attenuation processes, and (c) the intrin¬ 
sic bioremediation processes. 

Transport of a contaminant is generally accom¬ 
plished via a threefold mechanism comprising (a) 
advection, (b) diffusion, and (c) mechanical dispersion 
[37]. The first process, advection, refers to the macro¬ 
scopic contaminant movement by flowing water as 
a result of a hydraulic gradient. However, contaminant 
transport also occurs in the microscopic level either 
due to a chemical concentration gradient resulting in 
diffusion or due to the heterogeneous subsurface pore 
geometry resulting in both longitudinal and transverse 
mechanical dispersion. Additionally, contaminant 
transport also depends on its immiscibility, namely, 
on whether it belongs or not to the NAPL group and, 
if it does, whether it is LNAPL or DNAPL. In particular, 
in incidents of LNAPL spills, contaminants are retained 
on grain surfaces within the vadose zone, when spills 
are small, or they are accumulated on the water table in 
the case of larger scale spills. On the contrary, in cases of 


DNAPL, contaminants reach a larger depth of the 
vadose zone when the spills are small, while contami¬ 
nants can either be dispersed around or near the water 
table or penetrate the water table and be accumulated 
on deeper impermeable layers, when the spills are large 
enough [12]. 

As it has already been stated, apart from transpor¬ 
tation through its pores, contaminants also interact 
with soil, usually resulting to the partial or total immo¬ 
bilization of soil contaminants, a process called 
geochemical attenuation. The ability of soil to trap 
contaminants and geochemically attenuate their move¬ 
ment in soil depends on (a) the chemical composition 
of seepage or groundwater, (b) the geochemical and 
mineralogical properties of the soil, and (c) the pH and 
Eh conditions that are established in the soil-water 
interface and overall matrix [38]. In particular, geo¬ 
chemical attenuation of contaminants depends on 
a series of geochemical reactions, the most important 
of which are (a) the acid-base reactions, (b) oxidation- 
reduction, (c) precipitation, (d) aqueous complexa- 
tion, (e) sorption, (f) hydrolysis, and (g) radioactive 
decay [23]. Acid-base reactions determine the pH of 
groundwater and, therefore, affect almost all geochem¬ 
ical and biological reactions in the subsurface. As 
a general rule, geochemical attenuation mechanisms 
are most active in a pH range from 5 to 8 [38, 39]. 
Oxidation-reduction reactions are usually catalyzed by 
microorganisms, although some of them can occur 
abiotically, where electrons are transformed from the 
electron donor (reductant) to the electron acceptor 
(oxidant), without the mediation of microorganisms. 
Typical electron donors for abiotic oxidation- 
reduction processes are ferrous iron (Le 2+ ), sulfides 
(S 2_ , HS _ , H 2 S), and humic substances [23]. Another 
important geochemical reaction is the precipitation, in 
which cationic dissolved metals react with anions, such 
as carbonate, hydroxide, and sulfide, to form a solid or 
precipitate. This process, which is promoted with 
increasing pH values, can be further enhanced via the 
processes of coprecipitation, where metals precipitate 
along with or are trapped within the solid formed by 
major ions, such as Ca, Al, and Le [23] . Dissolved metal 
cations can also react with anions to form complexes. 
Complexation prevents the metal from precipitating 
and changes the sorption to solids regime in compar¬ 
ison to free metal ions. Sorption is another process that 
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plays a pivotal role in contaminant attenuation by 
slowing the contaminant transport in groundwater. 
To make matters more tangible, sorption reactions 
concentrate dissolved chemicals on the soil particles. 
This process might occur by two different mechanisms, 
which are (a) adsorption and (b) absorption. Adsorp¬ 
tion occurs when chemicals cling to the surface of the 
soil particles, while absorption takes place when 
chemicals are diffused into the particle. Another signif¬ 
icant process in soil is hydrolysis, in which an electron- 
withdrawing group, such chlorine, is replaced either by 
OH - or H 2 0. Finally, radioactive decay is the process 
in which radioactive elements are spontaneously 
decayed to daughter products. Geochemical attenua¬ 
tion is exemplified in the process of the Cr 3+ immobi¬ 
lization via the mechanisms of adsorption when pH 
values are lower than 4, via adsorption precipitation 
when pH values range between 4 and 6, and via pre¬ 
cipitation when pH values are higher than 6 [38, 39]. 

The strong presence of microorganisms in the soil 
and their remarkable ability to degrade contaminants 
through a plethora of enzyme systems placed biodeg¬ 
radation among the most important attenuation 
mechanisms. The biodegradation of contaminants is 
carried out mainly by oxidation-reduction reactions, 
where microorganisms generate the required energy 
through the transfer of electrons from the electron 
donor to the electron acceptor. Biodegradation is 
a vital mechanism of natural attenuation, since it is 
possible to completely detoxify organic contaminants 
to C0 2 and water and immobilize inorganic com¬ 
pounds, transforming them to less mobile and less 
toxic species. During this process, microorganisms 
can use a wide variety of organic or inorganic com¬ 
pounds as electron donors but relatively limited elec¬ 
tron acceptors. Common electron acceptors include 0 2 
in aerobic conditions and N0 3 _ , S0 4 2- , Mn 4 + , Fe 2+ , or 
C0 2 in anaerobic conditions. However, there are also 
microorganisms which are able to use contaminants as 
electron acceptor, when the appropriate electron donor 
is sufficient. A typical example of these microorganisms 
is the bacteria of the genus Dehalococcoides , which are 
capable of completely reducing PCE or TCE to ethene, 
in the presence of a sufficient electron donor, which in 
their case would be H 2 [40]. However, the aforemen¬ 
tioned example also indicates that natural attenuation 
does not always reduce the environmental footprint of 


the contaminants, since if H 2 is not sufficient or other 
environmental conditions are not the appropriate ones, 
PCE or TCE dechlorination would never be complete 
and intermediate dechlorination products, such as VC, 
which is more toxic than the parent compounds, are 
usually produced [41]. Finally, microorganisms can 
also degrade contaminants without gaining energy, by 
a process known as cometabilism, which is the fortu¬ 
itous degradation of a contaminant when other com¬ 
pounds are available to serve as the energy source [42]. 

Remedial Approach 

During the last 30 years, the perspective of contami¬ 
nated soil remedial approaches has changed 
significantly from the cost-driven approach of the 
mid-1970s, through the feasibility-driven approach of 
the mid-1980s, the risk-based approach of the mid- 
1990s, to the recently emergent sustainable remedia¬ 
tion [43]. Sustainable remediation can be broadly 
defined as a remedy or combination of remedies 
whose net benefit on human health and the environ¬ 
ment is maximized through the judicious use of limited 
resources [44]. In particular, when a sustainable reme¬ 
diation approach is adopted (a) the energy consump¬ 
tion or the consumption of other natural sources and 
the associated costs of the remediation application 
approach should be minimized, (b) the contaminant 
releases to the environment should be reduced or elim¬ 
inated, (c) the process should harness or mimic the 
natural attenuation processes, and (d) the contami¬ 
nated land and undesirable materials should be reused 
or recycled. Generally speaking, the remedial approach 
consists of three successive actions, which are described 
in detail below and are the following: (a) site charac¬ 
terization, (b) risk assessment, and (c) remediation 
technology selection. 

Site Characterization 

Site characterization is always the first action of 
a remedial approach. During this action, all the appro¬ 
priate data should be gathered, in order to determine 
the type, the extent, and the fate and transport of the 
contamination, while constructing a conceptual site 
model based on the site geoenvironmental properties 
acquired. Since site characterization is usually an expen¬ 
sive and a time-consuming process and the nature of the 
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contaminated site is at the beginning unknown, 
a phased approach is commonly adopted, which usually 
comprises two parts typically called the Phase I and the 
Phase II environmental site assessments (ESA). 

The Phase I ESA, or the so-called preliminary site 
assessment, aims at identifying the presence or likely 
presence of contamination in a site. In particular, dur¬ 
ing the Phase I ESA, information is gathered on the 
background environmental conditions, the current and 
the historical uses of the site, and the permitting 
regulations, rendering the first picture of the contam¬ 
ination problem being as comprehensive as possible. 
A Phase I ESA typically consists of four components, 
which are [45] (a) the records review, (b) the site 
reconnaissance, (c) the interviews, and (d) the 
reporting. Records review includes information 
sources such as maps, photographs, soil and mineral 
resource surveys, and previous reports, while site 
reconnaissance is a tool for verifying the collected 
information and collecting new data pertaining to 
information such as unrecorded pollution sources or 
receptors. Interviewing of people being involved, such 
as past and present owners and occupants, is another 
significant step of the Phase I ESA process, via which 
additional valuable information for the site is typically 
gathered, while reporting is the final step of the Phase 
I ESA. Reporting encompasses all relevant information 
pertaining to the contamination problem at hand, 
where existing data gaps are clearly identified and an 
action plan is devised so as to obtain the missing 
information in order to enable remediation of the 
contaminated site and a comprehensive solution to 
the contamination problem. 

The next step of site characterization is the Phase II 
ESA, which aims to evaluate those environmental con¬ 
ditions that were identified by the previous site char¬ 
acterization phase and link them to possible 
remediation approaches and their associated costs. 
Phase II ESA is an iterative process typically comprising 
four components [46]: (a) the scope of work develop¬ 
ment, (b) the assessment activities, (c) the evaluation 
and interpretation of the data, and (d) the reporting. 
A typical Phase II ESA starts with the description of the 
work that will be performed. As soon as the rationale of 
the sampling locations and the measured parameters is 
determined, the appropriate assessment activities, 
ranging from simple field screening methods to 


complicated intrusive multimedia sampling and labo¬ 
ratory analyses, should be properly chosen and 
conducted. When data gathering has been completed, 
the assumptions, such as the groundwater direction 
and the depth to the water table, where the assessment 
was based, should be verified and the results should be 
interpreted and evaluated. During this stage of assess¬ 
ment, issues such as the existence of nearby contami¬ 
nation sources and naturally occurring contaminants 
should be addressed and properly evaluated. Finally, 
site characterization is completed with a report where 
the work that was performed and the conclusions that 
were drawn should be summarized and presented 
properly. 

Today a consensus has been reached that the most 
important factor of site characterization is the contam¬ 
inant speciation, namely, the determination of the 
chemical form of contaminant (formula, valence, and 
crystalline structure or amorphicity) as well as the 
physicochemical association between soil and contam¬ 
inants (sorption and crystal or amorphous inclusion). 
Contaminant speciation is considered as the corner¬ 
stone of the decision-making process, since among 
others it (a) explains the mechanisms of contaminants’ 
attenuation and possible degradation, (b) evaluates 
public health risk associated with contamination, (c) 
delineates mechanisms of contaminant release and 
their ultimate fate under specific exposure environ¬ 
ments, and (d) determines the release potential and 
the contaminant bioavailability. 

Although the water quality analysis methods can be 
considered to implement cutting edge technology, this 
does not apply for the analysis of contaminated soil, 
since the discovery of reliable, simple, and cost-effective 
lab and field investigatory tools using readily accessible 
equipment is still among the greatest challenges of this 
scientific area. The main field methods that are cur¬ 
rently used for site characterization of contaminated 
soils are the penetrating tests and the geophysical 
methods, each of which has its own advantages and 
drawbacks. Therefore, although penetration tests are 
relatively inexpensive and capable of providing contin¬ 
uous records, they cause mechanical disturbance of 
contaminated soils. On the other hand, although 
geophysical methods are nondestructive and less 
expensive, they cannot be used alone, but only in con¬ 
junction with limited traditional characterization 
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methods, where actual samples are obtained and eval¬ 
uated, which, in turn, should verify the geophysical 
methods’ results. Despite the fact that it is clear that 
the nondestructive methods constitute the future trend 
in site characterization, it is expected that it will take 
time and further research to develop the appropriate 
nondestructive methods that could be reliably applied 
on their own. In addition, laboratory methods for site 
characterization also need further research so as to 
provide reliable information mainly in relation to con¬ 
taminant speciation. In particular, traditional batch, 
leach/extraction tests, such as Toxicity Characteristic 
Leaching Procedure (TCLP) and Sequential Extraction 
Test (SET), can be viewed as problematic so long as 
contaminant speciation is altered during the test, while 
methods such as X-ray Powder Diffraction (XRPD) 
and Electron Microprobe Analysis (EPMA) need to 
become more standardized. Finally, state-of-the-art 
site characterization methods such as X-ray Absorption 
Near-Edge Spectroscopy (XANES) and X-ray Fluores¬ 
cence Spectroscopy (XFS) are very expensive, difficult 
to gain access to, and also formidable in terms of results 
interpretation. Overall, there are still numerous 
unresolved issues in field and laboratory site character¬ 
ization methods, and additional research and develop¬ 
ment are regarded as mandatory, in order to further 
improve this very crucial subject. 

Risk Assessment 

If site characterization confirms the existence of con¬ 
tamination, a risk assessment should follow to evaluate 
the risk posed to potential receptors, which are usually 
the humans and the neighboring ecosystems. There¬ 
fore, a comprehensive risk assessment typically com¬ 
prises the human health risk assessment and, the much 
more complex, ecological risk assessment. Although 
these two procedures have much in common, the 
main difference between them is that, while human 
health risk assessment is limited to how humans can 
be affected, ecological risk assessment should also take 
into account the complexity of factors affecting the 
whole ecosystem. 

Risk assessment comprises four successive stages 
which are [47] (a) hazard identification, (b) hazard 
assessment, (c) risk estimation, and (d) risk evaluation. 
In particular, during the hazard identification stage, 


site characterization data are further analyzed in 
a more systematic way in order to identify the contam¬ 
inants of potential concern, their sources, the environ¬ 
mental media that they could affect, their fate and 
transport in the geoenvironment, and, finally, their 
potential receptors. As soon as the hazard has been 
identified, the potential pathways by which pollutants 
of concern could reach receptors and the characteristics 
of the receptors should be determined during the haz¬ 
ard assessment stage. If the hazard assessment indicates 
a possible and significant exposure, a risk estimation 
stage is further required, during which the potential 
exposures and effects are assessed. More specifically, 
during the risk estimation stage, the intensity, the fre¬ 
quency, and the duration of the exposure are measured 
or estimated, while potential human health and envi¬ 
ronmental effects are determined and quantified. 
Finally, based on the information gathered during the 
previous stages, risk is evaluated using appropriate 
indicators of acceptability of the risk in human health 
or in the environment. If the risk assessment concludes 
that the risk of contamination is unacceptable, the site 
should be remediated and the appropriate remediation 
technology should be selected. However, political 
reasons and social pressures may very well supersede 
all of the above and render that a site should be 
remediated without following all the aforementioned 
risk assessment steps. Nevertheless, some limited risk 
assessment is still applied in order to set the specific 
goals (e.g., target concentration levels) of remediation. 

Remediation Technology Selection 

As soon as site characterization and risk assessment 
have been completed and the remedial goals for soil 
and water have been determined, decision-makers 
should select the appropriate remediation scheme. 
However, although soil remediation is a point-in-time 
process, it should be capable of permanently solving 
a very complex and dynamic phenomenon, which is 
variously affected by a series of interactive parameters, 
such as 

• Type of soil 

• Type of contaminants 

• Sources of contaminants 

• Age of contamination (aging) 

• Depth of contamination 
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• Type of receptors 

• How contaminants are introduced into soil 

Taking into account that soil contamination is 
a highly costly and complex problem, it is evident 
that selection of the appropriate remediation technol¬ 
ogy is more of an “art” rather than a standard practice 
yet. The decisive parameters are the experience of deci¬ 
sion-makers coupled with the overall quality of site 
characterization, which will highly affect the ultimate 
success of any remediation effort. 

Remediation Technologies Classification Remedia¬ 
tion technologies are generally classified in several use¬ 
ful ways. First of all, they are divided, based on the 
medium that they address, into two classes and partic¬ 
ularly in (a) source treatment technologies, which 
address only the source of contamination, namely, 
soil, sediment, sludge, solid-matrix wastes, or NAPL, 
but do not address groundwater directly and 
(b) groundwater treatment technologies, which 
address the groundwater directly. Other interesting 
classifications of remediation technologies are based 
either on the depth of contamination or where reme¬ 
diation takes place or, finally, on the predominant 
remediation process employed. In particular, given 
that different technologies should be used for shallow 
soil contamination and others for deep soil contami¬ 
nation, remediation technologies can be classified as 

(a) shallow remediation technologies, which are appro¬ 
priate for vadose zone and shallow aquifers and 

(b) deep remediation technologies, when soil contam¬ 
ination has reached deeper aquifers. Additionally, they 
are typically classified also into (a) in situ technologies 
or (b) ex situ technologies, based on whether contam¬ 
inated soil or groundwater is treated in place or after 
removal, respectively, or according to the primary 
mechanisms by which treatment is achieved in 
(a) biological, (b) physical/chemical, and (c) thermal 
technologies. Finally, a rougher but very useful classi¬ 
fication is based on the fate of the contaminant during 
remediation, and consequently, technologies are 
divided into 

• Removal technologies 

• Immobilization technologies 

• Extraction technologies 

• Destruction technologies 


Removal technologies, such as the Dig-and-Dump 
and the Pump-and-Treat, intend to completely remove 
soil and groundwater contamination in order to restore 
the contaminated soil to “pristine” conditions. 
Although this seemed possible in the past, nowadays 
it has become clear that return to pristine conditions is 
an extremely difficult and expensive practice, which is 
very rarely successful. Therefore, during the last decade, 
the accumulated experience and the adoption of the 
risk-based approach led to broader application of more 
sober management practices, including immobiliza¬ 
tion and destruction technologies. The main represen¬ 
tative of immobilization and destruction technologies 
is Solidification/Stabilization and Bioremediation, 
respectively, which, although they do not restore “pris¬ 
tine” conditions, can efficiently reduce the groundwa¬ 
ter contaminants’ concentrations to accepted levels. 
Another approach, which is usually adopted, is based 
on extraction technologies, such as Phytoremediation 
and Soil Washing, which are capable of extracting con¬ 
taminants from the soil and the groundwater-reducing 
contaminants’ concentrations to an acceptable level. 
However, the major drawback of extraction technolo¬ 
gies is the generation of another waste stream, which, in 
the case of Soil Washing, amounts to large volumes of 
wastewater and sludges, while in the case of 
phytoremediation, large volumes of harvested plants 
or even worse plant parts. Finally, Containment Tech¬ 
nologies such as Vertical Engineered Barriers (section 
“Containment Technologies”) can also be used mainly 
to protect sensitive receptors by effectively controlling 
contaminant migration. Table 2 illustrates the diverse 
classifications of remediation technologies along 
with their frequency of application and the cost. How¬ 
ever, it should be noted that combinations of remedi¬ 
ation technologies are usually required, either in 
sequence or as cotreatment procedures, mainly due to 
the contamination complexity and the different effi¬ 
ciency that different technologies are capable of 
achieving. 

Selection Process and Criteria Generally speaking, 
a remediation technology selection process can be 
divided into three successive stages, which are 

• Development of remediation alternative scenarios 

• Screening of alternatives 

• Treatability and pilot studies 
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Contaminated Soil, Remediation of. Table 2 Main contaminated soil remediation technologies 


Remediation 

technology 

Depth 

Place 

Fate of 
contaminant 

Typical 

contaminants 

Application 

frequency 3 

Application 

trend 3 

Cost 

Source treatment technologies 

Solidification/ 

stabilization 

Shallow/ 

deep 

In situ/ 
ex situ 

P/C-l 

Metals, 

radionuclides, 

organic 

VF 

t 

Low 

Soil vapor 
extraction 

Shallow 

In situ 

P/C-E 

BTEX, solvents 

VF 

- 

Medium 

Bioremediation 

Shallow/ 

deep 

In situ/ 
ex situ 

B-D 

PAHs, BTEX, 
pesticides, solvents 

F 

- 

Low- 

Medium 

Incineration 

Shallow 

Ex situ 

P/C-D 

PCBs, solvents, 
explosives 

F 


High 

Chemical 

treatment 

Shallow 

In situ 

P/C-D 

BTEX, pesticides, 
solvents 

NF 

t 

Medium 

Multiphase 

extraction 

Shallow 

In situ 

P/C-E 

BTEX, solvents 

NF 

- 

Medium- 

High 

Phytoremediation 

Shallow 

In situ 

P/C-E 

Metals, solvents 

NF 

- 

Low- 

Medium 

Thermal treatment 

Shallow 

In situ/ 
ex situ 

P/C-D 

BTEX, pesticides 

NF 

- 

High 

Flushing 

Shallow 

In situ 

P/C-E 

BTEX, metals, 
radionuclides 

NF 

- 

Medium- 

High 

Bioventing 

Shallow 

In situ 

B-D 

BTEX, pesticides 

NF 

- 

Low- 

Medium 

Soil washing 

Shallow 

Ex situ 

P/C-E 

Metals, PAHs, PCBs, 
pesticides 

NF 


High 

Electrokinetic 

separation 

Shallow 

In Situ 

P/C-E, P/C-l 

Metals, 

radionuclides 

NF 

- 

High 

Groundwater treatment technologies 

Monitored natural 
attenuation 

Shallow/ 

deep 

In situ 

P/C/B-l-D 

BTEX, pesticides, 
solvents 

VF 

t 

Low 

Pump-and-treat 

Shallow/ 

deep 

Ex situ 

P/C-E 

TCE, BTEX 

VF 


High 

Bioremediation 

Shallow/ 

deep 

In situ/ 
ex situ 

B-D 

BTEX, pesticides, 
solvents 

F 

t 

Low- 

Medium 

Chemical 

treatment 

Shallow/ 

deep 

In situ 

P/C-D 

BTEX, pesticides, 
solvents 

NF 

t 

Medium- 

High 

Phytoremediation 

Shallow 

In situ 

P/C-E 

Metals, solvents 

NF 

- 

Low- 

Medium 

Flushing 

Shallow 

In situ 

P/C-E 

BTEX, metals, 
radionuclides 

NF 

- 

Medium- 

High 

Multiphase 

extraction 

Shallow 

In situ 

P/C-E 

BTEX, solvents 

NF 

- 

Medium- 

High 
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Contaminated Soil, Remediation of. Table 2 (Continued) 


Remediation 

technology 

Depth 

Place 

Fate of 
contaminant 

Typical 

contaminants 

Application 

frequency 3 

Application 

trend 3 

Cost 

Permeable reactive 
barriers 

Shallow/ 

deep 

In situ 

P/C-B-D 

Metals, solvents, 
radionuclides 

NF 

t 

Medium- 

High 

Air sparging 

Shallow 

In situ 

P/C-E 

BTEX, solvents 

NF 

i 

Medium 


B biological process, P/C physical/chemical process, D destruction, / immobilization, R removal, C containment 

Application frequency and trend at Superfund sites where the treatment option or MIMA was selected in the USA [6], VF very frequent 
(>20% of the Superfund sites), F frequent (10-20% of the Superfund sites), NF not frequent (<10% of the Superfund sites) 


During the first stage, a series of scenarios should be 
created which should be applicable to the specific con¬ 
tamination problem, site-specific hydrogeological and 
other geoenvironmental conditions, and the feasibility 
of the remediation goal. Thus, the remediation scenarios 
created ought to cater for the respective differences 
between organic and inorganic contaminants, mixtures 
and single contaminants, coarse and fine soils, point and 
nonpoint sources, vadose zones and aquifers, shallow and 
deep contamination, etc. As soon as remediation scenar¬ 
ios have been created, the alternative scenarios should be 
further screened on the basis of several technical, envi¬ 
ronmental, health and safety, and socioeconomic criteria. 

The most important of these criteria are the 
following [48]: 

• Short-term and long-term effectiveness 

• Reduction in toxicity, mobility, or volume of 
contamination 

• Implementability and cost 

• Compliance with applicable or relevant and appro¬ 
priate requirements 

• Overall protection of human health and the 
environment 

• Regulatory acceptance 

• Sustainability 

The remediation technology that will be screened 
from the second stage should ensure the long-term 
effectiveness of the remediation effort but with an 
affordable cost for construction and operation that 
would not cause the remediation effort termination 
before the remedial goal has been achieved. Addition¬ 
ally, a remediation technology that fails to gain the 
community support is difficult to be implemented 
and operated over the extended period of time that 
a remediation effort usually requires. 


Finally, the most prevailing remediation scenarios, 
which have successfully passed through the first and the 
second stage of the selection process, should be further 
examined with treatability studies and pilot tests. 
Treatability studies are laboratory tools which 
are used to determine the treatability of specific con¬ 
taminants under conditions similar to those prevailing 
in the field. Typical examples of treatability studies are 

• TCLP or synthetic precipitation leaching procedure 
(SPLP) for solidification/stabilization performance 
evaluation 

• Microcosm studies, which are employed to examine 
the efficiency of enhanced in situ bioremediations 

Solidification/stabilization performance and overall 
sustainability are usually evaluated solely on the basis of 
regulatory leaching tests and typically the TCLP. Inter¬ 
estingly, these tests and other similar leaching tests, used 
in Europe and elsewhere, provide point-in-time perfor¬ 
mance data and cannot provide a solid foundation for 
a reliable assessment of long-term performance and 
treatment overall sustainability. In particular, Dermatas 
et al. [49] have pointed out the inadequacy of TCLP and 
sequential extraction procedures in evaluating Pb 
release, while Sheckel et al. [50] found that in phos¬ 
phate-treated Pb-contaminated soils, a significant shift 
of extractable Pb to the residual phase occurred due to 
the formation of pyromorphite during the extraction 
process, artificially overestimating the presence of 
pyromorphite. Additionally, Cao and Dermatas [51] 
concluded that the standard one-point TCLP test 
would either over- or underestimate Pb leachability in 
firing range soils and that the SPLP is a more appro¬ 
priate alternative than TCLP for assessing Pb leachabil¬ 
ity. Similarly, Kosson et al. [52] and van der Sloot et al. 
[53] point out to the inadequacies of regulatory 
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leaching tests to explain the fact that the European 
regulatory framework is still under development. 
Thus, it appears that treatability bench scale screening 
studies, which are mainly based on regulatory testing, 
may lead to the wrong treatment alternative selection. 
Even worse, there have been many cases where solidifi¬ 
cation/stabilization treatment that appeared to be ini¬ 
tially successful, ended up failing in the near or long 
term [39] . Moreover, this approach often leads to solid¬ 
ification/stabilization overtreatment along with the 
associated waste of energy and resources. Therefore, 
a multiple line of evidence strategy should be adopted 
to understand the actual phenomena given the limita¬ 
tions of the individual investigation techniques [49]. 
This involves at least a combination of traditional geo¬ 
technical analyses (particle size distribution, unit 
weight, water content), physicochemical analyses (pH, 
total metal contents), and a subset of controlled pH 
extraction tests (TCLP, SPLP, etc.) coupled with non¬ 
destructive mineralogical quantitative analyses (XRPD 
combined with Rietveld Quantitative Analysis, RQA). 
Following these analyses, micromorphological (Scan¬ 
ning Electron Microscopy-Energy Dispersive Spectros¬ 
copy, SEM-EDS and/or EPMA) and contaminant 
speciation analyses (XANES and/or Extended X-ray 
Absorption Fine Structure, EXAFS or other newly 
developed techniques) should be used once a more 
in-depth assessment is required [49, 54]. 

On the other hand, microcosm studies provide 
evidence of the potential response of the indigenous 
microbial community to several different electron 
donor additions (e.g., butyrate, lactate) while, at the 
same time, they allow for mass balance calculations 
regarding the chloroethenes that were dechlorinated 
and the electron donor that was consumed either by 
dechlorination or by other competing final-accepting 
processes (e.g., methanogenesis, sulfate reduction) 
[55]. Microcosm studies evaluating enhanced remedi¬ 
ation of sites contaminated with chloroethenes should 
be prepared under aseptic and anaerobic conditions. 
A focal point in microcosm studies is samples repre¬ 
sentative preparation, which, more often than not, is 
a difficult task, since microcosm conditions should 
mimic the site-specific conditions as much as possible 
and laboratory cross-contamination phenomena, 
which could violate the study results, should be 
avoided. Microcosm studies are usually coupled with 


molecular techniques, such as fluorescence in situ 
hybridization (FISH) or polymerase chain reaction 
(PCR), which by studying the microbial community 
structure and the functional relationships between 
community members provide valuable information 
for the potential behavior of the microbial community. 

Finally, as soon as treatability studies indicate the 
potential applicability of a remediation technology, 
a field pilot test is usually employed to further prove 
the treatability study results under real-life conditions 
and to obtain critical information so as to make 
the final decision or to optimize implementation of 
the selected option. If the results of the pilot test agree 
with the results of treatability test, a full-scale scheme 
of the selected remediation technology should be 
designed and implemented properly, otherwise alter¬ 
native scenarios should be considered and the most 
appropriate technology should be tested again. 

Remediation Technologies 

According to the latest USEPAs Superfund Remedy 
Report [6], solidification/stabilization, soil vapor 
extraction, bioremediation, and incineration were 
selected for approximately 80% of the applied source 
treatment technologies. Other technologies, such as 
chemical treatment, soil washing, and phytoremediation, 
have also been applied in a limited number of Superfund 
sites, though [6]. On the other hand, pump-and-treat, 
monitored natural attenuation, and vertical engineered 
barriers are the most frequently applied groundwater 
treatment technologies for Superfund sites [6]. These 
technologies have demonstrated different trends dur¬ 
ing the last three decades, since, although the selection 
of pump-and-treat, soil washing, incineration, and air 
sparging have leveled off, the use of solidification/sta- 
bilization, chemical treatment, monitored natural 
attenuation, bioremediation, and permeable reactive 
barriers has increased. The most significant source 
and groundwater treatment technologies are discussed 
in detail below, while a brief description of the 
remaining treatment and containment technologies is 
limited only to the key points. 

Solidification/ Stabilization 

Solidification/stabilization refers to a group of in situ 
or ex situ source treatment technologies which, using 
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the appropriate binder agents, prevent or minimize the 
migration of contaminants in the environment rather 
than degrade them [56, 57]. According to USEPAs 
Superfund Remedy Report [6], solidification/stabiliza- 
tion is the most frequently selected ex situ and among 
the most frequently selected in situ source treatment 
technologies for Superfund sites, being used in 203 and 
56 projects, respectively. The widespread acceptance of 
this technology stems from the lower cost incurred in 
comparison to much more expensive alternatives and 
the better understanding of the prevailing mechanisms 
which render possible the reliable evaluation of the 
technology performance. Solidification/stabilization 
has been applied so far to contain or treat soils con¬ 
taminated with various hazardous contaminants, such 
as metals, radionuclides, and organic contaminants 
[57]. In particular, solidification/stabilization technol¬ 
ogies, using a binder agent, encapsulate soil contami¬ 
nants in a nontoxic or less toxic material, which may 
be either a monolithic block, a clay-like material, 
a granular particulate, or some other physical form 
commonly considered “solid” (Solidification) while, 
at the same time, chemically transform soil contami¬ 
nants in less soluble, immobile and less toxic forms 
(Stabilization) [56-58]. Moreover, the stabilized wastes 
may attain adequate stress-strain properties to enable 
their utilization in construction applications, such as 
engineering fill, road or pavement subgrade, backfill, 
and base material [39, 59, 60]. Binder agents that have 
been used so far are mainly inorganic materials, such as 
cement [61], lime [39], and phosphate [62, 63]. How¬ 
ever, other binders that could be used in the context of 
the waste combination approach include fly ash [39], 
fish cannery wastes [64], and a plethora of other waste 
materials. Lately, sorbents such as zero-valent iron 
(ZVI), titanium dioxide, zeolites, and other nonwaste 
materials have also been employed, increasingly so in 
nanosize forms. 

The most costly version of solidification/stabiliza- 
tion is the in situ or ex situ Vitrification, which immo¬ 
bilizes inorganics and destroys organic contaminants 
by heating the soil at extremely high temperatures 
(1,600-2,000°C), using a powerful source of energy, 
such as electric current, direct-fired kiln, or other heat 
sources. By this method, most inorganics are 
immobilized and incorporated within the vitrified 
glass and crystalline mass, while organic pollutants 


are destroyed by pyrolysis and the water vapors and 
organic by-products are captured by an off-gas treat¬ 
ment system prior to discharge. 

Bioremediation 

Bioremediation comprises a group of in situ or ex situ 
technologies, which achieve either source or ground- 
water remediation using microorganisms’ metabolism. 
The most promising among them is the enhanced in 
situ bioremediation (EISB). EISB is a source and 
a groundwater remediation technology, in which addi¬ 
tives, such as nutrients, electron donors or acceptors, 
stimulate the biodegradative activity of native 
(biostimulation) or added microorganisms 
(bioaugmentation) [58] . This technology has been suc¬ 
cessfully used for soil contaminated with petroleum 
hydrocarbons, solvents, pesticides, wood preservatives, 
and other organic chemicals [24]. The most frequently 
used versions of this technology include landfarming, 
reductive dechlorination, and biosparging, the analysis 
of which will follow. 

Landfarming is an ex situ source treatment technol¬ 
ogy, in which excavated soil, being contaminated with 
petroleum hydrocarbons, is spread in a layer with 
a width of less than 1.5 m where aerobic biodegradation 
mechanisms are stimulated by aeration and/or addi¬ 
tion of required bacteria, nutrients, minerals, and 
water. However, among the most significant drawbacks 
of Landfarming are the large amount of land require¬ 
ment, the air pollution induced by contaminants’ vol¬ 
atilization, the low efficiency regarding heavy 
petroleum components, its potential of accidentally 
contaminating uncontaminated sites, the high opti¬ 
mum temperature that is required, and the difficulty 
involved both in achieving efficiency higher than 95% 
and in remediating very contaminated soil [65]. 

Reductive Dechlorination (RD) is among the most 
commonly used groundwater treatment technologies 
for soil contaminated mainly with PCE or TCE, which, 
being in an oxidized state, are susceptible to anaerobic 
reduction rather than to aerobic oxidation processes 
[66, 67]. Two decades of intense research efforts have 
proved that under strictly anaerobic conditions, PCE 
and TCE can be detoxified, via a process of the so-called 
chlororespiration, consecutively producing cis- 
dichloroethene (cDCE), VC, and finally innocuous 
ethylene (ETH), when sufficient H 2 -releasing electron 
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donor and dechlorinating bacteria coexist [68]. 
Although a large number of dechlorinating bacteria 
has already been isolated [41], only members of the 
genus Dehalococcoides have been shown to be capable 
of completely dechlorinating TCE to ETH [40, 69-72]. 
However, since normally electron donors are not suffi¬ 
cient for the complete PCE or TCE to ETH dechlori¬ 
nation [73], H 2 -producing simple substrates, such as 
lactate, butyrate, and methanol, or more complex sub¬ 
strates, such as molasses, vegetable oils, mulch, or com¬ 
post, should be added, which stimulate the native 
biological mechanisms [66, 74]. Furthermore, the lack 
of electron donor can be intensified due to the compe¬ 
tition that is mainly developed between dechlorinators 
and methanogens or sulfate-reducing bacteria (SRB) 
[55]. Nevertheless, research studies conducted during 
the last 15 years have proved that dechlorinators can 
easily outcompete methanogens for H 2 , if electron 
donor is slowly and consistently degraded producing 
low H 2 concentration [75-78]. Unfortunately, sulfate 
appears to delay RD under sulfate-reducing conditions, 
although Dehalococcoides population seems not to be 
affected by sulfate concentration [79]. Although RD is 
today a widely accepted and relatively low-cost in situ 
groundwater treatment technology, it cannot be 
applied indiscriminately to all contaminated sites 
with chlorinated solvents due to a series of limitations. 
The most important of them are the site-specific lim¬ 
itations, such as the low permeability or heterogeneity 
of the aquifer, the slow and incomplete dechlorination, 
due to microbial competition, unfavorable redox con¬ 
ditions or lack of the appropriate microbial consortia, 
and the secondary degradation effects, such as solubi¬ 
lization of metals or production of other undesirable 
fermentation products, due to the electron donor 
addition. 

Biosparging is a remediation technology which, 
generally, stimulates the indigenous aerobic microor¬ 
ganisms of aquifers by injecting air or oxygen and 
nutrients, if needed [67, 80]. By and large, Biosparging 
is a technology that is recommended for sites where 
lightweight (e.g., gasoline) or midweight (e.g., diesel, 
jet fuel) petroleum hydrocarbons are dissolved in 
groundwater and adsorbed to soil below the water 
table and/or within the capillary fringe [67, 80]. How¬ 
ever, Biosparging is not the appropriate remediation 
technology when free product is present, when 


basements, sewers, or other subsurface confined spaces 
are located near the site, and when the aquifer is con¬ 
fined [80]. The efficiency of Biosparging primarily 
depends on soil permeability, since this parameter con¬ 
trols how well oxygen can be delivered to the subsurface 
microorganisms and contaminant biodegradability, 
which determine the rate of remedy action. Other 
parameters such as soil stratification, groundwater 
temperature, pH level, microbial population density, 
nutrient concentration, and contaminants concentra¬ 
tion and properties should also be taken into account 
to appropriately evaluate and design a Biosparging sys¬ 
tem [80]. Since soil gas and vapors produced com¬ 
monly require further treatment, Biosparging may be 
implemented in conjunction with source treatments 
technologies, such as SVE or bioventing [67]. 

Despite the fact that the appropriate microorgan¬ 
isms, capable to degrade the target contaminants, can 
be widely distributed in contaminated soils, this is not 
always the case [81]. This seems even more likely it is 
considered that most of times, soil is contaminated by 
complex contaminants, and often mixture of them, 
that can only be degradated by a very specific microbial 
consortium. In cases like these, Bioaugmentation, 
namely, the inoculation of the contaminated soil with 
specific competent strain or consortia of microorgan¬ 
isms, could be the solution, if the site conditions have 
been thoroughly studied and understood [82]. How¬ 
ever, today, bioaugmentation is still a limited-applied 
technology, and further research is needed to build 
a robust knowledge base of this technology. 

Monitored Natural Attenuation 

As it has been determined by USEPA, monitored nat¬ 
ural attenuation or MNA is the remediation technol¬ 
ogy, which relies on natural chemical, physical, or 
biological attenuation processes and, as such, it is suit¬ 
able to achieve site-specific remediation objectives 
without human intervention within a more reasonable 
time frame compared to that offered by other more 
active remediation technologies. These in situ processes 
include biodegradation, dispersion, dilution, sorption, 
volatilization, radioactive decay, and chemical or bio¬ 
logical stabilization, transformation, or destruction of 
contaminants [83]. Monitored natural attenuation has 
been successfully implemented to date at sites 
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contaminated with petroleum hydrocarbons, alcohols, 
ketones, esters, ethers, iron, and manganese, while 
MNA could also be applicable, under a narrow range 
of conditions, to chlorinated solvents, lightly haloge- 
nated aromatics, PCBs, nitroaromatics, some pesti¬ 
cides, chromium, copper, cadmium, lead, zinc, and 
nickel [67]. However, prior to selecting MNA as the 
appropriate treatment technology for a contaminated 
site, it must be verified that natural processes are effi¬ 
cient enough to consistently ensure the protection of 
the environment. For this reason, a number of pro¬ 
tocols have been published so far that allow proponents 
of this technology to show that natural degradation 
processes will reduce the concentrations of 
corresponding contaminants below regulatory stan¬ 
dards before potential receptor exposure pathways are 
completed [84, 85]. MNA is frequently used in con¬ 
junction with EISB. 

Soil Vapor Extraction 

Soil vapor extraction (SVE), also known as Soil Venting 
or Vacuum Extraction, is a mature and widely used in 
situ physical/chemical technology usually applied to 
remediate well-drained, high permeability unsaturated 
soils with low organic carbon content mainly contam¬ 
inated with VOCs or SVOCs [58]. Examples of con¬ 
taminants that can be removed by SVE include 
common chemical mixtures, such as gasoline, kero¬ 
sene, jet fuels, BTEX, styrene, chlorobenzene, chloro¬ 
form, carbon tetrachloride, PCE, TCE, DCEs, VC, 
trichloroethane (TCA), dichloroethane (DCA), ethyl¬ 
ene dibromide, methylene chloride, acetone, methyl 
ethyl ketone (MEK), and methyl isobutyl ketone 
(MIBK) [86]. The general idea behind this technology 
is to induce a controlled subsurface airflow, in order to 
remove the soil vapors directly from the subsurface and 
vaporize contaminants, which are either adsorbed on 
soil particles, dissolved in pore water or in the form of 
free-product phase [58]. A typical SVE system usually 
consists of the extraction wells, equipped with vacuum 
blower, the injection wells or the air vents, the vapor/ 
liquid separator system, the liquid treatment system, 
and the off-gas treatment system, which commonly 
employs an adsorption system, such as Granular Acti¬ 
vated Carbon (GAC) or a thermal oxidation system, 
when a more concentrated waste streams should be 


treated [86]. As soon as vacuums are applied through 
the extraction wells to the vadose zone near the source 
of contamination, a stream of vapors is generated, due 
to the concurrent accomplished of volatilization, dis¬ 
solution, and desorption of the free-phase, of the 
dissolved-phase, and of the sorbed-phase contaminant, 
respectively. These processes can be promoted through 
high contaminant vapor pressures, high contaminant 
Henry’s law constants, low contaminant solubilities, 
high soil intrinsic permeability, and low water and 
organic carbon soil content [58, 87, 88]. 

As it has become evident, SVE may achieve more 
than 90% removal of the contaminants, and generally, 
its effluent contaminant concentrations become asymp¬ 
totic after a period of operation [58]. Although this 
asymptotic performance is usually interpreted as suc¬ 
cessful completion of the remediation effort, a series of 
reasons, such as the water table rise and the overdryness 
of soil, can potentially account for inducing the effluent 
contaminant concentration [58]. According to USEPAs 
Superfund Remedy Report [6], SVE has been used in 
276 Superfund sites so far, and it is the most frequently 
used source treatment technology in the USA. 

Chemical Treatment 

Chemical Treatment comprises a group of in situ or ex 
situ technologies, which treat either the source or the 
groundwater employing chemical reactions. The typi¬ 
cal representative of this group is the in situ chemical 
oxidation (ISCO). 

ISCO is among the most frequently selected in situ 
soil remediation technologies either for source or 
groundwater treatment, mainly due to the wide range 
of organic contaminants (BTEX, MTBE, petroleum 
hydrocarbons, chlorinated solvents, PAHs, PCBs, chlo¬ 
rinate benzenes, phenols, organic pesticides, and explo¬ 
sives) that are amenable to chemical oxidation 
treatment [6, 89]. Generally speaking, ISCO is based 
on the oxidation of groundwater contaminants into 
carbon dioxide, water, and chloride ions (in the pres¬ 
ence of chlorinated compounds), brought about by the 
oxidative power of mainly six classes of oxidants, which 
include permanganate, catalyzed hydrogen peroxide, 
ozone, persulfate, ozone activated with hydrogen per¬ 
oxide (peroxone), and percarbonate. Among these, 
permangnate is the most frequently used [90, 91]. 
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The most common forms of permanganate are the 
potassium permanganate (KMn0 4 ) and the sodium 
permanganate (NaMn0 4 ), which are both very strong 
oxidizing agents and present a remarkable affinity for 
organic compounds containing carbon-carbon double 
bonds, aldehyde groups, or hydroxyl groups [89]. 
However, each kind of permanganate has its own 
advantages and disadvantages. In particular, since 
KMn0 4 is provided as a solid product transportation 
hazards are minimized, while the concentrated liquid 
form of NaMn0 4 permits more flexibility in the design 
of the injection volume. Since manganate can have 
several different mineral forms and exists in multiple 
valence states, it can participate into a complex system 
of different pH-dependent chemical reactions, produc¬ 
ing Mn 2+ , Mn0 2 (s), or Mn0 4 - . In addition to being 
nonselective, permanganate can be readily consumed, 
oxidizing in this way other reduced species present in 
contaminated soil as well, such as natural organic mat¬ 
ter or reduced minerals, which could severely affect 
ISCO performance [89]. 

Multiphase Extraction 

Multiphase extraction (MPE) is a technology employed 
to remediate both the groundwater and the vadose 
zone that involves simultaneous extraction of soil 
vapor and groundwater, which are further treated on 
site [92]. MPE is usually combined with other remedi¬ 
ation technologies, such as bioremediation, air sparg¬ 
ing, or bioventing, when the target contaminants 
include long-chained hydrocarbons, in order to 
shorten the cleanup time at a site [24]. Although 
MPE is more effective than SVE for heterogeneous 
clays and fine sands, it is not recommended for lower 
permeability formations due to its tendency of leaving 
isolated lenses of undissolved product in the formation 
[24]. MPE is usually selected to remediate sites con¬ 
taminated by VOCs and petroleum hydrocarbons [24], 
since, according to the recent USEPAs Superfund Rem¬ 
edy Report, it has been implemented in 54 Superfund 
sites [6]. MPE can be designed and implemented either 
as a system of a single or two pumps configuration or as 
the so-called technology of bioslurping, which is 
applied in order to recover the free-product phase of 
pollutants. In the single-pump configuration, which 
can be used only for shallow aquifers, a single vacuum 


pump is employed to extract both liquid and vapor 
from a single well. The depth limitation can be over¬ 
come by using the two-pump configuration, where 
a submersible pump for groundwater recovery is used 
in conjunction with a separate vacuum applied at the 
sealed wellhead. In this configuration, liquid and vapor 
streams are separated from one another [92]. Finally, 
the third configuration, the so-called bioslurping, com¬ 
bines the vacuum-enhanced free-product recovery, 
which extracts LNAPLs from the capillary fringe and 
the water table, with bioventing, which stimulates the 
aerobic bioremediation of hydrocarbon-contaminated 
soils [24, 92]. 

Permeable Reactive Barriers 

Permeable reactive barriers (PRBs) is a shallow or deep 
groundwater in situ remediation technology, in which 
contaminants are treated by means of physical, chem¬ 
ical, or biological processes. In particular, PRBs are 
continuously permeable treatment zones designed to 
remediate a contaminant plume, so that the ground- 
water quality will be acceptable downgradient of the 
barrier. Generally speaking, PRBs might be divided 
into conventional and advanced systems. Typical rep¬ 
resentatives of conventional systems are the continuous 
PRBs, consisting of a single reactive zone, and the 
Funnel-and-Gate PRBs, consisting of a reactive placed 
between two impermeable walls, while advanced sys¬ 
tems employ other techniques such as injection sys¬ 
tems, hydraulic/pneumatic fracturing, and passive 
groundwater capture and treatment by reactor cells 
[93]. PRBs are usually applied to remediation ground- 
water plumes contaminated with organic compounds, 
such as chlorinated solvents, BTEX, metals and metal¬ 
loids, and radionuclides [33, 94, 95]. Today the most 
widely applied reactive material is by far the zero-valent 
iron (ZVI), while other materials such as red mud, 
zeolites, activated carbon, and Ti0 2 appear to be 
among the emerging trends of the subject [33, 93]. 

Pump-and-Treat 

A pump-and-treat system consists of a groundwater 
extraction system and an aboveground treatment/ 
disposal system, where the pumped groundwater is 
treated properly and the treated water is either 
reinjected into the subsurface or discharged into 
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surface water bodies or to municipal sewer systems 
[96]. Generally speaking, the purpose of a pump-and- 
treat system is typically to combine the groundwater 
treatment with its hydraulic containment [96]. Pump- 
and-treat has been the dominant treatment technology 
for groundwater until recently, although its low effi¬ 
ciency and the emergence of more innovative technol¬ 
ogies, such as the EISB and the ISCO, have leveled off its 
application during the last 15 years [6]. The failure of 
the pump-and-treat was identified as its inability to 
achieve the remedial goal within a reasonable time 
period. The most significant factors that contribute to 
this shortcoming are, on the one hand, the progres¬ 
sively slower rate of decline in dissolved contaminant 
concentration, referred to as tailing, and, on the other 
hand, the fairly rapid increase in contaminant concen¬ 
tration that can occur after pumping has been 
discontinued, referred to as rebound, which usually 
results in longer treatment times and higher residual 
concentrations than the cleanup standards. Today it is 
generally accepted that, although a stand-alone pump- 
and-treat system is not as efficient as it was expected in 
the past, the combination of pump-and-treat system 
with other remediation technologies, such as EISB, may 
improve the groundwater cleanup efforts [96]. 

Soil Flushing 

Soil flushing is an in situ source treatment technology, 
in which an aqueous solution is injected or infiltrated 
into a moderate-to-highly permeable contaminated 
soil, and the produced elutriate (flushing solution 
mixed with contaminants) is extracted and treated by 
the appropriate aboveground processes. Soil flushing 
can be used to remediate soils contaminated either with 
inorganics, such as volatile and nonvolatile metals, 
corrosive, cyanides and radioactive contaminants, or 
organics, such as NAPLs, VOCs, SVOCs, PCBs, halo- 
genated pesticides [97], where it has been proven that 
a strong function of interfacial tension is existent [98]. 

Bioventing 

Bioventing is an in situ source treatment technology 
that, enhancing indigenous microorganism activity, 
biodegrades organic constituents adsorbed in soils of 
the unsaturated zone or capillary fringe [67]. More 
specifically, when bioventing is applied, oxygen is 


introduced into the unsaturated zone either by injec¬ 
tion of either oxygen or air or by the atmosphere due to 
pressure difference, when soil gas is extracted. In this 
case, the principle of bioventing is similar to SVE, 
although biodegradation is the principle mechanism 
in the former technology and volatilization in the latter. 
Bioventing is usually applied in sites contaminated 
with petroleum hydrocarbons, nonchlorinated sol¬ 
vents, some pesticides, wood preservatives, and other 
organic chemicals [24]. 

Phytoremediation 

Phytoremediation is a shallow in situ technology 
mainly employed to contain and extract metals, pesti¬ 
cides, explosives, or oil for contaminated soil [99]. 
Basic mechanisms being involved in phytoremediation 
technology are phytosequestration, rhizodegradation, 
phytohydraulics, phytoextraction, phytodegradation, 
and phytovolatilization. Recently, this technology has 
been into the spotlight thanks to its numerous benefits, 
the most important of which are the low construction 
and operating cost, its ability to remediate soil with 
organic and inorganic contaminants, its solar-powered 
nature, the improvement of aesthetics that causes and 
its applicability as a polishing step in conjunction with 
another remediation technology, as well [100]. How¬ 
ever, the slow removal of contaminants from soil and 
the absence of significant detoxification, which possibly 
leads to toxic metabolites accumulation inside plant 
tissues, result in the generation of large volumes of 
harvested contaminated plants, which are difficult to 
dispose, along with the possibility of contaminants 
being transferred from soil to the food chain via ani¬ 
mals and insects’ activities. These significant limita¬ 
tions are mainly responsible for the limited 
application of phytoremediation in real sites so far 
and for intensive research on engineered plants, 
which could more efficiently remediate contaminated 
soil [101-103]. 

Thermal Treatment 

Thermal treatment is an ex situ or an in situ technol¬ 
ogy, which might achieve contaminated soil remedia¬ 
tion by enhancing contaminant vapors and liquids 
removal as soil temperature is increased. In particular, 
increasing soil temperature contaminant liquid 
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viscosity will result in a decrease in interfacial tension 
as well, while biodegradation rates, solubility, and/or 
volatility might be increased. The most commonly used 
thermal technologies include steam-based heating, 
conductive heating, electrical resistance heating, radio 
frequency heating, and in situ soil mixing with large 
diameter augers combined with steam and hot air 
injection. Thermal treatment presents some notewor¬ 
thy advantages in comparison with other treatment 
alternatives, the most significant of which are the 
reduced operation time, the high variability of the 
chemical that could be treated, and the low sensitivity 
to subsurface heterogeneities. However, thermal treat¬ 
ment bears several drawbacks, on the grounds that it is 
not applicable to occupied sites, is very costly, demands 
more sophisticated design and operation, has the 
potential of transferring contaminants to 
uncontaminated sites, and, finally, elevating tempera¬ 
ture after treatment which could remain in that state 
for months or years [104, 105]. 

Air Sparging 

Air sparging is an in situ remediation technology by 
which clean pressurized air is injected directly to aquifers 
mainly contaminated with fuels and VOCs. Air sparging 
is mainly an extraction technology, since the main reme¬ 
diation mechanism is the volatilization of dissolved, 
trapped, or sorbed contaminants. Additionally, oxygen 
introduced into the aquifer via aeration also promotes 
contaminants’ biodegradation, which further increases 
technology’s performance. However, since vapors of 
contaminants are the technology end-product, air sparg¬ 
ing is usually applied in conjunction either with the SVE 
or with the EISB, which undertake contaminant vapor 
treatment in the vadose zone [24, 65]. 


groundwater from the anode to the cathode, through 
a treatment zone, while electromigration induces the 
movement of charged particles to the electrode with 
opposite charge, where contaminants are removed for 
ex situ treatment. Typical contaminants that could be 
treated with electrokinetic separation include heavy 
metals, anions, and polar organics, with concentrations 
ranging from a few parts per million (ppm) to tens of 
thousands ppm [24]. Among the main limitations of 
this technology are the high energy amount required, 
the close dependence on the local geology, and the 
production of undesirable end-products. 

Soil Washing 

Soil washing is an ex situ technology by which contam¬ 
inants, such as SVOCs, heavy metals, petroleum and 
fuel residuals, PAHs, PCBs, and pesticides, are removed 
from excavated soil by physical and/or chemical sepa¬ 
ration processes [65, 107]. Some physical separation 
processes commonly used include mechanical screen¬ 
ing, hydrodynamic classification, gravity concentra¬ 
tion, froth flotation, magnetic separation, electrostatic 
separation, and attrition scrubbing. Since further 
treatment might be necessary to efficiently remediate 
excavated contaminated soil, appropriate reagents 
might be used to extract contaminants from soil 
[ 107]. These reagents are selected based on their ability 
to react with specific contaminants and typically 
include acids, chloride solutions, chelating agents, sur¬ 
factants, or reducing and oxidizing agents [108]. 
A major drawback of soil washing is the generation of 
a heavily contaminated stream of waste, which should 
be further and properly treated usually by another 
remediation technology such as incineration or biore¬ 
mediation [65]. 
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Electrokinetic Separation 

The electrokinetic separation is an in situ remediation 
technology with very limited application so far [6]. The 
principle behind this technology is the mobilization of 
contaminants and contaminated water due to an 
imposed electric field formed by the application of 
a low-voltage direct current through the soil [106]. 
The basic mechanisms of contaminants transport 
are the electroosmosis and electromigration. Electro¬ 
osmosis induces the movement of contaminated 


Incineration 

Another widely used source treatment technology is 
the incineration via which harmful chemicals, such as 
PCBs and dioxins, are volatilized, combusted in 
870-1,200° C, and, finally, destroyed and removed by 
as much as 99.9999%, while off gases and combustion 
residuals are further treated and properly disposed 
[24, 109]. The economics of incineration, however, 
are generally questionable, even more so in the treat¬ 
ment of contaminated soils. 




2450 


c 


Contaminated Soil, Remediation of 


Containment Technologies 

Containment Technologies might be used either inde¬ 
pendently or in conjunction with treatment technolo¬ 
gies, such as EISB, MNA, ISCO, etc. Typical 
Containment Technologies that are widely used today 
are the cover systems, the bottom liner, and the vertical 
engineered barriers [6]. Cover systems are surface bar¬ 
riers used to contain source material to minimize 
leachate creation and direct contact with receptors, 
while bottom liner is a subsurface barrier that prevents 
the leachate from contaminated source to migrate to 
groundwater. Finally, vertical engineered barriers are 
usually designed either to contain source material and 
groundwater or to divert contaminated groundwater 
from receptors [6]. 

Emerging Trends 

Since the need for more sustainable and more efficient 
contaminated soil remediation is continuously grow¬ 
ing, new trends have emerged to these directions. 
Among them, the waste combination approach, reme¬ 
diation with nanoparticles and phytoremediation 
with engineered plants are probably the most impor¬ 
tant ones. 

In particular, the waste combination approach 
appears to be an attractive alternative to traditional 
technologies, since waste disposal and contaminated 
soil remediation are combined. Typical examples of 
the waste combination approach are the use of fly-ash 
waste materials [39], fish cannery wastes [64], barium 
wastes [110], and red mud [111] to remediate heavy- 
metals-contaminated sites. 

The use of nanoparticles is another very promising 
emerging trend in contaminated soil remediation, 
called nanoremediation [112]. Many different 
nanoparticles have been studied so far for contami¬ 
nated soil remediation such as nanoscale zeolites, 
metal oxides, carbon nanotubes and fibers, enzymes, 
various noble metals, titanium dioxide, and nanoscale 
zero-valent iron (nZVI). However, the most important 
of these materials are the nZVI, commonly being mod¬ 
ified to improve their deliverability in porous media 
[112, 113]. Extensive research studies have demon¬ 
strated that nZVI can effectively treat a series of com¬ 
monly detected soil contaminants such as PCE, TCE, 
carbon tetrachloride, nitrate, explosives, pesticides, 


and heavy metals [114]. Although nZVI holds great 
potential becoming an efficient remediation perspec¬ 
tive, there still remain open questions concerning the 
fate, transport, and toxicity of nZVI and nanomaterials 
in general, awaiting convincing answers [115]. 

The last of the aforementioned emerging trends 
regards the use of transgenic plants to improve 
phytoremediation efficiency in soil contaminated 
with organic compounds. In particular, plants usually 
lack the catabolic enzymes which are necessary for the 
complete detoxification of organic compounds [101, 
102]. Therefore, during the last decade, intensive 
research has been conducted in order to produce and 
test new transgenic plants, such as Nicotiana tabaccum 
and Oryza sativa , which are able to completely detoxify 
organic pollutants, such as solvents, pesticides, and 
explosives [101-103]. 

Future Directions 

Contaminated soil remediation is a costly and techni¬ 
cally challenging activity, but it can be viewed as imper¬ 
ative for human health and the environments 
protection worldwide. Since our scientific knowledge 
is continuously expanding and our analytical capabili¬ 
ties are rapidly improving, requirements for cleaner 
drinking water, increased reliance on groundwater 
resources, and a drive for more extensively restored 
contaminated land will be intensified during the com¬ 
ing years. However, the high complexity of contami¬ 
nated soil problems along with the very high cost that 
soil remediation requires render appropriate contami¬ 
nated site characterization, effective design of remedi¬ 
ation schemes, and affordable implementation of the 
selected technologies great challenges for scientists, 
engineers, and practitioners. Since the cost investment 
required for remediation of all the known contami¬ 
nated sites is currently out of reach, the most significant 
contaminated site problems in terms of risk for poten¬ 
tial receptors should be prioritized and addressed first. 
Thus, sites currently posing the most significant risk to 
human health and the environment should be high in 
the list, while remediation of contaminated sites that 
pose no significant risk could be delayed. On the other 
hand, as the high cost of remediation allows practi¬ 
tioners to test merely one or two different alternatives 
in practice, the whole decision-making process should 
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be further improved. In particular, regarding site char¬ 
acterization, procedures such as the determination of 
contaminant speciation ought to be mandated as well 
as improved, in order to become standard practice 
while been more informative, widely available, and 
less expensive. In addition, as far as laboratory tech¬ 
niques are concerned, treatability studies and molecu¬ 
lar techniques should be simplified and be more 
representative of the field conditions. Furthermore, 
the rationale of waste combination should be adopted 
whenever possible, in order to achieve a twofold goal, 
namely, waste minimization and soil remediation. Tak¬ 
ing into account the high value of soil and water for 
mankind and the environment and the collectively 
unaffordable remediation costs, it can be safely con¬ 
cluded that there will be a future shift of focus in this 
subject aiming both in pollution prevention at or near 
the source and overall sustainability of remediation 
activities. 

Bibliography 

Primary Literature 

1. Howard G, Bartram J, Pedley S, Schmoll 0, Chorus I, 
Berger P (2006) Groundwater and public health. In: 
Schmoll 0, Howard G, Chilton J, Chorus I (eds) Protecting 
groundwater for health. World Health Organization, Geneva 

2. Singh A, Kuhad RC, Ward OP (2009) Biological remediation of 
soil: an overview of global market and available technologies. 
In: Singh A, Kuhad RC, Ward OP (eds) Advances in applied 
bioremediation. Springer, Berlin, pp 1-19 

3. European Environment Agency (2007) Progress in management 
of contaminated sites (CSI 015). http://www.eea.europa.eu/data- 
and-maps/indicators/progress-in-management-of-contam- 
inated-sites/progress-in-management-of-contaminated-1 

4. Vatseris C, Tsatsarelis T, Karteris A, Papadopoulos S, 
Dermatas D (2010) Environmental risk assessment model for 
potentially contaminated sites. In: Protection and restoration 
of the environment X, 5-9 July, Corfu (CD-ROM edition) 

5. Mitchell JK (1993) Fundamentals of soil behavior. Wiley, 
New York 

6. USEPA (2010) Superfund remedy report, 13th edn. Office of 
Solid Waste and Emergency Response, Washington, DC, EPA 
542-R-10-004 

7. Reddy KR (2008) Physical and chemical groundwater remedi¬ 
ation technologies. In: Darnault CJG (ed) Overexploitation and 
contamination of shared groundwater resources. Springer, 
Dordrecht, pp 257-274 

8. Yong RN (2001) Geoenvironmental engineering, contami¬ 
nated soils, pollutant fate, and mitigation. CRC Press, Boca 
Raton 


9. Reddi L, Inyang HI (2000) Geoenvironmental engineering. 
Marcel Dekker, New York 

10. Sharma HD, Reddy KR (2004) Geoenvironmental engineering, 
soil remediation, waste containment and emerging waste 
management technologies. John Wiley & Sons Inc, Hoboken, 
New Jersey 

11. Freeze RA, Cherry JA (1979) Groundwater. Prentice Hall, 
Englewood Cliffs, New Jersey 

12. Mirsal IA (2010) Soil pollution, origin, monitoring and remedi¬ 
ation. Springer, Berlin 

13. Lambe TW, Whitman RV (1969) Soil mechanics. Wiley, 
New York 

14. Dermatas D, Menounou N, Dadachov M, Dutko P, Shen G, 
Xu X, Tsaneva V (2006) Lead leachability in firing range soils. 
Environ Eng Sci 23:86-99 

15. Tessier A, Campbell PGC, Bisson M (1982) Particulate trace 
metal speciation in stream sediments and relationship with 
grain size: implication for geochemical exploration. 
J Geochem Explor 6:77-104 

16. Yong RN, Warkentin BP, Phadungchewit Y, Galvez R (1990) 
Buffer capacity and lead retention in some clay materials. 
Water Air Soil Pollut 53:53-67 

17. Yong RN, Phadungchewit Y (1993) pH Influence on selectivity 
and retention of heavy metals in some clay soil. Can Geotech 
J 30:821-833 

18. Cao X, Ma LQ, Chen M, Hardison DW, Harris WG (2003) Lead 
transformation and distribution in the soils of shooting ranges 
in Florida, USA. Sci Total Environ 307:179-189 

19. Dermatas D, Menounou N, Meng XG (2006) Mechanisms of 
lead immobilization in treated soils. Land Contam Reel 
14:43-56 

20. Borch T, Kretzschmar R, Kappler A, Van Cappellen P, Ginder- 
Vogel M, Voegelin A, Campbell K (2010) Biogeochemical redox 
processes and their impact on contaminant dynamics. Environ 
Sci Technol 44:15-23 

21. Chapelle FH, Bradley PM (2003) Redox conditions and the 
reductive/oxidative biodegradation of chlorinated ethenes in 
groundwater systems. In: Haggblom MM, Bossert ID (eds) 
Dehalogenation, microbial processes and environmental 
applications. Kluwer Academic, Boston, pp 373-384 

22. Major D, Edwards E, McCarty P, Gossett J, Hendrickson E, 
Loffler F, Zinder S, Ellis D, Vidumsky J, Harkness M, Klecka G, 
Cox E (2003) Discussion of environment vs. bacteria or let's 
play, name that bacteria. Ground Water Monit R 23:32-48 

23. NRC (2000) Natural attenuation for groundwater remediation. 
National Academic Press, Washington, DC 

24. FRTR (2007) Remediation technologies screening matrix and 
reference guide, version 4.0. http://www.frtr.gov/matrix2/ 
top_page.html 

25. Kostecki P, Morrison R, Dragun J (2005) Hydrocarbons. In: 
Hillel D (ed) Encyclopedia of soils in the environment. Elsevier, 
Oxford, pp 217-226 

26. Andreoni V, Gianfreda L (2007) Bioremediation and monitor¬ 
ing of aromatic-polluted habitats. Appl Microbiol Biotechnol 
76:287-308 


C 




2452 


Contaminated Soil, Remediation of 



27. Farhadian M, Vachelard C, Duchez D, Larroche C (2008) In situ 
bioremediation of monoaromatic pollutants in groundwater: 
a review. Bioresour Technol 99:5296-5308 

28. Gan S, Lau EV, Ng HK (2009) Remediation of soils contami¬ 
nated with polycyclic aromatic hydrocarbons (PAHs). J Hazard 
Mater 172:532-549 

29. Haggblom MM, Bossert ID (2003) Halogenated organic com¬ 
pounds - a global perspective. In: Haggblom MM, Bossert ID 
(eds) Dehalogenation, microbial processes and environmental 
applications. Kluwer Academic, Boston, pp 3-29 

30. Moran JM, Zogorski JS, Squillace PJ (2007) Chlorinated solvents 
in groundwater of the United States. Environ Sci Technol 
41:74-81 

31. McCarty PL (2010) Groundwater contamination by chlorinated 
solvents: history, remediation technologies and strategies. In: 
Stroo HF, Ward CH (eds) In situ remediation of chlorinated 
solvent plumes. Springer, New York, pp 1-28 

32. Evanko CR, Dzombak DA (1997) Remediation of metals - 
contaminated soils and groundwater. Technology Evaluation 
Report, TE-97-01. Ground-Water Remediation Technologies 
Analysis Center (GWRTAC), Pittsburg 

33. Hashim MA, Mukhopadhyay S, Sahu JN, Sengupta B (2011) 
Remediation technologies for heavy metal contaminated 
groundwater. J Environ Manage 92:2355-2388 

34. USEPA (2006) Radionuclides in soil. RadTownUSA, 
United States Environmental Protection Agency Office 
of Radiation and Indoor Air, Washington, DC, EPA 402-F- 
06-051 

35. Gavrilescu M, Pavel LV, Cretescu I (2009) Characterization and 
remediation of soils contaminated with uranium, review. 
J Hazard Mater 163:475-510 

36. Kalderis D, Juhasz AL, Boopathy R, Comfort S (2011) Soil 
contaminated with explosives: environmental fate and evalu¬ 
ation of state-of-the-art remediation processes (IUPAC techni¬ 
cal report). Pure Appl Chem 83:1407-1484 

37. Shackelford CD (1993) Contaminant transport. In: Daniel DE 
(ed) Geotechnical practice for waste disposal. Chapman and 
Hall, London, pp 33-65 

38. Rouse JV, Pyrih RZ (1993) Geochemistry. In: Daniel DE (ed) 
Geotechnical practice for waste disposal. Chapman and Hall, 
London, pp 15-32 

39. Dermatas D, Meng X (2003) Utilization of fly-ash for stabiliza¬ 
tion/solidification of heavy metal contaminated soils. Eng Geol 
70:377-394 

40. Maymo-Gatell X, Chien YT, Gossett JM, Zinder SH (1997) Isola¬ 
tion of a bacterium that reductively dechlorinates 
tetrachloroethene to ethene. Science 276:1568-1571 

41. Aulenta F, Majone M, Tandoi V (2006) Review: enhanced 
anaerobic bioremediation of chlorinated solvents: environ¬ 
mental factors influencing microbial activity and their rele¬ 
vance under field conditions. J Chem Technol Biot 81:1463- 
1474 

42. Bhatt P, Kumar MS, Mudliar S, Chalrabarti T (2007) Biodegra¬ 
dation of chlorinated compounds - a review. Crit Rev Environ 
Sci Technol 37:165-198 


43. Pollard SJT, Brookes A, Earl N, Lowe J, Kearney T, Nathanail CP 
(2004) Integrating decision tools for the sustainable manage¬ 
ment of land contamination. Sci Total Environ 325:15-28 

44. Ellis DE, Hadley PW (2009) Sustainable remediation white 
paper - integrating sustainable principles, practices, and met¬ 
rics into remediation projects. Remediation 19:5-10 

45. ASTM El 527-05 (2005) Standard practice for environmental 
site assessment: phase I environmental site assessment pro¬ 
cess. ASTM, West Conshohoken, Pennsylvania 

46. ASTM El 903-97 (2002) Standard guide for environmental site 
assessment: phase II environmental site assessment process. 
ASTM, West Conshohoken, Pennsylvania 

47. Petts J, Cairney T, Smith M (1997) Risk-based contaminated 
land investigation and assessment. John Wiley & Sons Ltd, 
Chichester 

48. Caliman FA, Robu BM, Smaranda C, Pavel VL, 
Gavrilescu M (2011) Soil and groundwater cleanup: benefits 
and limits of emerging technologies. Clean Technol Environ 
Policy 13:241-268 

49. Dermatas D, Shen G, Chrysochoou M, Grubb DG, Menounou N, 
Dutko P (2006) Pb speciation versus TCLP release in army firing 
range soils. J Hazard Mater 136:34-46 

50. Scheckel KG, Impellitteri CA, Ryan JA, Mcevoy T (2004) Assess¬ 
ment of a sequential extraction procedure for perturbed lead- 
contaminated samples with and without phosphorus amend¬ 
ments. Environ Sci Technol 37:5296-5304 

51. Cao X, Dermatas D (2007) Evaluating the applicability of reg¬ 
ulatory leaching tests for assessing lead leachability in con¬ 
taminated shooting range soils. Environ Monit Assess 
139:1-13 

52. Kosson DS, van der Sloot HA, Sanchez F, Garrabrants AC 
(2002) An integrated framework for evaluating leaching in 
waste management and utilization of secondary materials. 
Environ Eng Sci 19:159-204 

53. van der Sloot HA, van Zomeren A, Meeuwsen HCL, Seignette P, 
Bleyerveld R (2007) Test method selection, validation against 
field data, and predictive modelling for impact evaluation of 
stabilized waste disposal. J Hazard Mater 141:354-369 

54. Dermatas D, Chrysochoou M, Moon DH (2008) Geoenvir- 
onmental characterization to assess waste stabilization/solid¬ 
ification treatment performance and sustainability. 
GeoCongress 2008: Geotechnics of Waste Management and 
Remediation, pp 660-667 

55. Fennell DE, Gossett JM (2003) Microcosms for site-specific 
evaluation of enhanced biological reductive dehalogenation. 
In: Haggblom MM, Bossert ID (eds) Dehalogenation, microbial 
processes and environmental applications. Kluwer Academic, 
Boston, pp 385-420 

56. USEPA (2001) A citizen's guide to solidification/stabilization. 
Office of Solid Waste and Emergency Response, Washington, 
DC, EPA 542-F-01-024 

57. USEPA (2009) Technology performance review: selecting and 
using solidification/stabilization treatment for site remedia¬ 
tion. National Risk Management Research Laboratory, 
Washington, DC, EPA 600-R-09-148 



Contaminated Soil, Remediation of 


2453 



58. USEPA (2006) In situ treatment technologies for contaminated 
soil. Office of Solid Waste and Emergency Response, 
Washington, DC, EPA 542/F-06/013 

59. Mitchell JK, Dermatas D (1992) Clay soil heave caused by lime- 
sulfate reactions. Innovations in uses for lime, ASTM STP 1135. 
American Society for Testing and Materials (ASTM), 
Philadelphia, pp 41-64 

60. USEPA (1999) Solidification/stabilization resource guide. 
Office of Solid Waste and Emergency Response, Washington, 
DC, EPA 549-B-99-002 

61. Conner JR (1990) Chemical fixation and solidification of haz¬ 
ardous wastes. VNR, New York 

62. USEPA (2001) Best management practices for lead at outdoor 
shooting ranges. Division of Enforcement and Compliance 
Assistance, New York, EPA-902-B-01-001 

63. Chrysochoou M, Dermatas D, Grubb DG (2007) Phosphate 
application to firing range soils for Pb immobilization: the 
unclear role of phosphate. J Hazard Mater 144:1-14 

64. ESTCP (2006) PIMS mt : remediation of soil and groundwater con¬ 
taminated with metals, cost and performance report (ER-0020) 

65. Khan FI, Hussain T, Hejazi R (2004) An overview and analysis of 
site remediation technologies. Environ Manage 71:95-122 

66. Parsons Corporation (2004) Principles and practices of 
enhanced anaerobic bioremediation of chlorinated solvents 
Prepared for AFCEE (Air Force Center for Environmental Excel¬ 
lence), NFESC (Naval Facilities Engineering Service Center) and 
ESTCP (Environmental Security Technology Certification 
Program) 

67. USEPA (2006) In situ and ex situ biodegradation technologies 
for remediation of contaminated sites. Office of Research and 
Development National Risk Management, Cincinnati, EPA 
625-R-06-015 

68. Bradley PM, Chapelle FH (2010) Biodegradation of chlorinated 
ethenes. In: Stroo HF, Ward CH (eds) In situ remediation of 
chlorinated solvent plumes. Springer, New York, pp 39-67 

69. Cupples AM, Spormann AM, McCarty PL (2003) Growth of 
a De/ra/ococco/c/es-like microorganism on vinyl chloride and 
c/'s-dichloroethene as electron acceptors as determined by 
competitive PCR. Appl Environ Microbiol 69:953-959 

70. He J, Ritalahti KM, Yang KL, Koenigsberg SS, Loffler FE 
(2003) Detoxification of vinyl chloride to ethene coupled to 
growth of an anaerobic bacterium. Nature 424:62-65 

71. He J, Sung Y, Krajmalnik-Brown R, Ritalahti KM, Loffler FE 
(2005) Isolation and characterization of Dehalococcoides sp. 
strain FL2, a trichloroethene (TCE)- and 1,2-dichloroethene- 
respiring anaerobe. Environ Microbiol 7:1442-1450 

72. Sung Y, Ritalahti KM, Apkarian RP, Loffler FE (2006) Quantita¬ 
tive PCR confirms purity of strain GT, a novel trichloroethene- 
to-ethene-respiring Dehalococcoides isolate. Appl Environ 
Microbiol 72:1980-1987 

73. WiedemeierTH, Swanson MA, Moutoux DE, Gordon EK, Wilson 
JT, Wilson BH, Kampbell DH, Hansen JE, Haas P, Chapelle FH 
(1998) Technical protocol for evaluating natural attenuation of 
chlorinated solvents in groundwater. Office of Research and 
Development, Cincinnati, EPA 600-R-98-128 


74. Panagiotakis I, Mamais D, Pantazidou M, Marneri M, 
Parapouli M, Hatziloukas E, Tandoi V (2007) Dechlorinating 
ability of TCE-fed microcosms with different electron donors. 
J Hazard Mater 149:582-589 

75. Smatlak CR, Gossett JM, Zinder SH (1996) Comparison kinetics 
of hydrogen utilization for reductive dechlorination of 
tetrachloroethene and methanogenesis in an anaerobic 
enrichment culture. Environ Sci Technol 30:2850-2858 

76. Ballapragada BS, Stensel HD, Puhakka JA, Ferguson JF 
(1997) Effect of hydrogen on reductive dechlorination of chlo¬ 
rinated ethenes. Environ Sci Technol 31:1728-1734 

77. Fennell DE, Gossett JM, Zinder SH (1997) Comparison of 
butyric acid, ethanol, lactic acid, and propionic acid as 
hydrogen donors for the reductive dechlorination of 
tetrachloroethene. Environ Sci Technol 31:918-926 

78. Yang Y, McCarty PL (1998) Competition for hydrogen within 
a chlorinated solvent dechlorinating anaerobic mixed culture. 
Environ Sci Technol 32:3591-3597 

79. Pantazidou M, Panagiotakis I, Mamais D, Zikidi V (2011) 
Chloroethene biotransformation in the presence of varying 
sulfate concentrations. Ground Water Monit R (to appear) 
doi:10.1111/jl 745-6592.2011.01372.x 

80. USEPA (2004) How to evaluate alternative cleanup technolo¬ 
gies for underground storage tank sites: a guide for corrective 
action plan reviewers. Office of Solid Waste and Emergency 
Response, Washington, DC, EPA/510/R-04-002 

81. Hendrickson ER, Payne JA, Young RM, Starr MG, Perry MP, 
Fahnestock S, Ellis DE, Ebersole RC (2002) Molecular analysis 
of Dehalococcoides 16S ribosomal DNA from chloroethene- 
contaminated sites throughout North America and Europe. 
Appl Environ Microbiol 68:485-495 

82. El Fantroussi S, Agathos SN (2005) Is bioaugmentation 
a feasible strategy for pollutant removal and site remediation? 
Curr Opin Microbiol 8:268-275 

83. USEPA (1999) Monitored natural attenuation of petroleum 
hydrocarbons. Office of Research and Development, 
Washington, DC, EPA 600-F-98-021 

84. Stiber NA, Pantazidou M, Small MJ (1999) Expert system meth¬ 
odology for evaluating reductive dechlorination at TCE sites. 
Environ Sci Technol 33:3012-3020 

85. Wiedemeier TH, Swanson M, Moutoux DE, Gordon EK, 
Wilson JT, Wilson BH, Kampbell DH, Hansen JE, Haas P, 
Chapelle FH (1998) Technical protocol for evaluating 
natural attenuation of chlorinated solvents in ground- 
water. U.S. Government Printing Office, Cincinnati, EPA 
600-R-98-128 

86. USEPA (2006) Off-gas treatment technologies for soil vapor 
extraction systems: state of the practice. Office of Solid 
Waste and Emergency Response, Washington, DC, EPA 542- 
R-05-028 

87. Albergaria JT, Alvim-Ferraz MCM, Delerue-Matos C (2006) 
Remediation efficiency of vapour extraction of sandy soils 
contaminated with cyclohexane: influence of air flow rate, 
water and natural organic matter content. Environ Pollut 
143:146-152 


C 




2454 


c 


Contaminated Soil, Remediation of 


88. Soares AA, Albergaria JT, Domingues VF, Alvim-Ferraz MCIVI, 
Delerue-Matos C (2010) Remediation of soils combining soil 
vapor extraction and bioremediation: benzene. 
Chemosphere 80:823-828 

89. ITRC (2005) Technical and regulatory guidance for in situ 
chemical oxidation of contaminated soil and groundwater, 
2nd edn. In Situ Chemical Oxidation Team, Washington, DC 

90. Krembs FJ, Siegrist RL, Crimi ML, Furrer RF, Petri BG (2010) ISCO 
for groundwater remediation: analysis of field applications and 
performance. Ground Water Monit R 30:42-53 

91. USEPA (2006) In situ chemical oxidation. Office of Research 
and Development National Risk Management Research Lab¬ 
oratory, Washington, DC, EPA 600-R-06-072 

92. USEPA (1999) Multi-phase extraction: state-of-the-practice. 
Solid Waste and Emergency Response, Washington, DC, 
EPA 542-R-99-004 

93. Thiruvenkatachari R, Vigneswaran S, Naidu R (2008) Perme¬ 
able reactive barrier for groundwater remediation, review. 
J Ind Eng Chem 14:145-156 

94. USEPA (1998) Permeable reactive barrier technologies for 
contaminant remediation. Office of Research and Develop¬ 
ment, Office of Solid Waste and Emergency Response, 
Washington, DC, Report number EPA 600-R-98-125 

95. Gillham RW, Vogan J, Gui L, Duchene M, Son J (2010) Iron 
barrier walls for chlorinated solvent remediation. In: Stroo HF, 
Ward CH (eds) In situ remediation of chlorinated solvent 
plumes. Springer, New York, pp 537-571 

96. USEPA (1996) Pump-and-treat ground-water remediation a 
guide for decision makers and practitioners. Office of 
Research and Development, Cincinnati, EPA 625-R-95-005 

97. Roote DS (1997) In situ flushing. GWRTAC, Technology over¬ 
view report, TO-97-02 

98. Dermatas D, Korfiatis GP, Kostarellos D (1993) Soil column 
study of DNAPL recovery using surfactant flushing. In: 
National conference on environmental engineering, July, 
1993, Montreal, vol 2, pp 1411-1418 

99. USEPA (2001) A citizen's guide to phytoremediation. Office of 
Solid Waste and Emergency Response, Washington, DC, EPA 
542-F-01-002 

100. ITRC (2009) Phytotechnology technical and regulatory guid¬ 
ance and decision trees, revised. ITRC, Washington, DC 

101. Van Aken B (2008) Transgenic plants for phytoremediation: 
helping nature to clean up environmental pollution. Trend 
Biotechnol 26:225-227 

102. Van Aken B (2009) Transgenic plants for enhanced 
phytoremediation of toxic explosives. Curr Opin Biotechnol 
20:231-236 

103. Abhilash PC, Jamil S, Singh N (2009) Transgenic plants for 
enhanced biodegradation and phytoremediation of organic 
xenobiotics. Biotechnol Adv 27:474-488 

104. USEPA (2004) In situ thermal treatment of chlorinated solvents 
fundamentals and field applications. Office of Solid Waste and 
Emergency Response, Washington, DC, EPA 542-R-04-010 

105. Kingston JT, Dahlen PR, Johnson PC, Foote E, 
Williams S (2010) Critical evaluation of state-of-the-art in 


situ thermal treatment technologies for DNAPL source zone 
treatment. ESTCP project final report 

106. Virkutytea J, Sillanpaa M, Latostenmaa P (2002) Electrokinetic 
soil remediation - critical overview. Sci Total Environ 
289:97-121 

107. Dermont G, Bergeron M, Mercier G, Richer-Lafleche M 
(2008) Soil washing for metal removal: a review of physical/ 
chemical technologies and field applications. J Hazard Mater 
152:1-31 

108. Chu W, Chan KH (2003) The mechanism of the surfactant- 
aided soil washing system for hydrophobic and partial hydro- 
phobic organics. Sci Total Environ 307:83-92 

109. USEPA (2002) A citizen's guide to incineration. Office of Solid 
Waste and Emergency Response, Washington, DC, EPA 
542-F-01-018 

110. Chrysochoou M, Dermatas D, Moon DH, Christodoulatos C, 
Wazne M, French C, Morris J, Kaouris M (2006) Investigation 
of barium treatment of chromite ore processing residue 
(COPR). In: Fukue M, Kita K, Ohtsubo M, Chaney R (eds) 
Contaminated sediments, evaluation and remediation 
techniques. ASTM, West Conshohocken, pp 165-175 

111. Lombi E, Zhao F-J, Zhang G, Sun B, Fitz W, Zhang H, McGrath SP 
(2002) In situ fixation of metals in soils using bauxite residue: 
chemical assessment. Environ Pollut 118:435-443 

112. Karn B, Kuiken T, Otto M (2009) Nanotechnolgy and in situ 
remediation: a review of the benefits and potential risks. 
Environ Health Persp 117:1823-1831 

113. Phenrat T, Saleh N, Sirk K, Tilton RD, Lowry GV (2007) Aggre¬ 
gation and sedimentation of aqueous nanoscale zerovalent 
iron dispersions. Environ Sci Technol 41:284-290 

114. X-Q Li, Elliott DW, Zang W-X (2006) Zero-valent iron nano¬ 
particles for abatement of environmental pollutants: materials 
and engineering aspects. Crit Rev Solid State 31:111—122 

115. Keane E (2009) Fate, transport, and toxicity of nanoscale 
zero-valent iron (nZVI) used during superfund remediation. 
USEPA, Office of Solid Waste and Emergency Response, 
Office of Superfund Remediation and Technology Innova¬ 
tion, Washington, DC 

Books and Reviews 

NRC (2004) Contaminants in the subsurface, source zone assess¬ 
ment and remediation. The National Academic Press, 
Washington, DC 

Nielsen DM (2006) Environmental site characterization and 
ground-water monitoring. Taylor & Francis, Boca Raton 

Pankow JF, Cherry JA (1996) Dense chlorinated solvents and other 
DNAPLs in groundwater history, behavior, and remediation. 
Waterloo Press, Portland 

Sara MN (2003) Site assessment and remediation handbook. Lewis 
Publishers, Boca Raton 

Siegrist RL, Crimi M, Simpkin TJ (2011) In situ chemical oxidation 
for groundwater remediation. Springer, New York 

Stroo HF, Ward CH (2010) In situ remediation of chlorinated 
solvent plumes. Springer, New York 



Contemporary Sea Level Variations, Observations and Causes 


2455 



Contemporary Sea Level Variations, 
Observations and Causes 

Anny Cazenave 

LEGOS, Laboratoire d’Etudes en Geophysique 
et Oceanographie Spatiales, Toulouse, France 

Article Outline 

Glossary 

Definition of the Subject 
Introduction 

Observations of Sea Level Change and Variability 
(Twentieth Century and Last Two Decades) 

Causes of Present-Day Sea Level Changes (Global Mean 
and Regional Variability) 

Future Sea Level Rise 
Impacts of Sea Level Rise 
Future Directions 
Bibliography 

Glossary 

Glacier Large persistent ice body, generally formed in 
mountain areas by snow accumulation during 
winter and further compaction into ice. 

Ice sheets Ice bodies covering the Greenland and 
Antarctica continents. 

Mean sea level A measure of the average height of the 
ocean s surface with respect to a fixed reference surface. 
Satellite altimetry Space technique dedicated to the 
measurement of the height of the sea surface from 
a satellite-borne radar altimeter. 

Sea level rise Sea level rises in response to global 
warming. The components that cause sea level rise 
are ocean thermal expansion and freshwater mass 
addition to the oceans due to land ice melt and land 
water-storage decrease. 

Steric sea level Contribution to observed sea level due 
to temperature and salinity variations. 

Thermal expansion Volume change of ocean water 
response to a change in temperature. 

Definition of the Subject 

Sea level change is a very sensitive index of climate 
change and variability. For example, as the ocean 
warms in response to global warming, seawaters 


expand, and thus sea level rises. When mountain gla¬ 
ciers melt in response to increasing air temperature, sea 
level rises because of freshwater mass input to the 
oceans. Similarly, ice mass loss from the ice sheets causes 
sea level rise. Corresponding increase of freshwater into 
the oceans changes water salinity, hence seawater density 
as well as ocean circulation that in turn affects sea level 
and its spatial variability. Modification of the land 
hydrological cycle due to climate variability and direct 
anthropogenic forcing leads to increased or decreased 
runoff, hence ultimately to sea level change. Hence local 
and regional climate changes may affect the sea level. 

Introduction 

Sea level variations spread over a very broad spectrum. 
The largest global-scale sea level changes (100-200 m in 
amplitude) occurred on geological timescales 
(~100 myr) and depended primarily on tectonics 
processes (e.g., large-scale change in the shape of ocean 
basins associated with seafloor spreading and mid¬ 
ocean ridges expansion). With the formation of long- 
live ice sheets (e.g., formation of the Antarctica ice sheet 
about 35 myr ago) global mean sea level dropped by 
about 60 m. Cooling of the Earth since about 3 myr ago 
has led to glacial/interglacial cycles driven by changes of 
the Earths orbit and obliquity. Quasi-periodic growth 
and decay of the northern hemisphere ice caps on 
a timescale of tens of thousands of years have produced 
large oscillations of the global mean sea level, on the 
order of 100 m. On shorter (decadal to multi-centen¬ 
nial) timescales sea level fluctuations are mainly driven 
by natural forcing factors (e.g., solar radiation, volcanic 
eruptions) and internal variability the climate system 
(e.g., atmosphere-ocean perturbations such as El Nino- 
Southern Oscillation (ENSO), North Atlantic Oscilla¬ 
tion (NAO), and Pacific Decadal Oscillation (PDO)). 
Since the beginning of the industrial era, sea level is also 
responding to anthropogenic global warming. In this 
chapter, only recent (last few decades) sea level varia¬ 
tions and their causes are discussed. 

Observations of Sea Level Change and Variability 
(Twentieth Century and Last Two Decades) 

Twentieth Century 

Our knowledge of the past century sea level change 
comes from tide gauge measurements located along 
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continental coastlines and islands. The largest tide gauge 
data base of monthly and annual mean sea level records 
is the Permanent Service for Mean Sea Level (PSMSL, 
www.psmsl.org) which contains data for the twentieth 
century from ~2,000 sites maintained by about 200 
nations. The records are somewhat inhomogeneous in 
terms of data length and quality. For long-term sea level 
studies, only ~10% of this data set is useable because of 
data gaps. Tide gauges measure sea level relatively to the 
ground, hence monitor also ground motions. In active 
tectonic and volcanic regions, or in areas subject to 
strong ground subsidence due to natural causes (e.g., 
sediment loading in river deltas) or human activities 
(groundwater pumping and oil/gas extraction), tide 
gauge data are directly affected by the corresponding 
ground motions. Postglacial rebound, the visco-elastic 
response of the Earth to last deglaciation (also called 
Glacial Isostatic Adjustment (GIA)) is another process 
that gives rise to vertical land movement. 

After the ~130 m sea level rise associated with the 
deglaciation that followed the Last Glacial Maximum 


~20,000 years ago, geological, geochemical and 
archeological observations indicate that the mean sea 
level remained almost stable during the last 2-3 millennia 
[1, 2]. However, since the late nineteenth century tide 
gauge records have shown significant sea level rise. Dur¬ 
ing the twentieth century, a mean rate of ~ 1.8 mm/year 
is reported [3-5]. Figure 1 shows tide gauge-based sea 
level evolution for the twentieth century. 

Satellite Altimetry Era 

Since the early 1990s, satellite altimetry has become the 
main tool for precisely and continuously measuring sea 
level with quasi-global coverage and a few days revisit 
time. Compared to tide gauges which provide sea level 
relative to the ground, satellite altimetry measures 
“absolute” sea level variations. The concept of the 
satellite altimetry measurement is simple: the onboard 
radar altimeter transmits microwave radiation toward 
the sea surface which partly reflects back to the satellite. 
Measurement of the round-trip travel time provides 
the height of the satellite above the instantaneous sea 
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Global mean sea level evolution (1900-2000) during the twentieth century (Data based on tide gauges; red points from 
Ref. [3]; blue points from Ref. [5]) 
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surface (called “range”). The quantity of interest in 
oceanography is the sea surface height above a fixed 
reference surface (typically a conventional reference 
ellipsoid). It is obtained by the difference between the 
altitude of the satellite above the reference (deduced 
from precise orbitography) and the range measure¬ 
ment. The estimated sea surface height needs be 
corrected for various factors due to ionospheric and 
atmospheric delay, and of biases between the mean 
electromagnetic scattering surface and sea at the air-sea 
interface. Other corrections due geophysical effects, 
such as solid Earth, and pole and ocean tides, are also 
applied. High-precision satellite altimetry started with 
the launch of Topex/Poseidon in 1992 and its succes¬ 
sors Jason-1 and Jason-2 launched in 2001 and 2008, 
respectively. The precision of an individual sea surface 
height measurement based on these missions has 
reached the 1-2 cm level. Precision on the global 
mean rate of rise is currently of — 0.4-0.5 mm/year. 
This value is based on error budget analyses of all 
sources of errors affecting the altimetry system or on 
comparisons with tide gauge-based sea level measure¬ 
ments (e.g., [6-10]). The temporal evolution of the 
global mean sea level from satellite altimetry since 


early 1993 is shown in Fig. 2. It is characterized by an 
almost linear increase (except for temporary anomalies 
associated with the 1997-1998 El Nino and the 2007- 
2008 and 2010-2011 La Ninas). Over this 18-year-long 
period, global mean sea level has been rising at a rate 
of-3.2 ± 0.4 mm/year. This rate is significantly higher 
than the mean rate recorded by tide gauges over the 
past decades. 

Satellite altimetry has also revealed that sea level is 
not rising uniformly (Fig. 3). In some regions 
(e.g., Western Pacific), the rates of sea level rise have 
been faster than the global mean rate by a factor of up 
to 3 over the past two decades. In other regions rates 
have been slower than the global mean (e.g., Eastern 
Pacific). 

Causes of Present-Day Sea Level Changes (Global 
Mean and Regional Variability) 

Global Scale 

The main factors causing current global mean sea level 
rise are thermal expansion of seawaters, land ice loss, 
and freshwater mass exchange between oceans and land 
water reservoirs. These contributions vary in response 
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Global mean sea level evolution from satellite altimetry (1993-2010) (Updated from Ref. [19]) 
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Contemporary Sea Level Variations, Observations and Causes. Figure 3 

Observed spatial trend patterns in sea level over the altimetry era (1993-2010) (Updated from Ref. [19]) 


to natural climate variability and to global climate 
change induced by anthropogenic greenhouse gases 
emissions. 

Ocean Warming Analyses of in situ ocean 
temperature data collected over the past 50 years by 
ships and recently by Argo profiling floats [11] indicate 
that ocean heat content, and hence ocean thermal 
expansion, has significantly increased since 1950 
(e.g., [12-14]). Ocean warming explains about 25% 
of the observed sea level rise of the last few decades 
(e.g., [ 15] ). However, this value is likely a lower bound, 
due to the lack of hydrographic data in remote regions 
of the southern hemisphere and in the deep ocean 
(below 1,000 m). Recent reevaluation of ocean 
warming estimates to >40% this contribution to past 
decades sea level rise [16]. A steep increase was 
observed in thermal expansion over the decade 
1993-2003 (e.g., [15, 17]), but since about 2003, 
thermal expansion has increased less rapidly [14, 18]. 
The recent slower rate of steric rise likely reflects 


short-term variability rather than a new long-term 
trend. On average, over the satellite altimetry era 
(1993-2010), the contribution of ocean warming to 
sea rise accounts for ~ 30-40% [19, 20]. 

Glaciers Melting Being very sensitive to global 
warming, mountain glaciers and small ice caps have 
retreated worldwide during the recent decades, with 
significant acceleration during the 1990s. From mass 
balance studies of a large number of glaciers, estimates 
have been made of the contribution of glacier’s ice melt 
to sea level [21-24]. For the period 1993-2010, glaciers 
and ice caps have accounted for ~ 30% to the observed 
sea level rise [19, 20]. 

Ice Sheets If totally melted, Greenland and West 
Antarctica would raise sea level by about 7 m and 
3-5 m, respectively. Thus even a small amount of ice 
mass loss from the ice sheets would produce substantial 
sea level rise, with adverse societal and economical 
impacts on vulnerable low-lying coastal regions. 

















































Contemporary Sea Level Variations, Observations and Causes 


C 


2459 


Since the early 1990s, different remote sensing obser¬ 
vations (airborne and satellite radar and laser altimetry, 
Synthetic Aperture Radar Interferometry (InSAR), and 
since 2002, space gravimetry from the GRACE mission) 
have provided important observations of the mass 
balance of the ice sheets. These data indicate that 
Greenland and West Antarctica are loosing mass at an 
accelerated rate (e.g., [25, 26]). Most recent mass bal¬ 
ance estimates from GRACE space gravimetry suggest 
for the 2002-2009 time span, averaged rates of ice mass 
loss of 240 Gt/year for Greenland and 150-200 Gt/year 
for Antarctica (e.g., [27, 28] ), in rather good agreement 
with INSAR (e.g., [29, 30] ) and radar and laser altim¬ 
etry results (e.g., [31]). The space-based observations 
unambiguously show ice mass loss acceleration in 
the recent years. For the period 1993-2003, <15% of 
the rate of global sea level rise was due to the ice sheets 
[21]. But their contribution has increased up to ~75% 
since 2003-2004, about equally split between Green¬ 
land and Antarctica (e.g., [29, 30]). Accelerated polar 
ice loss over the past few years has more than compen¬ 
sated recent slower rate of ocean thermal expansion, 
and as a result, sea level has continued to rise at almost 
the same rate (e.g., [32]). Although not constant 
through time, on average over 2003-2010, ice 
sheets mass loss explains ~25% of the rate of sea level 
rise [19, 20]. 

There is more and more evidence that recent 
negative ice sheet mass balance mainly results from 
rapid outlet glacier flow along some margins of 
Greenland and West Antarctica, and further iceberg 
discharge into the surrounding ocean (e.g., [33-38]). 
This process, known as dynamic thinning, is often 
observed in regions where glaciers are grounded below 
sea level (e.g., in the Amundsen Sea sector, West Ant¬ 
arctica). Thinning and subsequent breakup of floating 
ice tongues or ice shelves that buttressed the glaciers 
result in rapid grounding line retreat and accelerated 
glacier flow. Several recent observations have suggested 
that warming of subsurface ocean waters may trigger 
these dynamical instabilities [34-38]. 

Land Water Storage Change in land water storage, 
due to natural climate variability and human activities 
(i.e., anthropogenic changes in the amount of water 
stored in soils, reservoirs and aquifers as a result of 
dam building, underground water mining, irrigation, 


urbanization, deforestation, etc.) is another potential 
contribution to sea level change. Model-based esti¬ 
mates of land water-storage change caused by natural 
climate variability suggest no long-term contribution 
to sea level for the past few decades, although 
interannual/decadal fluctuations have been significant. 
Since 2002, space gravimetry observations from the 
GRACE space mission now allow determination of 
the total (i.e., due to climate variability and human 
activities) land water contribution to sea level. 
GRACE data confirm that the land water signal is 
dominated by interannual variability with only 
a modest contribution (<10%) to the trend [39, 40]. 
On the other hand, intensive dam building along rivers 
during the second half of the twentieth century 
may have lowered sea level by rvj — 0.5 mm/year 
[41], but groundwater pumping, in particular for irri¬ 
gation, may have more or less counterbalanced this 
effect. 

Sea Level Budget over 1993-2010 Although none of 
the climate factors discussed above evolve linearly with 
time, on average over the 1993-2010 time span, ocean 
warming, glaciers melting, and ice sheet mass loss have 
each contributed by roughly 30% to global mean sea 
level rise [19, 20]. 

Regional Scale 

The regional variability in sea level trends is mainly due 
large-scale changes in the density structure of the 
oceans in response to forcing factors (e.g., heat and 
freshwater exchange at the sea-air interface) and 
changes of the ocean circulation [15]. The largest 
regional changes in sea level trends result from ocean 
temperature change (i.e., from nonuniform thermal 
expansion) but in some regions, change in water 
salinity is also important (e.g., [42, 43]). 

Observations of ocean temperature over the past 
few decades show that trend patterns in thermal expan¬ 
sion are not stationary but fluctuate both in space and 
time in response to internal perturbations of the cli¬ 
mate system such as ENSO, NAO, and PDO [15]. As 
a result, sea level trend patterns observed by satellite 
altimetry over the last 18 years are expected to be 
different from those of the last 50 years, as confirmed 
by past sea level reconstructions (e.g., [44, 45]). 
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Processes other than thermal expansion may also 
give rise to regional sea level variations. This is the case 
for the response of the solid Earth to the last deglacia¬ 
tion (GIA). The meltwater from the former ice sheets 
does not redistribute uniformly over the oceans 
because of several processes: self-gravitation between 
ice and water masses, ocean basin deformations asso¬ 
ciated with the viscoelastic response of the Earth to the 
changing load, and changes of the Earth’s rotation due 
to water and ice mass redistribution [46, 47]. On-going 
ice mass loss of the Greenland and Antarctica ice sheets 
also leads to nonuniform sea level rise because of about 
the same processes [48, 49]. These regional sea level 
changes are broadscale but their regional fingerprint is 
different for each melting source (Greenland, Antarc¬ 
tica, glaciers). Amplification by up to 30% are expected 
far from the melting source while significant sea level 
drop should occur around the region which is losing ice. 

Future Sea Level Rise 

IPCC AR4 projections based on coupled climate models 
indicate that sea level should be higher than today’s 
value by ~ 40 cm by 2100 (within a range of =b 20 cm 
due to model results dispersion and uncertainty on 
future greenhouse gases emissions) [50]. However this 
value is possibly a lower bound because it only accounts 
for future ocean warming, glaciers melting and changes 
in ice sheet surface mass balance. As discussed above, 
a large proportion of Greenland and West Antarctica 
ice mass loss results from coastal glacier flow into the 
ocean through complex dynamical instabilities. Such 
processes have begun to be understood only recently 
and thus were not taken into account in the IPCC AR4 
sea level projections. Recent studies suggest that 
account of ice sheet fast dynamics will increase future 
sea level rise (e.g., [51]). But the exact global mean sea 
level elevation by the end of the 21st century remains 
quite uncertain. Present-day sea level rise is not uniform; 
this is also expected for the future. The regional sea level 
map for 2090-2100 provided by IPCC AR4 (average of 
16 models for one emission scenario; Ref. [50]) shows 
higher than average sea level rise in the Arctic Ocean due 
to decrease in salinity in response to fresh water input 
and along a narrow band in the south Atlantic and south 
Indian oceans. Recent studies confirm this general 
behaviour (e.g., [52]). 


Impacts of Sea Level Rise 

Sea level rise is a major concern for populations living in 
low-lying coastal regions (about 25% of human beings) 
because it will give rise to inundation, wetland loss, 
shoreline erosion, and saltwater intrusion in surface 
water bodies and aquifers, and will raise water tables 
[53] . Moreover, in many coastal regions of the world, the 
effects of rising sea level act in combination with other 
natural and/or anthropogenic factors, such as decreased 
rate of fluvial sediment deposition in deltaic areas, 
ground subsidence due to tectonic activity or ground- 
water pumping, and hydrocarbon extraction [54]. 

Besides factors that modify shoreline morphology 
(e.g., sediment deposition in river deltas, change in 
coastal waves and currents), what does matter in coastal 
regions is relative sea level rise, i.e., the combination of 
sea level rise and vertical ground motions. In many 
coastal regions of the world, these two factors are 
currently of the same order of magnitude and most 
often of opposite sign (sea level rises and ground sub¬ 
sides). Accelerated ground subsidence is reported in 
many regions, either because of local groundwater with¬ 
drawal (Tokyo subsided by 5 m, Shanghai by 3 m, and 
Bangkok by 2 m during the last decades; Ref. [53]) or 
hydrocarbon extraction (e.g., Gulf Coast where ground 
subsidence in the range 5-10 mm/year is observed; Ref. 
[55]). Whatever the causes, ground subsidence directly 
interacts with and amplifies climate-related sea level rise 
(long-term trend plus regional variability). However, if 
sea level continues to rise at current rates and more 
likely accelerates, the climate factors (sea level rise) will 
become dominant. However, it remains difficult to 
quantify future sea level rise in specific regions where 
various factors interfere in a complex way. 

Future Directions 

Most recent developments indicate that sea level is 
currently rising slightly faster since the early 1990s 
than during the previous decades. Owing to the recent 
progress in understanding the causes of present-day sea 
level rise, the sea level budget for the period 1993-2010 
can be nearly closed. Approximately 30-40% of the rate 
of sea level rise is due to ocean thermal expansion in 
response to ocean warming. Mass loss in mountain 
glaciers and ice sheets accounts for the remaining 
60-70%. Sea level will continue to rise in the future 
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decades because of expected increased global warming. 
However, the exact amount of sea level rise in the 
coming decades is still an open question. The main 
source of uncertainty is the future behavior of the 
Greenland and Antarctica ice sheets in a warming cli¬ 
mate. Improved understanding and modeling of the 
complex dynamical response of the ice sheets to global 
warming is among the priorities of the international 
climate research community. In parallel, observing 
change and variability of sea level and its different com¬ 
ponents using in synergy various in situ and space-based 
observing systems is another important goal. 

The recently launched high-precision altimeter 
satellites of the Topex/Jason series and their expected 
successors will provide continuity in the monitoring of 
sea level variations from space on multidecadal time- 
scale. In addition to ocean temperature and salinity 
measurements from the international Argo project, 
mass balance of the ice sheets from GRACE and 
other remote sensing techniques, GRACE-based land 
water-storage change and in situ and remote observa¬ 
tions of mountain glaciers will provide invaluable 
observations for understanding the causes of sea level 
variations. 

Sea level changes involve interactions of all compo¬ 
nents of the climate system (oceans, ice sheets and 
glaciers, atmosphere, land water reservoirs) on a wide 
range of spatial and temporal scales. Even the solid 
Earth through its visco-elastic response to water mass 
redistribution affects sea level. Systematic monitoring 
of oceans, cryosphere, and land waters from in situ and 
space-observation systems are thus crucial to validate 
climate models, and hence improve future sea level 
projections. Considering the highly negative impact 
of future sea level rise for society, the multidisciplinary 
aspects of sea level rise (observations, modeling, coastal 
impact studies) should remain a major area of future 
climate research. 
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Glossary 

Cognitive vehicle A road vehicle equipped with sen¬ 
sors like video, radar, or lidar and information 
processing resources for environment perception. 
Furthermore, cognitive vehicles may have human- 
machine interfaces, actuators for automatic 
intervention or fully autonomous operation, and 
wireless communication devices. 

Common relevant picture Data structure containing 
the information necessary for situation analysis and 
decision making within a cooperative group. The 
information originates from the sensors of the vehi¬ 
cles, is enhanced by fusion algorithms, and is com¬ 
municated to all vehicles of the group. 

Cooperative group (CG) A set of cognitive vehicles 
which cooperate in order to mitigate dangerous 
traffic situations. 

Dangerous situation Traffic situation with immediate 
threat of an accident. 
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Driver assistance system Technical system supporting 
human drivers in performing certain driving tasks. 
Driver assistance systems can be classified into 
comfort systems (not subject of this entry) and 
safety systems. 

Motion planning Algorithmic planning of feasible, 
collision-free vehicle trajectories. 

Prediction Projection of system states into the near 
future. Specifically, in this context, model-based 
prognosis of vehicle position. 

Wireless vehicular communication Networking of 
multiple vehicles using wireless local area network 
or cellular communication technologies. 

Definition of the Subject 

The subject of this entry is cooperation among cogni¬ 
tive vehicles with the aim of increasing traffic safety. 
This includes the formation of cooperative groups, the 
recognition of dangerous situations using stochastic 
prediction methods, and the planning of cooperative 
maneuvers to avoid collisions (see Fig. 1). 

Introduction 

Intelligent vehicle technologies have been a topic of exten¬ 
sive research for the last decades. Important research 


areas include perception of roads, obstacles and environ¬ 
ment, situation recognition, vehicle control, and behavior 
planning. Most of the developed technologies can be used 
both for driver assistance systems and for fully autono¬ 
mous vehicles. Regarding driver assistance, some comfort 
and safety systems have been released to market, e.g., 
adaptive cruise control, antilock braking, and emergency 
brake assist systems [1,2]. State-of-the-art autonomous 
road vehicles have been demonstrated at the DARPA 
Urban Challenge 2007, where several automated vehi¬ 
cles completed driving missions lasting for hours in 
a restricted traffic environment [3]. 

Cooperative systems offer even more potential 
for increasing safety and improving traffic flow. 
In recent years, there has been a lot of research on 
wireless vehicle-to-vehicle communication, see [4] 
for an overview. Applications of vehicular communi¬ 
cation technology include wireless local danger warn¬ 
ing [5, 6], distributed sensor fusion [7-9], and 
improvement of traffic efficiency (e.g., platooning 
[10-12], intersection control [13, 14]). However, 
this entry focuses on the safety applications of 
recognizing dangerous situations and avoiding or mit¬ 
igating collisions by automated cooperative maneuvers. 

The benefits of cooperative behavior compared to 
selfish actions of each driver have been shown for other 
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Architecture of a cooperative collision warning and avoidance system 
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problems [15], and it is worthwhile to exploit the 
potential of cooperation also for the improvement of 
traffic safety. 

Cooperative Groups 

Before methods for cooperative collision warning and 
avoidance systems can be designed, the set of vehicles 
which can cooperate reasonably has to be determined. A 
predefined, but not static, group structure enables an 
efficient and unambiguous decision making in case of a 
dangerous situation. Limiting the size of cooperative 
groups is important because cooperative reasoning entails 
significant computation and communication complexity. 

A suitable distance measure for group formation 
has to take into account the road topology [16]. Based 
on this notion of spatiotemporal distance, a criterion 
for the distributed formation of cooperative groups is 
developed [17]. 

These concepts will be presented in detail, and 
related work will be reviewed. 

Motivation and Definition 

In dangerous situations, fast and unambiguous cooper¬ 
ative decision making is essential. Therefore, it is bene¬ 
ficial to know the possible cooperation partners in 
advance and to establish a decision structure. This is 
achieved by the concept of dynamic cooperative groups. 
A cooperative group consists of cognitive vehicles which 
might encounter a dangerous situation requiring coop¬ 
erative actions in the near future. Vehicles are 
partitioned into groups, i.e., each vehicle belongs to 
exactly one group at any point in time. The set of 


possible cooperation partners is restricted to the vehicles 
within the group. Each group has a designated coordi¬ 
nator vehicle which handles the necessary communica¬ 
tion (see section “Distributed Negotiation Protocol” 
below). A cooperative decision may only be selected 
and communicated by the coordinator. Otherwise, 
ambiguous decisions might arise if two or more vehi¬ 
cles recognize a dangerous situation at the same time. 

Computation and communication efforts increase 
rapidly with the number of vehicles involved. To enable 
real-time cooperative reasoning, cooperative groups are 
limited in size. This requires a reliable grouping crite¬ 
rion which assigns all vehicles within one dangerous 
traffic situation to the same group. This cannot always 
be achieved with a limited group size, but the results will 
never become worse than without cooperation. 

Distance 

Intuitively, it is clear that vehicles “near” to each other 
should be in one group. However, it is not so clear what 
a suitable notion of distance for vehicles within a road 
network should be. For example, cars driving in differ¬ 
ent directions on a dual carriageway are not able to 
perform cooperative maneuvers when they are sepa¬ 
rated by a guard rail or concrete barriers. This means 
that sometimes cars which are very near to each other 
in terms of Euclidean distance should nevertheless 
belong to different cooperative groups (Fig. 2). There¬ 
fore, an appropriate distance measure has to incorpo¬ 
rate knowledge of the road network. 

For navigation applications, it is common to repre¬ 
sent the road network as a graph [18]. For the purpose 
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Even though all six cars in this highway example are close enough to communicate, only the two indicated sets of 
vehicles constitute suitable cooperative groups because of the barrier between the lanes 
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(a) Example of a road network, in which the partition into disjoint vertices is indicated with dotted lines, (b) The graph 
1Z corresponding to (a) 


of defining a distance measure, a graph 7 Z is created by 
partitioning the road area into polygons and identify¬ 
ing each polygon with a vertex v of the graph (see 
Fig. 3). Edges between neighboring polygons are 
inserted according to the allowed driving directions 
and lane change possibilities. The weight w(e) of an 
edge e = (v 1? v 2 ) models the minimal time required for 
driving from vertex to the v 2 , 


w((v 1; v 2 )) 


d 2 (vi, V 2 ) 
veZ max (vi,v 2 )’ 


( 1 ) 


where d 2 (v u v 2 ) is the Euclidean distance between the 
centroids of the polygons associated with v x and v 2 , 
respectively, and ve/ max (v 1 , v 2 ) is the maximal velocity 
possible when driving from v 1 to v 2 , determined by 
speed limits and road curvature. Some simplifying 
assumptions are made in order to obtain a conceptually 
elegant formalism: maximum speeds varying with the 
vehicle type are not considered in favor of a conserva¬ 
tive approximation. Distances within one road polygon 
are neglected. This does not cause any problems if the 
discretization is fine enough. 

The shortest-path distance p^(vi, v 2 ) between arbi¬ 
trary vertices v 1? v 2 of 1Z corresponds to the minimal 
time required for driving from v x to v 2 . However, this 
does not exactly capture the distance between two cars q, 
c 2 positioned at V\ and v 2 , respectively, as the cars could 
drive toward each other and meet much earlier at 
a vertex v m in the middle of V\ and v 2 (see Fig. 4). In 
this case, a time span of max{p^(vi, v m ),p^(v 2 , v m )} 
elapses until both cars have reached the meeting point 
v m . Using this notion of virtual meeting points, the 
distance between two vehicles q, c 2 can be defined as 
follows: 


•--K> 


9 < - 


*•— 

“ v m 


O c i 
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The cars q and c 2 can meet each other at vertex v m 


d(c u c 2 ) := minmax{pft(v(c 1 ), v m ),pjj(v(c 2 ), v m )} 

( 2 ) 

This distance corresponds to the minimal time 
within which both vehicles can reach a common meet¬ 
ing point v m (see Fig. 5). It thus incorporates both 
spatial and temporal aspects. The function v(c) deter¬ 
mines the unique vertex v of 7Z so that the polygon 
associated to v contains the geometric centroid of the 
vehicle c. The vehicles’ positions of course depend on 
time; however, this is not expressed explicitly here for 
the sake of notational brevity. Obviously, d(•,•) is sym¬ 
metric and nonnegative. 

The distance measure d can be computed efficiently 
by an appropriate modification of Dijkstra’s shortest- 
path algorithm [19], Chap. 24.3. Search frontiers are 
expanded simultaneously starting from both v(q) and 
v(c 2 ). In each step, the vertex v having the minimal 
value of min{p^(v(q),v),p^(v(c 2 ),v)} among the 
vertices not yet visited by the respective search tree is 
chosen for processing. Once the two search trees reach 
a common vertex, this must be the nearest meeting 
point v m , and the distance can be computed as 
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The computational complexity of the algorithm is 
in 0(n log n), where n is the number of explored edges. 
This number can be restricted in two ways. First, 
a graph 1Z is chosen that is restricted to the surround¬ 
ing of the vehicle (electronic horizon method). Second, 
the algorithm can be stopped when the searched dis¬ 
tance exceeds a threshold beyond which no suitable 
cooperation is possible. Altogether, the real-time com¬ 
putation of the distance measure is possible without 
any difficulty using the described algorithm. 


c 


Objective Function 
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Situations, Recognition of. Figure 5 

Example of a situation in which the path length p n is not a 
suitable distance. Assuming unit edge weights, we have p n 
(ci,c 2 ) = 11 and p^(c 2 , Ci) = 10, while both cars can meet 
each other at v m within three time steps: pnfa , v m ) = 3, 
Pn(c 2l v m ) = 2. Accordingly, d[c v c 2 ) = d(c 2 , cd = 3 

d(c u c 2 ) = max{p K (v(c 1 ), v m ), p n (v(c 2 ), v m )}. The 
pseudocode of the method is shown in Algorithm 1. 



Based on the distance measure d, an objective function 
rating a cooperative group Q = {q, c 2 ,..., c m } of m 
cognitive vehicles can be defined. Several terms have been 
designed in order to capture certain criteria for the for¬ 
mation of cooperative groups. Besides distance, these are 
group size, relative velocity, and temporal consistency. 

The minimum spanning tree has proven to be 
a good indicator for the inter-vehicle distances within 
a group. A group can be represented as a complete 
graph with vertices q and edge weights d(c b Cj). The 
weights of the edges in the minimum spanning tree 
MST(Q ) of the group graph Q are summed up to yield 
the distance term s D of the objective function: 

Sd(G)-=—^—t V d(ci,Cj) (m> 1 ) 

( 3 ) 

The prefactor accounts for the fact that a graph 
with m vertices has a minimum spanning tree 
consisting of m — 1 edges. 

As mentioned above, cooperative groups should be 
limited to a reasonable number of vehicles. On the other 
hand, groups should not be too small either because this 
would narrow the possibilities of cooperation. Espe¬ 
cially, groups consisting of a single vehicle have to be 
penalized because the distance function cannot be 
defined for these groups. Accordingly, a desired group 
size of m 0 vehicles is chosen, and the term s s of the 
objective function rates the deviation from this value: 

ss(Q) := (m - m 0 ) 2 (4) 

These two terms already enable the formation of 
reasonable cooperative groups. However, two addi¬ 
tional terms will be defined in order to select among 
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Motivation for the velocity term, (a) Vehicles approaching 
each other are more likely to encounter a common 
dangerous situation, (b) Cooperative groups of vehicles 
driving away from each other are less desirable 


alternative group structures having almost equal dis¬ 
tance and size values. Obviously, a dangerous situation 
is more likely to occur within a cooperative group if its 
vehicles are driving toward each other, i.e., if the dis¬ 
tances are decreasing. By contrast, vehicles moving 
away from each other are less desirable to be assigned 
to the same group (see Fig. 6). These criteria are cap¬ 
tured by the change rate of the distance measure d. The 
relative velocity term s v of the objective function can 
therefore be formulated using the derivative of d with 
respect to time: 

m i ft 

—£ 5 ^,,, c,) (m> 1) (5) 

1=1 

From the perspective of the cooperative group, s v 
can be interpreted as the rate of contraction or expan¬ 
sion, respectively, of the group. Contracting groups are 
more desirable and hence have a lower (possibly nega¬ 
tive) value of s v . 

Finally, a temporal consistency term s T is defined in 
order to avoid frequent changes of the group structure, 
especially oscillations of vehicles between two neigh¬ 
boring groups. Such oscillations could be caused by 
small fluctuations of the distances, or by discretization 
artifacts. A countermeasure is to improve (i.e., 
decrease) the objective value of a vehicle that has just 
joined a new group, 

Y'' f U if 0 < ti < t T , . 

\ tj if U = 0 or ti > tj 5 - 


where t { is the time elapsed since vehicle c { has joined 
group Q, and t T is a threshold value. The temporal 
consistency term penalizes vehicles changing groups 
(ti= 0) but rewards newly established groups (0 < t z - 
< t T ) in order to increase their stability. 

The different terms introduced above can be com¬ 
bined in a weighted sum to yield the objective function 


5(0)*- 


max{0 ,A,dSd(<7) + A,sSs(£ 7) +A,ySy(^) + A,tSt(£/)} m> 1 
^s s s(Q) + ^tSt(£/) m= 1 


(7) 


rating a cooperative group Q of m vehicles, where 
X k > 0 are weighting factors. The objective function 
5 can be used to compare an existing cooperative group 
with hypothetical alternative groups. 


Computing Optimal Partitions 

The objective function s(Q) can be extended to a set 
P = {^i,..., Gk} of mutually disjoint cooperative 
groups as follows: 

s(P):=]Ts(0) (8) 

QeV 

Given a set C = {q,..., c n ] of cognitive vehicles, the 
optimal group structure P* according to the objective 
function is obtained as 


P* = argmins(P), (9) 

PeV{c ) 

where V{C) is the set of all partitions of C. The exact 
minimization of (9) is possible using, e.g., depth-first 
branch and bound search. However, the set of parti¬ 
tions grows rapidly as the number of vehicles increases. 
To enable real-time computation, strict optimality is 
abandoned. A suboptimal anytime algorithm is 
designed starting from the greedy clustering method. 
The greedy algorithm is initialized with each vehicle 
being in its own single-vehicle group. In each stage of 
the algorithm, every possible merging of two groups is 
evaluated by the objective function. The merging hav¬ 
ing the largest improvement is carried out before the 
next stage starts. The greedy algorithm halts if no 
merging remains which yields an improvement of the 
total objective value. 

An anytime algorithm is obtained by inserting every 
evaluated merging step into a priority queue and 
processing the merging with the totally maximum 
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improvement in each stage as long as execution time 
is left (Algorithm 2). Experiments have shown that 
the group structure resulting from this algorithm 
is identical to the optimal partition with a probability 
of more than 99%, and that the increase in the 
objective function is less than 1% in the remaining 
cases. 



Distributed Negotiation Protocol 


Informally, this means that there does not exist any 
group Q in between Gi and Q 2 . The resulting set of 
vehicles from neighboring groups is 

c(g 0 )-.= g 0 u y g, (n) 

N{Go,Gj) 

The optimal group structure for the local neighbor¬ 
hood of Go is obtained by evaluating (9) for C(Go)- 
The coordinator of group Go can perform this compu¬ 
tation approximately using the anytime algorithm 
presented above. 

With this notion of neighborhood, a negotiation 
protocol can be designed as follows (see Fig. 7). Each 
vehicle periodically sends its position information to its 
group coordinator. Group coordinators exchange their 
information with neighboring coordinators and com¬ 
pute the optimal partition. If it differs from the current 
group assignment, a change of group structure is initi¬ 
ated. The change of group assignment is performed 
after the involved coordinators have agreed. 

Whenever possible, a change of the group coordi¬ 
nator is avoided in order to minimize communication 
and initialization costs. However, it is sometimes 
inevitable to choose a new coordinator, e.g., when 
the previous coordinator has left the group, or 
when a new group is established. In these cases, 
group coordinators may be selected arbitrary, e.g., 
based on the identification numbers of the vehicles, 
or according to certain criteria. For example, it may 
be preferable to choose the central vehicle of the group, 
given by 


c 


To realize the group formation in a distributed vehicu¬ 
lar environment, the cognitive vehicles have to negoti¬ 
ate the group structure without any centralized 
instance. This implies that a mechanism is required 
which performs reliably with incomplete and local 
knowledge. The negotiation method chosen for group 
formation is based on coordinator vehicles communi¬ 
cating with neighboring groups. 

The notion of neighborhood can be formalized 
by means of the distance measure d introduced 
above. Two groups G\ and Gi are neighbored, 

N(GuGi), if 

\/G Vc E G 3 Ci G Gi 3c 2 E Gi 

max{d(c, a), d(c , c 2 )} > d(q, c 2 ). 


argmin V ( ^ D d(c, c') + ^v ^ d(c, c') j. (12) 

ceG C ’eg \ Ot J 

Results 

The described group formation method has been 
implemented and integrated with a traffic simulator 
[20]. Various highway, country road, and inner city 
scenarios have been simulated successfully with real¬ 
time performance. Figure 8 shows an example of the 
results. 

The approach has been evaluated with regard to the 
application of cooperative collision avoidance. The 
traffic simulator has been modified in order to provoke 
accidents. In all considered scenarios, the vehicles 
involved in the accidents have been assigned to the 











2470 


c 


Cooperative Group of Vehicles and Dangerous Situations, Recognition of 


carO:Vehicle 


carl :Vehicle 


car2:Vehicle 


car3:Vehicle 


car4:Vehicle 


I I 

I Vehicle position I 


o 

I 


I 

Vehicle position 


Vehicle position 


Group announcement 


-o 


n 



P 1 

louncement 

Group anr 


-rzH 

i - 

1 

Group position 

*■ i 1 

i 

i 

Group assignment 

Assignment response 

Group change 

0 

1 

1 

} position 

Vehicle 


I 


-u 

! 

Group announcement 


I 

I 

Group change 


-□ 

I 


Vehicle position 


I 


I 


I 


Cooperative Group of Vehicles and Dangerous Situations, Recognition of. Figure 7 

Sequence diagram of the proposed negotiation protocol. At the beginning of this example, Q-\ = {carO, carl} and 
Q 2 = {car2, car3, car4} with carO, car2 being the group coordinators. Then carO initiates a new group assignment that is 
agreed by car2, resulting in = {carO, carl, car2} and Q 2 = {car3, car4} 


same group in more than 90% of the cases, implying 
that the accidents can potentially be prevented by coop¬ 
erative maneuvers. 

Related Work 

Related vehicle grouping approaches can be classified 
according to the purpose of the groups. Often, groups 
are used to structure the communication network 
[21-26]. However, communication groups have to be 
distinguished from application-specific groups [27]. 
The method described above is designed to be applied 
to cooperative collision avoidance. Other conceivable 
applications include cooperative local danger warning, 
cooperative perception, intersection management, and 


platooning. Different, but possibly similar group forma¬ 
tion criteria may be needed for these applications. For 
example, warning messages may be propagated by 
oncoming traffic on a dual carriageway, while cooperative 
maneuvering is not possible across the central barrier. 

Many researchers neglect the issue of grouping and 
assume that all vehicles within communication range 
can cooperate [28]. Thus, the simplest approach is to 
consider groups identical to the connected components 
of the communication network [29]. Formation of 
groups based on a threshold for the Euclidean distance 
between vehicles has been evaluated in robotics [30], in 
road traffic applications [31], and in air traffic control 
[32, 33]. A more elaborated method incorporating 
road topology information is described in [34]. 
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Cooperative Group of Vehicles and Dangerous 
Situations, Recognition of. Figure 8 

Example of resulting cooperative groups. Cars belonging to 
one group are depicted in the same color. Note that 
vehicles physically separated - by the central barrier of the 
highway or by the bridge - are assigned to different groups, 
while the vehicles approaching the intersection form one 
group, shown in green 


For traffic efficiency applications, groups consisting 
of all vehicles within a fixed area have been considered, 
e.g., all vehicles near to an intersection [13, 35, 36] or 
on a specific highway section [37-39]. Regarding colli¬ 
sion avoidance, it is also possible to group all vehicles 
whose probability of mutual collision exceeds 
a predefined threshold [40-42]. However, this requires 
to compute the collision probabilities for all vehicles 
within communication range. 

Platoons consist of vehicles having common goals 
[10, 43]. Therefore, criteria evaluating destinations 
and, e.g., desired speed of vehicles are appropriate for 
platoon formation [44, 45]. By contrast, cooperative 
groups encompass possibly conflicting vehicles, inde¬ 
pendent of their destinations. 

Recognition of Dangerous Situations 

An important target for a cooperative group (CG) 
Q — {ci,..., c m } of vehicles q is the automatic detec¬ 
tion of short-term dangerous situations [46]. Short¬ 
term means within a forecasting horizon At that is 
shorter than the reaction time of human drivers. Such 
situations require an automatic reaction of the 


cooperative group as they cannot be handled safely by 
the drivers themselves. A typical value for the short¬ 
term prediction horizon At is about 1 s for moderate 
speeds of 20 m/s in curves. 

With this objective, the likely path of each vehicle q 
in the time interval [t 0 , t 0 + At] starting at the current 
time t 0 is predicted with an appropriate dynamic 
model. Then, for each vehicle pair {q, c ; }, i ^ j the 
progression of its mutual Euclidean distance e^{ t) is 
computed for t £ [t 0 , t 0 + At]. Here the Euclidean 
distance between two cars is the smallest distance 
11 pi — pj\ I between points pi and pj of vehicles q and Cj 
at the given time t, respectively. Thus this distance 
concept differs from the definition of distance d for 
the formation of cooperative groups. The distance e 
depends on the vehicle geometry and will be further 
refined below to take the uncertainty of the predicted 
vehicle position into account. 

If at least one of the e^t) falls below a distance 
threshold 8 with t E [t 0 , t 0 + At], then a dangerous 
situation for the CG Q arises, i.e., when 

min r / 1 ^ 

tg[0,A t] {^ + ')})<^ Danger 

(13) 

Figure 9 illustrates the detection strategy within the 
time distance plane. 


mm 

(H, Cj e G 


Vehicle Model and State Description 


For an overview of vehicle models, associated sensors, 
and state equations, see for example, the results of Tan 
and Huang in [47], where preprocessing of the GPS 
signals and measurement of yaw are also discussed. The 
authors propose a system design for a vehicle to vehicle 
cooperative warning system in [48]. The vehicle state 
model and equations for vehicle tracking are as follows: 


x(t) = [x(t) y{t) (t>(t) C0b(t) ] T 


x(f) = 


v(t) COS (j)(t) 

v(t) sin 
CO z (t) - co b (t) 
W\vb ( t) 


(14) 


(15) 


where x and y are the east and north distances in the 
earth-fixed coordinate frame respectively, (j) is the vehi¬ 
cle heading angle (measured positive anticlockwise 


c 
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Cooperative Group of Vehicles and Dangerous 
Situations, Recognition of. Figure 9 

(a) Predicted mutual distances within a forecast horizon of 
Af are above a critical level e. (b) Distances too small and 
time too short to manually master the situation => danger, 
(c) Enough time ahead for the drivers to manage the 
situation safely 


Cooperative Group of Vehicles and Dangerous 
Situations, Recognition of. Table 1 Vehicle parameters 
with values as used in the example 


Length 

Ri 

5.0 m 

Width 

R w 

1.8 m 

Distance of rear axle from back 

R d 

1.0 m 


These parameters depend on the vehicle type. The 
rectangle at vehicle position (0, 0) and heading angle 
0 is called the Reference Rectangle of the vehicle and has 
vertices ri = (-Rj, 0.5R W ) T , r 2 = (Rj - R d ,0.5R w ) T , 
r 3 = (Rj - Rd, —0.5R w ) T and r 4 = (~R d , -0.5R W ) T . 

Assuming that the current time is t 0 , the vehicle 
paths are to be predicted for a time interval [t 0 ,t 0 + 
At]. It is assumed that there are no changes of the driver 
actions in this interval and that vehicle paths are inde¬ 
pendent of each other. This assumption can be justified 
by taking a prediction horizon shorter than the time 
necessary for a specific human reaction. The path of 
a given vehicle is described with the equations 


from the x axis), v is the longitudinal velocity of the 
vehicle, co z is the vehicle’s yaw rate, co b is the bias in 
the yaw rate measurement, and n wb is a Gaussian 
white noise. 

The nonlinear differential equation (15) is solved 
approximately using an Extended Kalman Filter (EKF 
[49]) to track the vehicle position. The input values to 
the EKF are v from a wheel speed sensor, co z ( t) from 
a yaw gyro and the Gaussian noise n wb . The observation 
vector [x(t) y(t) </>(t)] T comprises preprocessed DGPS 
(Differential Global Positioning System) measure¬ 
ments. The measurement and observation values are 
transferred by the group members to the group coor¬ 
dinator in order to update the information in the Com¬ 
mon Relevant Picture. The group coordinator executes 
the EKF algorithm to track the vehicle positions. 

Path Prediction 

Using the information of the Common Relevant 
Picture, the group coordinator predicts the paths of 
the vehicles in its neighborhood based on their state 
information. 

The reference position of the vehicle is taken to be at 
the center of the rear axle. The vehicle shape is modeled 
as a rectangle with the parameters shown in Table 1. 


x(t) = [x(t) y(t) 0(f) v(f)] T 
v(t) cos 

Y / f N v(f) sin 0(f) 

[> ~ co z (to) - co b (to) 
a x (t 0 ) u d t o) 


(16) 

(17) 


where it is assumed that the unbiased yaw rate co z (t 0 ) — 
co b (to) and the unbiased acceleration a x (t 0 ) — a b (t 0 ) are 
constant in the time interval, x, y, </> and v are as above. 
The lateral velocity is ignored as it is normally small. 
The model does not use any vehicle specific parameters 
in contrast to the more detailed bicycle model, cf. [47]. 

The state-space equations are nonlinear differential 
equations. An EKF can be used to solve them approx¬ 
imately, for example, as in [48]. Better solutions are 
achieved with the Unscented Kalman Filter as described 
in [50] and also applied in [46]. 

The propagation of sigma particles representing 
the initial state distribution is calculated using numer¬ 
ical integration. The path of each sigma particle is 
weighted with an outcome probability. Following 
[50], the sigma particles of the unscented Kalman filter 
may be selected as 


x(f 0 ,0) = x(<o,0) 


(18) 
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x(t 0 , i) = x(fo, o) + Q i for i = 1,..., L (19) 

x( to, i ) = x( to , 0) - Q z _ l for i = L + 1,,.., 21 (20) 

where x is the mean of x, L is the dimension of x, i.e., 4, 
and Qi is the z'th row of the matrix square root (upper 
triangular Cholesky factorization): 

Q=v/(L + ^)P X (21) 

P x is the covariance matrix of x at the current 
time t 0 - The scaling parameter X - a 2 (I + k) — L with 
a = 0.001 and k= 0 . 

Let x(t,i) be the propagation of x(t 0 ,i) through the 


differential equation (17) till time tfor i = 0, ..., 2 L. 
The differential equation can be solved in MATLAB 
with the ode45 function for non-stiff ODEs. 

The mean of the predicted x(t) is the weighted 
mean of the x(f,z): 

x « = E w i m) X (M) 

i =0 


(22) 

with the weights defined as follows: 



W 0 (m) =X/(L + X) 


(23) 

W\ m) = 1/(2 (L + X)) for i = 

2 L 

(24) 


The covariance matrix of x( t) is the weighted sum 

PxW = 52 0 - x MK x ( f > 0 - X M} T 

i=0 

(25) 

with the weights defined as follows (with /? = 2 ): 

Wq ^ = A,/(L + A,) + 1 — (X 2 + ft (26) 

w/ c) = w\ m) for i = 1,..., 2L (27) 

For t G [t 0 , t 0 + At], let A(t) be the point set of the 
likely area occupied by the vehicle at time t given its 
state as described by the mean x(t) and covariance 
matrix P x (t). A(t) is estimated by firstly using the 
covariance ellipse as an estimate of the uncertainty of 
the vehicle reference position: 

E(t) = {v G R 2 : v T Q(f) _1 v < 7 } (28) 

where Q(t )- = P x (t)- for z, j = 1,2 and parameter 
y=l, and secondly by considering the rotational 


uncertainty of the vehicle about its reference position 
(heading angle): 

°<\) — y^Px(^) 33 (29) 

I4 ) = [x(t) 3 - b(7 (j) ,x(t) 3 + (30) 

T(t) =n({R(M :^el h ie{ 1,.• • ,4}}) (31) 

where is the standard deviation of the heading angle, 
parameter = 1 , £(i/q u) is the counterclockwise rota¬ 
tion of vector u by i/z, r z are the vertices of the Reference 
Rectangle of the vehicle, and 7Y is the convex hull set 
operator. 

Finally, we set A( t) as 

A(f) =£(f)© f(0© (x(l) p x(t) 2 ) T (32) 

where the ® denotes the vector (Minkowski) sum of the 
sets. 

Now consider a group of m vehicles q for 
z = 1, ..., m. An obstacle on the road can be modeled 
as a stationary vehicle. The path prediction is performed 
as above for each vehicle separately, assuming that there 
are no dependencies in the interval [t 0 ,t 0 + At]. 

Let t) denote the likely area occupied by q at time 
t for i- 1 , ..., m. 

For the assessment of collision danger, we compute 
the Euclidean distance between the vehicles: 

eij(t) = dist (Ai(t),Aj(t)) (33) 

= min {ll pi - Pi ll : pi e Ai(t),pj G Aj(t)} (34) 

and consider the minimum in the prediction interval 
[t 0 , to + A t]: 

e™ = min{ey(f) : t G [to, to + Af]} (35) 

A dangerous state is detected if e™ < s for at least 
one vehicle pair c iy Cp where s is a distance threshold (in 
the example set to 1 m). 

The geometric computations in (28), (32), and (33) 
may be executed efficiently with packages for polytope 
calculus such as the Multi-Parametric Toolbox (MPT, 
[51]) with the optimization toolbox YALMIP [52]. 
Distances below 5 m as considered here require an 
accurate polytope approximation of the vehicle and 
covariance ellipse. For vehicles whose predicted trajec¬ 
tories are sufficiently distant, say at least 30 m, only the 
positions of the Reference Points need be computed. 
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For intermediate distances above 5 m, the polytopes 
can be replaced by bounding rectangles. 

Example 

Figure 10 shows the scenario under discussion. Two 
vehicles q and c 2 (vehicle sizes as in Table 1) are 
approaching each other in a 45° curve on a country 
road. The center line of the road is composed of 
two clothoids, each of arc length / = 100 m. The cur¬ 
vature k of the center line as a function of the arc 
length from the bottom left is 

, x _ f s/ a 2 for 0 < s < / 

K 5 |(2 l — s)/a 2 for / < s < 2/ 

where a = 2l/y/n. The smallest radius of curvature is 
/car 1 = 4 l/7i ~ 127 m. We assume that each lane of 

the road is 4 m wide and that the vehicles travel in the 

center of their lane. The longitudinal velocity of 
both vehicles is a constant 20 m/s = 72 km/h. At time 
t - 0, q and c 2 are at the top right and bottom left 
corners, respectively. 

In addition, a stationary, broken down vehicle c 3 of 
the same size is located at 110 m from the bottom left in 
the middle of the right lane. So c 2 will hit c 3 in about 5.2 s 
if no action is taken and we, therefore, consider the pre¬ 
diction time interval [3.2 s, 5.2 s] in detail in this example. 

The state transition equations (17) assume that the 
unbiased yaw rate is constant in the time prediction 
interval. This is only an approximation in the curve 


where the yaw rate must change continually to follow 
the curve precisely. An improved prediction must 
either use information about the change of yaw rate 
derived from a presumed vehicle-driver behavior or the 
period without measurement feedback must be 
reduced. This assumption is reasonable if we restrict 
the prediction horizon to be below human reactivity. 

The future trajectories of the vehicles are predicted 
for the next 2 s given the uncertain information on their 
positions, heading angle, and velocity (units: m, radian, 
m/s). The covariance matrix of the state information is 
assumed to be the same for all vehicles: 

diag( 0.04, 0.04,0.001,0.1) (36) 

The standard deviations of the state are, therefore, 
for the x or / position: 0.2 m, heading angle: 0.0316 rad 
and velocity: 0.316 m/s. 

Figure 11 shows the pairwise distances between the 
three vehicles as calculated from (33) for the prediction 
time interval [3.2 s, 5.2 s]. q passes the stationary c 3 
without danger, but the distance between q and c 2 is 
predicted to enter the danger zone at 4.8 s. As shown in 
Fig. 12, this dangerous situation is due to the predicted 
understeering of q and c 2 in the curve. As expected, this 
problem becomes more serious when the velocities or 
curvature increase. In addition, the predicted distance 
between c 2 and c 3 becomes dangerous at 5.1 s. 

The next step for the cooperative group would be to 
calculate and evaluate possible joint reactions and new 
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Initial positions of vehicles c, for / = 1,..3 and the planned trajectories of q and c 2 
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Predicted distances e,y between the vehicles considering the uncertainty of their state. If uncertainty is not considered, 
then min(e 12 ) and min(e 13 ) are both approx. 2.14 m 
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Likely positions of vehicles at 4.8 s with solid rectangles for the actual positions and surrounding polytopes for the 
uncertainty area. Prediction calculated at 3.2 s 
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trajectories. In the example, q and c 2 should move 
closer to the road borders, and c 2 should decrease its 
velocity to avoid both dangers. However, depending on 
the parameter constellations, additional reactions are 
feasible, such as q changing its velocity to allow c 2 to 
overtake c 3 . The development of cooperative maneu¬ 
vers will be explored in more detail in the next section. 

Cooperative Decision Making 

Once a dangerous situation has been detected, the 
system can either issue a warning and leave the conflict 
resolution to the driver or it can intervene autono¬ 
mously by executing a cooperative maneuver. 

Warning Systems 

The following two questions are of particular impor¬ 
tance for the design of collision warning systems: 

1. Which is the criterion to issue a warning? Typically, 
thresholds in terms of collision probability or 
time to collision are defined [28, 53, 54]. It is 
important to minimize false alarms because drivers 
tend to ignore warnings if there are frequent false 
alarms. 

2. How is the warning presented to the driver? Possi¬ 
bilities include optic, acoustic, and haptic warnings, 
or combinations of these [55-57] . Ideally, the warn¬ 
ing should induce the driver to perform an ade¬ 
quate collision avoidance maneuver. For example, 
vibrations of the steering wheel could indicate 
a necessary steering maneuver to the driver. This 
technique is used in lane departure warning systems 
[58, 59]. 

Monitoring driver behavior is also of interest. For 
example, a driver who has already initiated a collision 
avoidance maneuver should not be distracted by an 
alarm [53, 60]. Some of the systems specifically address 
certain traffic scenarios, e.g., intersections, left-turning, 
lane changes, or rear-end collisions [61, 62]. 

Cooperative Driving 

Various approaches to realize cooperative driving are 
conceivable, many of them tailored to specific traffic 
situations or types of accidents. For example, 
a cooperative variant of adaptive cruise control has 


been developed, which increases the foresight horizon 
in order to reduce the amount of necessary accelera¬ 
tions and decelerations [11, 63]. A similar system is an 
emergency braking assistant preventing rear-end colli¬ 
sions based on cooperative information exchange [64]. 
Both systems only affect the longitudinal control of 
the vehicle. Platooning systems additionally address the 
task of lane keeping and the execution of some maneu¬ 
vers like merging into a platoon [10, 65]. Lane assign¬ 
ment for platoons with the aim of optimizing traffic 
flow is another topic of research [12]. Collision avoid¬ 
ance with obstacles or vehicles outside the platoon is 
usually not covered by these approaches. A method 
specialized on merging maneuvers is described in [38]. 

Cooperative intersection control systems can have 
two different goals: the focus is either on collision 
warning, as described in the previous section, or 
on optimizing traffic flow. Systems of the latter 
type usually assume fixed paths of the vehicles and 
optimize order of precedence and velocity profiles 
[13, 14, 66], see also section “Decoupled Planning” 
below. 

Cooperative control approaches keep formations of 
aerial, underwater, or land vehicles, usually in off-road 
exploration scenarios [67]. Few of these works address 
collision avoidance among cooperative vehicles [68]. 

Other systems specialize on obstacle avoidance from 
a single-vehicle perspective. Typically, prespecified 
evasive trajectories are parameterized to fit the encoun¬ 
tered situation [69, 70]. A system that can handle more 
general situations, such as multiple obstacles, has to use 
motion planning techniques [71]. Another approach 
applies differential game theory to avoid the collision of 
a single vehicle with a driver going the wrong way [72] . 

Altogether, motion planning techniques seem to be 
the most general approach to cooperative driving in 
dangerous situations as they can handle arbitrary 
obstacles and road geometries. Therefore, motion plan¬ 
ning is covered in more detail in the following section. 

Multi-vehicle Motion Planning 

In this section, methods are described which compute 
a cooperative motion plan for given initial positions of 
vehicles and obstacles. The plan must be feasible with 
regard to the kinematic and dynamic capabilities of the 
involved vehicles. Therefore, a level of abstraction 
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appropriate for cooperative motion planning has to 
represent vehicle dynamics and geometry. Once 
a feasible cooperative plan has been computed by the 
group coordinator, it can be communicated to the 
vehicles of the cooperative group. Within the tolerances 
of the cooperative plan, each vehicle can perform a finer 
planning of its own trajectory, e.g., optimizing comfort 
criteria, before executing the maneuver (cf. Fig. 1). 

This section starts with a formalization of the motion 
planning problem, especially introducing the concept 
of configuration spaces. Afterward, some motion¬ 
planning algorithms are described. The methods can be 
distinguished by the criterion if they do or do not make 
decoupling assumptions in order to reduce the compu¬ 
tational complexity. 

Configuration Space Path planning and motion 
planning originate from robotics, and car-like vehicles 
may be regarded to be specializations of robots. An 
important concept from robot motion planning is the 
configuration space. 

The configuration of a robot is a parametric speci¬ 
fication of the geometric position of every point of the 
robot [73]. The manifold of all possible configurations 
is called the configuration space Q of the robot [74, 75] . 
A path is a parametric curve in the configuration space, 
i.e., a continuous mapping [0,1] —> Q. 

The relevant configuration parameters of a car-like 
vehicle q are its position x b in the road plane and its 
orientation (yaw angle) </>*. It is common to assume 
a sufficiently planar road so that vertical position, roll, 
and pitch angle can be neglected. This results in the 
three-dimensional configuration space Qi = R 2 x S, 
where § is the unit circle parameterized by [0,2n). 
Note that the angles 0 and 2n describe the same 
orientation, which causes Qi to be a cylindrical mani¬ 
fold [75]. 

The motions of a car are restricted by the orienta¬ 
tion of its wheels, it cannot move in arbitrary direc¬ 
tions. This is expressed by the following constraint on 
incremental position changes dx b d y b 

dxi sin (j) i = dyi cos </> z - (37) 

Despite this constraint, all configurations in Qi 
can be reached if no obstacles are present. Such con¬ 
straints which do not reduce the dimensionality of the 
configuration space are called nonholonomic [75]. 


The nonholomic properties of a car are of special 
importance when parking maneuvers are considered. 
It is possible to plan a path without considering the 
nonholonomic constraint and to subsequently trans¬ 
form it to enforce conformity with the constraint [76]. 

The motion of a car is not only restricted by its 
nonholonomic kinematics, but also by its dynamics. 
Instantaneous accelerations or decelerations are 
impossible due to inertia. A state space model can be 
used to describe the vehicle kinematics and dynamics. 
A simple state space model for a car-like vehicle is given 
as follows, 
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where v z denotes the longitudinal velocity, a z the accel¬ 
eration, oci the steering angle, and Z* the wheelbase of the 
car (cf. Fig. 13). The first summand on the right-hand 
side of the equation models the state evolution, while 
the second one characterizes the actuator behavior. 
Acceleration and steering angle are the control inputs 
of the system. They are bounded by the capabilities of 
the vehicle, which depend on brakes, engine power, 
steering geometry, tires, and the friction of the road 
surface. A simplified model for the dynamic limits is 
Kamm’s circle stating that longitudinal and lateral 
acceleration a iylon and a b i at are limited by 

V^lon + ^lat < Pig, (39) 
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Notation used in the vehicle model (38) 
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where g « 9.8 m/s 2 is Earth’s gravitational acceleration 
and (ii w 1 is the road-tire friction coefficient [71]. 
Additional bounds may be imposed on acceleration 

and Steering angle, c.g., ion,min — ^i,lon — ^i. Ion,max 

and lajl < a z ; max . 

When vehicle dynamics have to be considered, it is 
usually necessary to solve a motion planning problem, 

i.e., to compute a continuous mapping [0, t max ] —> Q x . 
This mapping is sometimes called a trajectory. 
Unlike a path, it is explicitly parameterized by time. 
Methods for vehicle motion planning include 
sampling-based approaches [75], especially rapidly 
exploring random trees [77, 78], and elastic-band 
planning [79]. Vehicles with different configuration 
spaces, e.g., trucks pulling trailers, have also been con¬ 
sidered [76]. 

Obstacles divide the configuration space into free 
space and obstacle regions. A configuration q- 6 Qi 
belongs to free space if every point of the robot in 
configuration q z is collision-free with every obstacle. 
For collision checking, both robot and obstacle 
geometry have to be considered. The explicit construc¬ 
tion of the free space is avoided by most practical 
planning algorithms due to its algorithmic 
complexity [75]. 

Path planning for multiple vehicles can be 
performed in the composite configuration space, 
which is the Cartesian product of the individual con¬ 
figuration spaces [80]: Q = Q\ x ... x Q m . The pos¬ 
sibility of motion planning in the composite 
configuration space is well known from robotics text¬ 
books [74, 75]. However, the computational complex¬ 
ity of motion planning algorithms usually increases 
exponentially with the dimensionality of the configu¬ 
ration space [80]. If each individual configuration 
space Q x is of dimension d;, the composite configura¬ 
tion space has dimension d = i Therefore, 
cooperative motion planning in the composite config¬ 
uration space has rarely been used in practice. See [81, 
82] for some robotics examples. More common are 
decoupling approaches which make simplifying 
assumptions in order to lower the computational com¬ 
plexity. However, these assumptions also restrict the 
space of solutions, which might be particularly critical 
in dangerous situations. Therefore, both decoupling 
and cooperative planning methods are presented in 
the following sections. 


Altogether, the motion planning task for m vehicles 
can be formalized as follows: compute a continuous 
trajectory [0, t max ] —> Q = 2i x ... x Q m which 

1. Starts at the initial states of the vehicles 

2. Avoids collisions among the m cooperative vehicles, 
considering their geometric models 

3. Avoids collisions with both static and moving 
obstacles 

4. Keeps the vehicles on the road area 

5. Respects the kinematic and dynamic constraints of 
the vehicles 

Additionally, a goal region is specified usually. This 
has been omitted here because collision avoidance is 
the main objective in dangerous situations. 

When moving obstacles are considered, the obsta¬ 
cle-free regions of the configuration space become 
time-dependent. This can be incorporated by adding 
a time axis, resulting in the configuration space-time 
[75, 80]. The trajectories of moving obstacles may be 
derived from prediction methods as described above in 
section “Path Prediction.” The road constraint may be 
dealt with by declaring all off-road areas as obstacles. 

The resulting trajectory in composite configuration 
space is communicated to the vehicles of the coopera¬ 
tive group (cf. Fig. 1). Before execution, each vehicle 
can perform a refinement of its individual trajectory 
within the tolerances of the cooperative plan according 
to more detailed models and specific criteria. In fact, 
the main purpose of cooperative planning is not to find 
an optimal trajectory, but to construct and select 
a homotopy class of solutions in the composite config¬ 
uration space [75] . Optimizations within the tolerances 
of the correct homotopy class, such as smoothing, can 
be performed by the individual vehicles. 

Decoupled Planning Decoupling methods reduce 
the computational complexity of multi-vehicle motion 
planning by considering lower-dimensional subspaces 
of the composite configuration space. Usually, 
a subspace corresponding to a single vehicle is chosen. 

The path-velocity decomposition is a well-known 
method which decouples the composite problem into 
m path planning problems and a velocity coordination 
problem [83]. It uses path planning algorithms to find 
individual paths in Q x for each vehicle. Each path 
circumvents the static obstacles but ignores the 
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existence of the other vehicles. In a second stage of the 
algorithm, the relative velocities along the individual 
paths are coordinated in order to avoid inter-vehicle 
collisions [84]. This corresponds to a one-dimensional 
reparameterization of each path. It is a path planning 
problem in the m -dimensional coordination space. 
Alternatively, paths may be coordinated incrementally, 
resulting in m — 1 two-dimensional coordination 
problems. In the last stage of the algorithm, the time 
scaling of the path is performed to obtain a motion 
respecting the dynamic constraints. It is also possible to 
perform the time scaling during the coordination stage, 
which requires increasing the dimensionality of the 
coordination space by one. 

The path-velocity decomposition is an adequate 
approximation in many robotic scenarios [85-87] and 
has also been applied to road traffic problems [40]. It 
shows certain parallels to resource allocation methods 
which consider regions such as intersections to be 
mutual exclusive resources and apply scheduling mech¬ 
anisms to resolve conflicts [66, 88]. Other methods 
weaken the restriction to a single path and allow the 
coordination procedure to select a path within 
a probabilistic roadmap [89, 90]. 

The second approach to decouple multi-robot 
motion planning assigns priorities to the robots 
[91-95]. The robots are processed in the order of pri¬ 
orities, which means that lower-priority robots already 
know the planning results of the higher-priority ones. 
The zth robot plans a motion in its configuration space- 
time which considers the planned motions of the 
higher-priority robots 1, ..., i — 1 as moving 
obstacles but completely ignores the lower-priority 


robots i + 1, ..., m. This replaces the d -dimensional 
planning problem in the composite configuration 
space with m planning steps in configuration space- 
times of dimension + 1. Of course, the results depend 
on the selected order of priority. Therefore, methods 
for constructing somewhat optimal priorities are of 
interest [96, 97]. However, prioritized planning may 
be suboptimal even with the best priorities. The 
approach has been applied to manipulators, mobile 
robots, and vehicle-parking scenarios. 

In dangerous situations involving multiple vehicles, 
decoupled planners may fail to find an existing colli¬ 
sion-free solution. This can already happen in very 
simple scenarios (Fig. 14). Additionally, it is difficult 
to consider vehicle dynamics in the path-velocity 
decomposition and in the roadmap coordination 
approach. Therefore, three recent methods for cooper¬ 
ative planning in the composite configuration space are 
presented in the following sections (Fig. 15). 

Tree Search The current action of a vehicle can be 
characterized by its acceleration and its steering angle, 
see (38). The continuum of actions can be discretized by 
choosing a finite set of representatives. When an action is 
applied for a certain interval of time At before the next 
action is selected, a tree structure of decisions results. 
The discrete action set should contain extreme actions 
like maximum braking and maximum steering admis¬ 
sible by the dynamics of the vehicle, given by (39). These 
extreme actions are known to be optimal for a single 
vehicle evading an obstacle [69, 71]. In the automobile 
domain, time discretization is justified because actuator 
limits prohibit a quick change of actions. 


c 
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Comparison of cooperative and decoupled motion planning, (a) Oncoming traffic scenario, (b) path-velocity decoupling, 
(c) prioritized motion planning, (d) cooperative motion planning in the composite configuration space 
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Space-time visualization of a cooperative merging 
maneuver. In this 3D illustration, x and y directions of the 
road plane are depicted together with the time axis 
pointing upward vertically. Motion sequences of three 
vehicles can be seen, and a static obstacle appears as 
a pillar 
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Tree T of cooperative actions for m = 2 vehicles 


Each node n* of the tree T* corresponds to a state 
x* = ( x ;,yi,0;,v z ) T of the z'th vehicle. Applying an action 
actj = ( Uj , a i) T results in a successor state which can be 
computed by integrating (38) (cf. [80]): 

V;(f + At) = Vi(t) + dj(t)At 

+ a t) = + tan “’^ (y,-(f)+^a,- 0 )Af) ■ At 

H 

Xi(t) + (v z (f) +iflf(t)At) • At ■ cos</>*(t) 
if a*(t) = 0 

*<( f ) + 55^ ( sin< />i( f + Af) - sin 0 i(O) 

if 0Cj(t) 7^ 0 
' y*(t) + (vi(t)+\ai(t)At) • At • sin</> f (t) 

if a z (t) = 0 

y,(f + At) < y i (t)+ l -L( C0S< /, ; (t + Af )- C0S ^ i (t)) 

- ifa,(f)^0 

(40) 

The tree T* consists of T + 1 levels, resulting in 
a planning horizon of t max = T • At. It has Aj leaf 
nodes, where A z denotes the number of actions. With 
each action act* leading to tree node n b a loss L(n*) is 
incurred for the violation of constraints listed in sec¬ 
tion “Configuration Space.” Loss is a notion used in 
utility theory to determine preferences between alter¬ 
native actions [98]. The loss is composed additively of 
a term L roa d(^i) penalizing road departure, L ohst (rii) for 


collisions with obstacles, L cou ( n t ) for collisions among 
cooperative vehicles, and L con troi( w i) rating control 
effort with the aim of avoiding unnecessary braking 
and steering. The optimal action sequence corresponds 
to a path from the root r* to a leaf node n* having 
minimum accumulated loss L*. 

For cooperative motion planning, every action of 
one vehicle can be combined with every action of all 
other vehicles, resulting in a set of A = • ... • A m 

composite actions and a tree T having A r leaf nodes 
(Fig. 16). Even for moderate values of A and T, explicit 
construction of this tree exceeds memory and compu¬ 
tational resources. Therefore, pruning techniques are 
required which allow the optimal solution to be found 
without searching the whole tree. 

An important observation is that many computa¬ 
tions during traversal of T depend only on the action 
sequences of a single vehicle. This holds for vehicle 
states and for some loss components, namely T 0 bst> 
Lcontrob an d ^road- As a consequence, this information 
occurs redundantly in T, for example, at the edges 
highlighted in Fig. 16. It is possible to store these values 
in the single vehicle trees T) and look them up if they 
are needed several times. This can avoid many redun¬ 
dant computations. 

The single vehicle trees can also be precomputed 
completely as they are much smaller than T. This allows 


Xi(t + At) = 
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the establishment of lower bounds for the loss within 
a subtree. A lower bound h{n t ) for a subtree of 
rooted at n*- is obtained by recursively taking the min¬ 
imum values of the children of n* : 

{ 0 if rii is a leaf 

min {L(rii) + h(rij)} otherwise 

n': child of rij 

(41) 

A lower bound h(n ) for a node n of the cooperative 
tree T is given by the sum of the values from the 
corresponding nodes n&ri) of the single-vehicle trees 
Tn 


m 

h(n) := ^ h(rij(n )) (42) 

i=i 

These lower bounds can be used for an efficient 
branch and bound search in T. It explores the tree in 
depth-first order, remembering the best solution found 
so far and its loss value L*. Subtrees are excluded from 
the search if it can be proven that they cannot contain 
any part of the optimal solution. To this end, the sum 
g{n): = L(r,n) + h{n) is computed for the root node n of 
the subtree, where L(r>n) is the loss accumulated on the 
path from the root r of T to n. If g(n) > L*, the total 
loss of any path passing through the subtree must be 
greater than L*. In this case, the subtree can be 
excluded from the search without losing the guarantee 
to find the optimum (Fig. 17). The resulting branch 
and bound search method is shown in Algorithm 3. 
Alternatively, it is possible to search T using the A* 
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Criterion to prune a subtree 


algorithm, which also relies on the lower bounds h(n) 
to improve efficiency [99, 100]. 


Algorithm 3 General branch and bound method for 
cooperative motion planning 


1: procedure Search (f, n, L ) 

2: for all act e A do 

3: Compute child node n c , x(n c ) and L[n c ) 

4: if L + L(n c ) + h(n c ) < L* then 

5: if f + At < f max then 

6: Search (f + Af, n c/ L + L(n c )) 

7: else 

8: L* <- L + L(n c ) 

9: n* <— n c 

10: end if 

11: end if 

12: end for 

13: end procedure 
14: procedure Pi_AN(x start ) 

15: Perform precomputation 

16: /.* <— oc 

17: x(r) <— x start 

1 8: Search (0, r, 0) 

19: returnn* 

20: end procedure 


The only remaining loss component is L co u, which 
does not depend on the full cooperative action 
sequence either but only on the actions of pairs of 
vehicles [90]. Therefore, similar techniques can be 
applied in order to avoid redundant computations 
and to obtain lower bounds. The collision information 
can be stored in two-vehicle trees Tij for every pair of 
vehicles. There is, however, a trade-off between 
precomputation effort and savings during branch and 
bound search, and experiments show that 
precomputation of collision loss does not improve 
overall performance in most cases [101]. 

An important advantage of the tree search formula¬ 
tion is the use of explicit vehicle, loss, and action models, 
which can be chosen in the level of detail appropriate for 
the task under consideration. Precomputation of lower 
bounds significantly improves computational perfor¬ 
mance of the method. By shifting most of the compu¬ 
tational effort into the precomputation stage, running 
times become more predictable, increasing the real-time 
capability of the algorithm [101]. Figures 18 and 19 
show collision avoidance maneuvers planned with the 
tree search method. 
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Example of a cooperative merging maneuver planned by the tree search algorithm. While vehicle 1 changes to the left lane 
due to the obstacle, vehicles 2 and 3 brake in order to allow the merging. The full planning horizon has been 2.5 s 
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A cooperative collision avoidance maneuver in an 
intersection scenario 

Mixed-Integer Linear Programming The linear pro¬ 
gramming framework allows large systems of linear 
inequalities to be optimized efficiently. A linear pro¬ 
gram consists of an objective function, equality and 
inequality constraints, all of them being linear in the 
real-valued variables. An extension is mixed-integer 
linear programming (MILP) which constrains some 
of the variables to be integer, or - as a special case - 
binary. This makes the problem much harder. 

As efficient standard software exists for solving 
MILPs, cooperative motion planning has been formu¬ 
lated in this framework in recent years, with focus on 
the air traffic domain [102-104]. Variables x ih y ik 
denote the position of the z'th vehicle at time t = k- At 
within a planning horizon k = 0, ..., T. Further vari¬ 
ables are introduced for velocities and control inputs, 
with appropriate inequality constraints based on vehi¬ 
cle dynamics. They are coupled by equations describing 


a linearized version of a vehicle model like (38). Binary 
variables b{j k i are required for the collision avoidance 
constraints: 

*ik ~ Xjk + bijkiM > rij 

Yik ~ Yjk + b ijk2 M > r {j 
X-ik T Xjk + bijkiM > Tjj 
—yik + yjk + bijk^M > Tjj ( 43 ) 

4 

Y, bijkl < 3 
1=1 

bijkl C {0? 1} k = 1,..., T 

Therein, r { j is the minimum distance to be enforced 
between the vehicle reference points (x ik ,y ik ) T and (xj h 
y jk ) T at time k- At, and M 0 is a large constant. This 
formulations makes use of a well-known modeling 
technique to transform a disjunction of inequalities 
into a conjunction, which is the standard semantics of 
MILP constraints [105]. Only one of the first four 
inequalities has to be active, while the others can be 
trivially satisfied with byjd = 1. The binary variables thus 
describe on which side the vehicles pass each other. It 
may be necessary to discretize the space of directions 
into more than the four paraxial directions of (43). 
This can be accomplished by additional separating 
half-planes [106]. Obstacle avoidance is modeled in 
the same way, the only difference being that the 
position of the second object is no longer variable. 
Complex objects may be represented by a covering of 
several geometric primitives. 

Paralleling the tree search approach of the previous 
section, MILP solvers often employ branch and bound 
methods. However, in the tree search formulation, the 
branching occurs over the vehicle actions and collisions 
are tested depending on the selected actions, whereas 
in the MILP formulation, the binary decision variables 
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describe collision avoidance constraints and the vehicle 
actions are optimized dependent on the collision avoid¬ 
ance decisions. As a consequence, time discretization is 
tightly coupled with passing distances and cannot be 
coarsened without impairing collision avoidance. By 
contrast, in the tree search method, the number of 
decision points can be reduced without any difficulty 
in order to improve computational efficiency. This may 
be an explanation for the observed running times of the 
MILP approach, which seem to question a real-time 
usage at present [107]. Another drawback of the MILP 
method is the restriction to linear models and objective 
functions. An important advantage is the utilization of an 
established framework with available tool support, which 
facilitates modeling and implementation. Related work 
includes pure linear programming applied to air traffic 
control [108] and nonlinear optimization [109]. 

Elastic Bands Different variants of the elastic-band 
method have been applied to path adaptation for 
mobile robots [110-112] and to path planning for 
cars [79, 113-115]. Recently, the method has been 
extended to cooperative motion planning in dangerous 
situations [116]. 

Once again, nodes x i>k = (x iy]o y iyk ) T describe the 
vehicle positions at discrete points in time. Consecutive 
nodes of one vehicle are connected to form an elastic 
band. Orientation and velocity of the vehicle are 
implicitly represented by the sequence of nodes. 

Virtual forces are designed to represent the con¬ 
straints described above in section “Configuration 
Space.” The resulting elastic band can be interpreted 
as a combination of point masses and springs [114]. 


The virtual forces show some similarities with potential 
field path planners as forces can be interpreted as 
derivatives of potential fields. However, potential field 
planners only have a local view of the environment 
[80], while elastic bands allow collision avoidance 
forces to propagate along the band. Internal forces 
can be designed to represent the dynamic and kine¬ 
matic constraints of the vehicle. 

The total force vector applied to node x iyk is denoted 
by f iyk . It is additively composed of several force terms. 
The internal forces f dlst and f an§ implement the 
dynamic and kinematic constraints of the vehicle in 
longitudinal and lateral direction, respectively. They 
correspond to springs between consecutive nodes of 
one band, x iyk and x iyk + Y . 

External forces repel the band from obstacles, other 
vehicles, and the road boundary. For example, the force 
f coop acts between nodes of different bands having the 
same time index, x iyk and Xj yk . The elastic-band frame¬ 
work uses a circular object model. Other object shapes 
are approximated by a covering of circles [113]. 
A visualization of elastic bands and virtual forces is 
shown in Fig. 20. 

The nodes of an elastic band have to be initialized in 
a meaningful way, e.g., corresponding to a lane-keeping 
behavior of the vehicle. The virtual forces are applied to 
the band iteratively until an equilibrium is reached in 
which the resulting forces vanish for every node: 
Vi V/c f i yk « 0. A continuous state and control trajec¬ 
tory is obtained from the discrete sequence of nodes 
using, e.g., cubic spline interpolation [113]. 

The elastic-band method has been successfully 
applied to obstacle avoidance, merging, overtaking, 


c 
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Visualization of the elastic bands of two oncoming vehicles. Arrows indicate some selected force components 
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Cooperative obstacle avoidance maneuver planned by the 
elastic-band method 


and intersection scenarios. Figure 21 shows a coopera¬ 
tive maneuver planned by the elastic-band method, 
and Fig. 15 shows a space-time visualization of a merging 
scenario. Elastic bands exhibit potential for real-time 
implementation and a good scalability with the number 
of cooperative vehicles, even when a comparatively fine 
time discretization is chosen. However, the method can 
only find a local optimum, unlike the tree search and 
MILP approaches. This means that the result depends on 
the initialization of the elastic bands. 

Future Directions 

Many of the methods described in this entry are still at 
the research stage. They have been verified in simula¬ 
tions or experiments but are not yet ready for market 
deployment. The time to market may be quite long, 
e.g., as shown by platooning technologies that were 
demonstrated back in 1997 [117], but a commercial 
implementation of such systems is still pending. 

A major unsolved problem is how to deal with the 
inevitable uncertainty in the environment information 
caused by sensor-based perception. On the one hand, 
this can be alleviated by improving sensor technology, 
processing algorithms, and multi-sensor fusion. In the 
context of cooperating vehicles, the integration of per¬ 
ceptions made by multiple vehicles is of particular 
interest. Research on appropriate fusion architectures 
is in progress [8, 118]; however, to the best of the 
authors’ knowledge, cooperative perception has 
not yet been implemented in a real-world scenario. 
On the other hand, situation recognition and planning 
algorithms can be designed which take the uncer¬ 
tainties into account. For situation recognition and 
prediction, the methods described in this entry can 
already cope with uncertain information to a certain 
degree. Planning under uncertainty is a very difficult 
problem that is only partially solved. Statistical 


decision theory offers a framework to determine the 
best action under uncertain information [98]. In prin¬ 
ciple, this approach is equivalent to Bayesian decision 
networks, which establish a connection between situa¬ 
tion assessment and decision making [119, 120]. How¬ 
ever, its application to cooperative motion planning 
causes a dramatic increase in computational complex¬ 
ity. For single vehicles and robots, there are several 
approaches to considering model based probabilistic 
uncertainties [121-124]. A related, but different objec¬ 
tive in motion planning is to reduce the uncertainty of 
vehicle localization [125, 126]. Cooperative motion 
planning with uncertain environment information is 
still an unsolved problem. The problem complexity 
may require a hierarchical planning, increasing the 
level of detail step-wise. 

Regarding prediction methods, a comparative eval¬ 
uation of the numerous approaches would be a topic 
for useful future work. [127] describes a prediction 
method that considers the interaction of several vehi¬ 
cles. The evaluation of cooperative maneuvers is 
treated in [128]. 

In the field of automatic maneuvers, a promising 
approach to guarantee the safety of a planned motion is 
the concept of inevitable collision states and the partial 
motion planner [78]. This is a scheme of iterated plan¬ 
ning and execution. The plans selected are guaranteed 
not to induce a state in which a collision becomes 
inevitable. At least a collision-free contingency maneu¬ 
ver remains possible [129]. This principle has been 
extended to cooperative motion planning [130]. How¬ 
ever, it is not directly applicable to dangerous situations 
because the assumption of an initial safe state does not 
hold there. 

The group formation and decision algorithms 
described in this entry rely on a designated group 
coordinator vehicle. Future research should investigate 
to what extent decision making can be distributed. This 
could result in a better reliability and better distribu¬ 
tion of the computational load. Consensus algorithms 
or game theoretic methods might possibly be extended 
to handle distributed decision making in dangerous 
traffic situations [67, 131-133]. 

Regarding wireless vehicular communication, stan¬ 
dardization of protocols and bandwidth allocation are 
important prerequisites [134]. Field tests of wireless 
vehicular communication and cooperative local danger 
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warning are currently under way [135]. The US 
Department of Transportation is soliciting new ideas 
on this subject [136]. In Europe, a cooperative driving 
challenge has been initiated [137]. Several teams want 
to show improved driving capabilities based on coop¬ 
erative information exchange. This could initiate fur¬ 
ther research and development in this area, similar to 
the impulse generated by the DARPA Grand Challenges 
and Urban Challenge for autonomous vehicles [3]. 

As real-world testing of cooperative maneuvers is 
a safety-critical task, suitable testing methodologies 
have to be found. Urmson et al. [138] gives a general 
overview of testing. For single-vehicle driver assistance, 
hardware-in-the-loop simulation and driving tests 
with mock-up obstacles have been applied [139, 140]. 
Cooperative maneuvers could be tested by starting with 
only one real vehicle and several simulated vehicles, but 
including real communication. In order to ensure the 
safety of a system in real use, an enormous number of 
experiments under a wide range of conditions is 
required. Since particular dangerous situations may 
be rare in real traffic, it is difficult to collect sufficient 
data. The rate and handling of false alarms is another 
key issue. A major related difficulty is the development 
of appropriate certification procedures for vehicles and 
their systems [141]. Aspects such as user acceptance 
and legal regulations are still to be addressed [142]. 

These issues will have to be resolved before cooper¬ 
ative technologies may develop their potential to 
increase traffic safety and efficiency. 
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Glossary 

Coral reef A geological structure of significant topo¬ 
graphic relief above the sea floor, made of calcium 
carbonate and constructed by living organisms 
(mainly corals belonging to the order Scleractinia, 
phylum Cnidaria, and coralline algae). 

Hermatypic Refers to corals that are able to build reefs. 
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Holocene A geologic age that corresponds to 
10,000 years before the present, or £C B.P” 

Scleractinia The order of hard or stony corals to which 
the reef builders belong, under the phylum Cnidaria. 
Zooxanthellae Golden brown single-celled algae that 
inhabit the gastrodermal cells of reef-building 
corals (belonging to the phylum Dinophyta, order 
Gymnodiniales). 

Definition of the Subject and Its Importance 

Coral reefs are geological structures of significant 
dimensions, constructed over millions of years by 
calcifying organisms. The present day reef-builders are 
hard corals belonging to the order Scleractinia, phylum 
Cnidaria. The greatest concentrations of coral reefs are 
in the tropics, with highest levels of biodiversity situated 
in reefs of the Indo-West Pacific region. These ecosys¬ 
tems have provided coastal protection and livelihood to 
human populations over the millennia. Human activi¬ 
ties have caused destruction of these habitats, the inten¬ 
sity of which has increased alarmingly since the latter 
decades of the twentieth century. The severity of this 
impact is directly related to exponential growth rates of 
human populations especially in the coastal areas of the 
developing world. However, a more recently recognized 
phenomenon concerns disturbances brought about by 
the changing climate, manifested mainly as rising sea 
surface temperatures, and increasing acidification of 
ocean waters due to greater drawdown of higher con¬ 
centrations of atmospheric carbon dioxide. Manage¬ 
ment efforts have so far not kept pace with the rates of 
degradation, so that the spatial extent of damaged reefs 
and the incidences of localized extinction of reef species 
are increasing year after year. The major management 
efforts to date consist of establishing marine protected 
areas and promoting the active restoration of coral 
habitats. 

Introduction 

Over the millennia, coral reef ecosystems have served as 
the basis of human survival and sustenance in many 
coastal areas in the tropics and subtropics. They are 
significant geologic formations that provide crucial 
protection from destructive forces of the sea. They 
also harbor the greatest diversity of animal phyla than 
any other habitat on the planet. But because of the 


finely tuned balance between primary production 
and energy consumption within the ecosystem, coral 
reefs are unable to sustain high levels of biomass 
extraction. Added to this, a range of human impacts 
from overfishing to pollution and direct destruction of 
the habitat has caused extensive degradation of these 
ecosystems since the latter decades of the twentieth 
century. This trend has been exacerbated by recently 
recognized effects of climate change. The main culprits 
are rising sea surface temperatures and increasing acid¬ 
ification of ocean waters, both of which exert adverse 
impacts on the physiology of the coral-zooxanthellae 
complex, thus hampering its ability to effectively 
recover from various environmental disturbances. The 
best management option at the present time is to put as 
many reefs as possible under protection, while 
providing for alternative means of livelihood to 
displaced fishers. The field of active coral restoration 
is still at an early stage and warrants further research. 

Geology and Distribution 

A reef is a “discrete carbonate structure formed by 
in situ or bound organic components that develops 
topographic relief upon the sea floor” [ 1 ]. The majority 
of coral reefs are located in the tropics [2, 3] (Fig. 1). 
The limit of their distribution is represented by Mid¬ 
way and Kure, for example, which are well-developed 
algal-coral atolls that lie north of 28°N latitude at the 
northwestern end of the Hawaiian chain. As summa¬ 
rized by Moberg and Folke [4] , coral reefs are estimated 
to constitute from 0.1% to 0.5% of the ocean floor, 
with actual areal estimates ranging between 255,000 
and 1,500,000 km 2 . The coastlines of more than 
100 countries are lined by coral reefs. Advances in 
methodology, particularly remote sensing, are helping 
to delineate more accurately the distribution and 
extent of coral reefs, and the proportions of the differ¬ 
ent components that constitute the bottom substrate 
(live coral, vegetation, nonliving cover such as rock and 
sand) [5]. However, remote sensing is still limited by 
the fact that only the visible range of the electromag¬ 
netic spectrum can penetrate the water column 
sufficiently down to depths occupied by coral reefs 
which are shallow-water ecosystems. Hence, this tool 
should be complemented by actual ground-truth 
surveys whenever possible [5]. 
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Coral Reef Ecosystems. Figure 1 

Global distribution of coral reefs (Source: NOAA National Ocean Service Education Web Site, revised 25 March 2008, 
http://oceanservice.noaa.gov/education/, NOAA National Ocean Service, Communications and Education Division) 


The three main types of coral reefs are fringing 
reefs, barrier reefs, and atolls [2-4]. Other than these 
major categories, there are minor reef forms of various 
sizes and shapes, and situated at different depths of the 
ocean floor. For example, Moberg and Folke [4] 
mention platform reefs as a fourth category (Table 1). 
Fringing reefs develop close to the shore in relatively 
shallow waters. Among the longest expanses of reefs in 
the world is the fringing reef that lines the shores of the 
Red Sea [2], with a length, if stretched out, exceeding 
2,500 miles (4,000 km). Barrier reefs are composed of 
a series of reef structures arranged roughly parallel to 
the coast, and separated from it by a fairly deep lagoon. 
The largest such aggregation of reefs in the world is the 
renowned Great Barrier Reef off the Queensland coast 
of Australia [2] with a length of more than 1,200 miles 
(almost 2,000 km). It varies in width from about 
10 miles (16 km) to almost 200 miles (320 km). 
The total area of reef habitat that it encompasses is 
about 25,000 km 2 [6]. The various reef structures 
within the barrier reef are separated from each other 
by lagoons of different sizes. Atolls are annular reefs 
that encircle a lagoon. They may arise from the deep 
sea, or from the shallower depth of the continental 
shelf [2]. Platform reefs are carbonate structures 


Coral Reef Ecosystems. Table 1 The four main reef types 


Platform 

reefs 

Fringing 

reefs 

Barrier 

reefs 

Atolls 

Frequently 
found in the 
lagoons 
created by 
atolls and 
barrier reefs 

Closely 

follow 

shorelines, 

narrow 

shallow 

lagoon 

Separated 
from land 
by 

a relatively 
wide, deep 
lagoon 

Horseshoe 
shaped or 
circular reef 
surrounding 
a central 
lagoon 
(often far 
from land in 
the open 
ocean) 

In the Great 
Barrier Reef 
lagoon, Belize, 
Red Sea, 
Bahamas 

Red sea. 

East Africa, 

Seychelles, 

and other 

Indo-Pacific 

islands, 

most 

Caribbean 

reefs 

The Great 
Barrier Reef 
in Australia, 
Belize 

Barrier 

Reef, off 
Mayotte in 
the 

Western 

Indian 

Ocean 

>95% of the 
atolls are in 
the 

Indo-Pacific, 
others are 
found 
outside 

Belize and in 

Western 

Atlantic 


Source: From [4] 
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of various shapes and sizes usually situated inside 
lagoons. 

A major factor controlling reef growth is sea level 
rise [1]. It is to be noted, however, that the more 
relevant metric is relative sea level change, because the 
actual sea level in an area is a net result of both sea level 
change and tectonic movement of the land due to the 
presence of active faults [7, 8]. Thermal expansion of 
the ocean due to increasing heat content caused by 
climate change also contributes to sea level rise [9], In 
terms of the effects of sea level changes on reef growth, 
more information exists for tropical Atlantic reefs 
compared to their counterparts in the Pacific [10]. 
Much geological research has focused on Holocene 
reefs. These are considered analogues of ancient reefs, 
and preserve records of eustatic changes in sea level and 
neotectonics. The measured vertical accretion rates 
during the Holocene display some variability (reviewed 
in [10]). Caribbean reefs had higher average rates, on 
the order of 6 m/ky, compared with Indo-Pacific reefs 
which have grown at an average rate of 4.4 m/ky. 
Reasons for this difference still remain unknown. 
In the Pacific, the Philippines, though it lies in the 
acknowledged center of shallow-water marine 
biodiversity, has been little studied compared to the 
Great Barrier Reef, the Ryukyus, and the islands of 
Hawaii and Polynesia. The study of Shen et al. [10] 
represents the first coral-based sea level record from the 
South China Sea. Reef accretion rate is estimated to 
have reached as high as 10-13 m/ky during the period 
9.2-8.2 ka. 

Amphitropical distributions are disjunct distribu¬ 
tions where the population or species occurs 
subtropically on both sides of the equator [11]. 
Such patterns have been most commonly observed 
in fish. In the case of corals, however, disjunct distri¬ 
butions in central Indo-Pacific species are observed 
by Veron [11] to display no significant amphitropical 
component. Veron [11] also claims similarities 
between corals of Hawaii and those in the southeast 
Pacific on a larger scale. Dispersal is believed to play 
a key role in the speciation and distribution of coral 
species. 

The tropical oceans consist of four major biogeo¬ 
graphic regions [4]: the Indo-West Pacific (IWP), 
the Eastern Pacific, the Western Atlantic, and the 
Eastern Atlantic. 


Knowledge about coral distributions is summarized 
as follows (quoted from [11]): 

• There are areas of high diversity in the tropical 
western margins of the world’s three great ocean 
basins, that of the Indian Ocean being relatively 
ill-defined. 

• Within the Indo-Pacific, there is little generic 
variation within the center of high diversity. 

• There is attenuation of species diversity latitudi- 
nally and longitudinally from these centers. 

• Regions distant from centers of species diversity 
tend to have similar genera. 

• Latitudinal attenuation occurs in the same or 
similar “drop-out” sequence in the northern and 
southern hemispheres. 

• Latitudinal attenuation is highly correlated with sea 
surface temperature. 

• The mean generic age of Caribbean corals is twice 
that of Indo-Pacific corals and centers of diversity 
have relatively young generic ages. 

In a recent analysis of biogeographic patterns of 
corals and reef fish across the Pacific and Indian 
Oceans, Bellwood and Hughes [12] established 
a “remarkable congruence” between the two groups in 
terms of gradients in diversity. The peak in diversity 
for both corals and fish was observed in the 
Indo-Australian region, between 120°E and 170°E 
longitude. There was a steep decline across the Pacific 
toward its eastern boundary, where assemblages are 
notably more depauperate. There also was a decline 
westward across the Indian Ocean, though not as 
steep. A similar congruence for corals and fish 
was observed in terms of biodiversity patterns along 
latitudinal gradients. The peak in diversity was near the 
equator, with declines on either side at higher latitudes. 

Biology and Ecology 

The main builders of modern-day reefs are the hard or 
stony corals belonging to the order Scleractinia of the 
phylum Cnidaria [13]. They constitute the main 
framework structure, as well as act as fillers and 
cementers of the reef matrix together with the coralline 
algae. The basic unit of the coral is an anemone-like 
structure called the coral polyp. Large coral forms 
are often colonial, consisting of numerous polyps that 
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have reproduced asexually by budding. However, there 
are a few solitary forms such as members of the 
genus Fungia , commonly known as “mushroom 
corals” Recent phylogenetic evidence indicates that 
deep-water coral species arose from shallow-water 
ancestors, following a long established process of 
diversification of marine taxa along a gradient from 
shallow to deeper habitats [14]. 

The majority of reef corals are in a symbiotic 
relationship with a dinoflagellate of the genus 
Symbiodinium. Dinoflagellates are golden brown 
single-celled algae belonging to the phylum Dinophyta, 
order Gymnodiniales [13]. They have been given 
the name “zooxanthellae” [15]. They inhabit the 
gastrodermal cells of the coral polyp where 
they perform the basic function of photosynthesis. 
The products of photosynthesis are used primarily 
by the algae for their own metabolic maintenance and 
for reproduction. However, under favorable conditions 
the zooxanthellae produce an excess of photosynthetic 
products such as simple sugars, lipids and amino acids 
which are then translocated to the coral host cell [15]. 
The photosynthate is utilized by the coral to supply its 
own nutritional needs. In addition, this supplementary 
energy source fuels the all-important mechanism of 
calcification which is the basis of reef building [16]. 
The symbiotic algae also provide oxygen, a by-product 
of photosynthesis, which helps support the metabolism 
of the coral host. Because of a finely tuned balance 
between primary production and energy consumption 
of all species within the ecosystem, coral reefs have 
relatively low rates of net primary production [17]. 
Hence, they are unable to sustain high levels of biomass 
extraction. 

Not all coral species harbor symbiotic algae, and are 
termed “azooxanthellate” [3], It is the zooxanthellate 
forms that possess the in-house power unit, so to speak, 
which helps support calcification on a large scale. 
Hence, the majority of these are so-called hermatypic 
or reef-building corals. They are colonial in form, and 
this trait together with their symbiosis with algae is 
believed to confer on them a competitive advantage 
over other benthic invertebrates, and helps explain 
their dominance in tropical reefs [14]. 

The extant members of the order Scleractinia 
are evenly divided between zooxanthellate and 
azooxanthellate species [14]. Most azooxanthellate 


species are solitary (as opposed to colonial) and 
are nearly absent from reefs. However, they 
have wider geographic and bathymetric distributions 
than reef corals, extending up to aphotic depths in 
the ocean. Recent phylogenetic analyses indicate 
that they originated from ancestors that were 
symbiotic [14]. 

With respect to the zooxanthellate coral taxa, it is, 
ironically, the possession of this singular advantageous 
feature that has rendered them extremely susceptible to 
a relatively recent stress in the form of increasing ocean 
temperatures. This phenomenon is part of what has 
been recognized as global climate change [18]. 
Temperature stress, that is, water temperatures 
significantly above normal for prolonged periods, 
causes zooxanthellae to lose their pigment, or to be 
expelled by the host polyp. In both instances, there is 
a loss of photosynthetic capability of the coral-algae 
symbiotic complex with a resulting impairment in the 
ability to meet nutritional needs of both coral host 
and symbiont. 

Scientific findings over the last few decades have 
established that there are a number of clades of 
zooxanthellae, originally thought to belong only to 
one species of the genus Symbiodinium. These clades 
are genetically distinct from one another. To date, 
there are eight known phylogenetic clades, with each 
clade containing multiple subclades or types [19, 20]. 
Scleractinian corals establish functional symbioses 
primarily with clades A-D. Their genetic distinction 
imparts different physiological properties to the 
various clades, properties that are manifested at 
the level of the host-zooxanthellae complex or 
“holobiont” [19-21], notably the ability to adapt to 
various ambient temperature regimes. Hence, some 
zooxanthellate corals are able to tolerate increases in 
water temperature better than others. In some 
instances, the algae are not expelled by the host coral 
and are able to continue photosynthesizing, providing 
the latter with vital nourishment during stressful 
thermal conditions. 

Reef accretion is due to a net positive balance of 
calcium carbonate production over calcium carbonate 
loss (Fig. 2, [22]). The major producers of calcium 
carbonate are the hard corals. Their skeletons 
constitute the main framework of the reef, which is 
a geological structure in three dimensions and which 
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Coral Reef Ecosystems. Figure 2 

Conceptual model depicting processes contributing to coral reef carbonate accretion or erosion, and the effects of 
environmental change (From [22]) 


can measure thousands of meters across. There are 
other important calcifying organisms as well, notably 
the coralline algae. The remains of these and other 
calcifiers such as molluscs and foraminiferans serve to 
fill in the reef matrix over geologic time. Another 
source of calcium carbonate on the reef is the import 
of sediment from outside sources via water currents. 
The loss of calcium carbonate is due to erosion, disso¬ 
lution, or export of sediment out of the reef. Erosion 
can be due to physical or biological processes. 
Important bioeroders are fish that graze on coral 
(notably parrotfish), sea urchins that scrape hard 
substrates in the process of feeding, sponges, molluscs, 
and worms that bore into coral skeletons, plus a variety 
of microborers. 

Optimal Environmental Requirements 

Because of the intimate symbiosis with algae, 
reef-building corals will survive and grow down to 
particular depth limits in the ocean due to specific 


light requirements for algal photosynthesis [15]. Most 
extensive reef structures occur in relatively shallow 
waters, typically down to depths of thirty (30) meters 
[ 15] . Hermatypic corals require high levels of aragonite 
supersaturation in order to maintain sufficient rates of 
calcification. Coral reef communities also thrive best in 
nutrient-poor waters [ 1 ]. 

Another critical environmental factor is tempera¬ 
ture. The physiology of the coral-alga symbiotic 
complex functions optimally within specific tempera¬ 
ture ranges, typically not lower than 18°C, with the 
limiting high temperatures believed to be sustained 
maxima of 30-34°C [1]. Although coral reefs achieve 
maximal development in the tropics, recent trends 
indicate that the highest summer temperatures 
in many tropical areas are probably exceeding 
the upper thermal tolerance limits of many coral 
species. The phenomenon of coral bleaching, whereby 
a coral colony becomes white because the symbiotic 
zooxanthellae either lose their pigments or are alto¬ 
gether expelled [23], is becoming more frequent [18]. 
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Different coral species are adapted to a wide range 
of hydrodynamic regimes, from calm to turbulent. 
They may be found in a corresponding range of 
habitats from sheltered lagoons to reef slopes exposed 
to the open sea. However, they require a minimum of 
water circulation around the coral colony to ensure that 
wastes are carried away, and that a sufficient amount of 
oxygen is made available for respiration. A microscopic 
boundary layer exists around the surface of the coral 
colony that controls the diffusion of nutrients and gases 
between the external environment and the internal 
environment of the coral-alga symbiotic complex. 

Because of their metabolic requirements, corals, 
just like all other organisms with aerobic respiration, 
need a minimum amount of dissolved oxygen in the 
water in order to survive. Hypoxic conditions that 
are generated by excessive microbial decay of organic 
matter, or by abnormally high water temperatures, can 
impair coral survival and health. Similarly, salinity, 
because of its effects on coral-algal physiology, must 
also be maintained within certain ranges (34-36 ppt) 
for optimal survival and growth of coral colonies [13]. 
If levels of salinity drop precipitously, as in periods of 
heavy rainfall coinciding with low tides, for example, 
corals in shallow areas can experience osmotic stress. 
If the period of immersion in low salinity water is 
protracted, then some colonies can experience 
mortality. Abnormally high salinity, above typical 
oceanic values of 35 ppt, can also inflict metabolic 
stress on corals. Exceptions, however, are coral 
populations in normally high salinity areas like the 
Red Sea [13] which have adapted over the millennia 
to such extreme conditions. 

Waters around flourishing reef areas are character¬ 
istically oligotrophic, with relatively low nutrient 
concentrations. The coral-alga symbiotic complex is 
efficient in taking up nutrients from surrounding 
waters, utilizing them for photosynthesis, and recycling 
them back into the external medium through 
metabolism. Important nutrients in reef areas are 
nitrate, ammonium, and phosphate. They exist in 
a fine balance with the living community, a balance 
that is easily disrupted by excessive influx of nutrients 
from external sources. An increased supply of nutrients 
favors the primary producers over the coral. Thus, 
depending on a particular situation, there would be 
increased abundances of zooxanthellae residing within 


coral colonies, of phytoplankton in the water column, 
or of algae inhabiting the bottom substrate. Algae 
compete with corals for space and light. 

Biodiversity 

Coral reefs are among the richest marine habitats in 
terms of species diversity [3,11]. They are often alluded 
to as the “rainforests of the sea.” The high species diver¬ 
sity is attributable to known factors, namely, the location 
of coral reefs in the tropical and subtropical latitudes 
with their attendant climatic regimes, and the complex 
topography afforded by the three-dimensional geologi¬ 
cal structure. The relatively high light intensities close to 
the equator support the levels of primary production 
that are needed to drive the multitude of trophic path¬ 
ways in a typical coral reef ecosystem. The relatively 
small ranges of temperature fluctuations, as compared 
to temperate and higher latitudes, enable a sustained 
level of metabolism throughout the entire year for 
many species, including several reproductive episodes. 

Recent analyses of the fossil record indicate that 
reefs were probably cradles of evolution during the 
Phanerozoic era (which covers about 500 million 
years from the present), with approximately 22% of 
examined genera originating in reef environments 
[24]. The class Anthozoa, which contains the 
reef-building scleractinian corals, contains the largest 
number of genera originating in reef environments. 
In addition to being sites of the appearance of 
new species, coral reefs are seen to have exported 
various taxa to other environments. The most likely 
explanation as to why reefs have played this role over 
their geologic history is their habitat complexity [24]. 

The complex three-dimensional structure of a typ¬ 
ical reef provides a plethora of habitable space for 
a wide variety of organism types with a corresponding 
wide range of behaviors. In an analysis of a large con¬ 
temporary data set, Bellwood and Hughes [12] deter¬ 
mined that patterns in the diversity of corals and reef 
fish on a biogeographic scale were best explained by the 
availability of habitat area, a key feature of topographic 
complexity. The various species interact primarily via 
feeding or trophic pathways, although the picture has 
sadly been drastically altered by excessive human 
exploitation, in addition to other natural and anthro¬ 
pogenic impacts [25]. 




2496 


c 


Coral Reef Ecosystems 


Coral reefs contain the greatest number of phyla of 
any habitat on earth, including 32 of the estimated total 
of 34 animal phyla [13]. The stony corals which build 
up the reef framework belong to the phylum 
Cnidaria, which also includes the hydroids and 
jellyfish, and the non-reef building soft corals. Another 
dominant component of the benthic community are 
the sponges belonging to the phylum Porifera. Other 
conspicuous invertebrates are represented by the 
phylum Echinodermata which includes the starfish, 
sea urchins, sea cucumbers, and crinoids and the 
phylum Mollusca covering a huge variety of clams 
and shells, not to mention squid and octopus. 
The phylum Arthropoda includes the class Crustacea, 
a very large one that includes crabs, lobsters, and 
shrimps. It is a member of the subphylum Vertebrata, 
however (apart from the hard corals themselves), that 
captures the most attention from coral reef enthusiasts. 
The most renowned vertebrate group is the fishes 
which comprise thousands of species on a pristine 
reef. Other vertebrates that used to dominate 
coral reef habitats in different parts of the globe, but 
that are sadly depleted now [25] include the sea cows, 
turtles, and crocodiles. Even sea birds that were once 
plentiful around reef ecosystems are now dwindling 
in numbers. 

Among reef habitats, the most diverse are found in 
the Indo-West Pacific region, particular around the 
islands of the Philippines, Indonesia, and Papua New 
Guinea [26-28]. These islands combined harbor more 
than 2,500 species of fish belonging in 165 families, 
more than 3,200 species of mollusc, and about 500 spe¬ 
cies of coral in almost 80 genera. A recent study 
involved mapping of the geographic ranges of 1,700 
species of reef fish, 804 species of coral, 662 species of 
snail, and 69 species of lobster [27]. For all taxa, the 
peak in species richness was determined to be in what 
has come to be known as the “coral triangle” of 
Southeast Asia, namely, the islands mentioned above. 
Species richness then falls off rapidly toward the 
east across the Pacific, and less rapidly toward 
the west, across the Indian Ocean. Sites in the southern 
Philippines and central Indonesia were found to be in 
the top 10% richest locations. 

In the analysis of Roberts et al. [27], 58 species of 
corals, or 7.2% of the total studied, had restricted 
geographic ranges. The values for fish, snails, and 


lobsters were 26.5%, 28.7%, and 53.6%, respectively. 
The implication of this finding is that many species 
may be vulnerable to localized extinctions, particularly 
if they are restricted to geographic locations receiving 
high levels of human impact. Areas with high degrees of 
endemism combined with significant levels of impact 
have been termed marine biodiversity “hotspots.” 
A renowned example of a place which has a high degree 
of biodiversity but whose reefs are among the most 
threatened (and degraded) in the world is the 
Philippines [27, 28]. 

A recent study has shown that the Philippines 
harbors a higher concentration of species per unit 
area than Indonesia, with a rigorous pattern emerging 
for marine shore fishes [29]. The authors hypothesize 
a center of origin on the Eurasian plate, with geological 
events being responsible for allopatric speciation, 
in addition to island integration. The Philippine center 
of diversity has significant abundance of both 
widespread Indo-Pacific species as well as species 
whose distribution is limited to coral reefs. 

Fisheries 

Over the millennia, the greatest benefit provided to 
mankind by coral reef ecosystems has been in the 
form of harvest, primarily of finfish, in addition to 
other vertebrates and invertebrates [25]. Human 
populations along the coast made extensive use of 
nearby reef areas that were situated typically in shallow 
water and often in sheltered lagoons or embayments, 
a fact that rendered them eminently accessible by even 
the most primitive means of transport such as raffs and 
hand-paddled canoes. Traditional methods involved 
the use of various kinds of nets, or of hook-and-line 
fishing. 

There are over 4,000 species of teleost fishes known 
to be associated with coral reefs [30]. Approximately 
10% of these are directly dependent on corals. They are 
either obligate coral dwellers, corallivores (i.e., they 
feed on coral tissue), or possess juvenile stages that 
always settle into live coral in the process of 
recruitment. The total global fisheries yield from coral 
reefs is estimated to be USD five billion annually [30]. 

In the central Philippines, which was recognized 
recently as among the richest in the world in 
terms of species diversity [29], the following fish 
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families contribute substantially to fish catch [31]: 
Acanthuridae, Siganidae, Scaridae, Labridae, 
Haemulidae, Lethrinidae, Lutjanidae, Mullidae, 
Serranidae, Carangidae, Scombridae, Sphyraenidae, 
Belonidae, and Caesionidae. In terms of total potential 
fish yield, Alcala and Russ [31] suggest a figure of 
350,000 metric tons annually for the entire country, 
based on an aerial estimate of 25,000 km 2 of reef area. 

Physical Connectivity 

Marine habitats such as coral reefs have the key 
distinguishing feature (when compared to terrestrial 
habitats) of being situated within a water medium, as 
opposed to air. The physical differences between water 
and air impart fundamental properties in terms of how 
the respective communities function, and how they are 
structured. One obvious difference is that water is 
considerably denser than air, and is capable of carrying 
a much larger quantity of material. Because of 
this, there is significant transport by water currents 
between habitats over a large range of distances, from 
microscopic (at the scale of molecular motion) to the 
macroscopic scale (up to hundreds of kilometers) [32]. 

Oceanic currents convey a whole spectrum of 
materials, from dissolved compounds, to particles 
(living and nonliving) to organisms of various sizes, 
from microscopic plankton to mammals weighing 
several tons. The relevant aspect for marine conserva¬ 
tion and sustainability is the ecological process of 
recruitment, whereby gametes or larvae are carried 
away from particular habitats and transported to 
other areas at varying distances from the point of 
origin. Recruitment is a vital process for the mainte¬ 
nance and renewal of populations. Communities that 
suffer significant decreases in recruitment over appre¬ 
ciable amounts of time are in danger of diminishing or 
even disappearing altogether. 

One of the first papers that advocated a study of 
surface oceanic currents in order to predict possible 
recruitment trajectories is that of Roberts [33]. 
The author, using the Caribbean as an example, 
proposed that if larvae can be regarded as passively 
dispersing particles, then upstream reefs, if harboring 
healthy populations of reef species, may be considered 
sources supplying downstream areas. The effective 
distance by which recruitment can occur successfully 


would, theoretically, be dictated by the length of time 
during which a larva remains viable in the water 
column, or the duration of the pelagic larval phase 
[34]. In an extension of this perspective to manage¬ 
ment applications, downstream reefs receiving viable 
recruits in sufficient amounts would be less at risk of 
decimation even if exploitation rates were high. 
Upstream reefs, on the other hand, would be in greater 
need of protection. These ideas helped to inspire the 
vigorous movement that emerged in a number of 
countries that have coral reefs in their territorial waters 
to establish networks of marine reserves or protected 
areas premised, among other criteria, on the possible 
existence of connectivity based on oceanographic 
patterns. Marine protected areas (MPAs) are discussed 
in section “Marine Protected Areas”. 

Subsequently, a large number of studies have been 
conducted to confirm or refute these ideas [6]. One 
limitation is that the only means of directly determin¬ 
ing the actual fate of individual larvae is by tagging and 
parentage techniques [6]. Other approaches are indi¬ 
rect, and include genetic analysis, the description of 
local population structure, and the application of bio¬ 
physical modeling which combines the characteriza¬ 
tion of current patterns and hydrographic regimes 
with larval behavior of particular species. 

Recent research indicates that the degree of larval 
retention within a natal reef, as well as the spatial extent 
of connectivity in both corals and fish appear to be 
independent of larval duration [6], contrary to 
previous expectations. In the case of corals, for 
both broadcast spawners as well as brooding 
species, both self-recruitment and inter-reef dispersal 
have been found to be significant. Dispersal can occur 
over distances as long as tens of kilometers. Broadcast 
spawners are corals that release gametes into the water 
column. The eggs and sperm may come from the 
same colony (hermaphroditic), or from colonies with 
separate sexes (gonochoric). Fertilization takes place in 
the water column. The entire series of developmental 
stages from cleavage, to formation of the embryo and 
then of the fully-formed larva all occur as part of the 
pelagic phase. Brooding species are corals where the egg 
remains inside the polyp, and is fertilized by sperm that 
find their way into the cavity. The larva develops inside 
the polyp and emerges after a period of time, fully 
formed, into the surrounding water. 
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In the case of reef fish, a number of methods 
indicate that, for a range of species, there are ecologi¬ 
cally significant levels of self-recruitment at the scale of 
individual reefs or reef clusters [6] . Results have yielded 
an average of as high as 45-50% self-recruitment. 
The amount of self-recruitment does not seem to be 
related to the mode of spawning nor pelagic larval 
duration. For both corals and fish, the patterns of 
dispersal may be influenced more by the geographic 
setting, size, and location of reefs rather than by the 
biological characteristics or individual life histories of 
particular species. This finding, again, has implications 
for the design of networks of marine reserves, including 
their size, spatial arrangement, and the distances 
between them. 

Ecological Goods and Services from Coral Reefs 

Coral reefs have provided benefits to coastal human 
populations for thousands of years. Perhaps the greatest 
value that these ecosystems have is as a source of food. 
A variety of organisms continue to be harvested, notably 
the finfish which still possess considerable economic 
value, in addition to their contribution to basic suste¬ 
nance of populations, particularly in developing coun¬ 
tries. Despite their limited cover in proportion to the 
entire ocean floor (0.1-0.5%), almost a third of marine 
fish species in the entire world are found on coral reefs, 
and about 10% of fish consumed by humans are caught 
from reef areas [4]. 

Other vertebrate groups that have been the object of 
traditional fishing are the dugongs (Order Sirenia) and 
turtles [25]. Invertebrates that have been traditionally 
harvested from reefs for human consumption include 
sea urchins, sea cucumbers, molluscs, and crustaceans. 
A variety of species of seaweed are also used for food. 

Coastlines that are lined by coral reefs are afforded 
natural protection from the destructive forces of the sea. 
The natural erosion that occurs within reef habitats due 
to physical abrasion by waves and currents produces 
sand which accumulates on beaches. Typically white 
sand is composed of pulverized coral, shells, coralline 
algae, and foraminiferans in varying proportions. 

In some localities, corals are collected for building 
material in the construction of houses, roads, and other 
infrastructure. Other less common uses of reef organ¬ 
isms are as sources of pharmaceuticals. 


Coral reefs have always been valued for their 
aesthetic beauty. In recent decades, they have 
supported a vigorous tourism and leisure industry 
that has extended its reach globally. Tourists would 
travel hundreds to thousands of kilometers to visit 
pristine reefs. The traffic usually emanates from the 
more affluent societies in Europe and North America, 
and heads in the direction of the Caribbean, the Red 
Sea, the Indian Ocean, and all the way to the high 
diversity reefs of the Indo-West Pacific. 

Table 2 presents a general listing of goods and 
services derived from coral reefs [4]. 

Threats to Coral Reefs 

Unfortunately, the degradation of coral reef habitats 
has become a worldwide trend [35, 36]. The current 
concern from the ecological and conservation 
perspective is how to prevent this trend from becoming 
irreversible. In the Indo-Pacific region, for example, 
which is the cradle of coral reef diversity, coral cover 
is estimated to already lie at least 20% below the best 
historical reference points [37]. The average loss of 
coral cover annually is estimated at 1%. This represents 
a figure of 1,500 km 2 of coral cover lost from the early 
1980s to around 2003. 

In a recent study, Carpenter et al. [38] compiled 
data on the status of all known zooxanthellate 
reef-building corals. These include 845 species from 
the order Scleractinia, plus reef-building species from 
other groups of the phylum Cnidaria, the octocorals 
and hydrocorals. There were sufficient data for 704 spe¬ 
cies so that conservation status could be assigned. Of 
these, 32.8%, or roughly one third, were in categories 
with elevated risk of extinction (Table 3). Following 
criteria from the International Union for Conservation 
of Nature (IUCN), these species were considered to be 
under serious threat of population declines within their 
established geographic ranges. A combination of natu¬ 
ral and anthropogenic factors are responsible for the 
losses in coral cover in different parts of the world. 

Since corals are the natural framework builders for 
the habitat of other reef species, notably fish, declines in 
coral cover and abundance are usually accompanied by 
similar decreases in the other biota as well. A study in 
the Indian Ocean, for example, documented changes in 
size structure, diversity, and trophic composition of 
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Coral Reef Ecosystems. Table 2 Ecological goods and services derived from coral reef ecosystems 


Goods 


Ecological services 







Physical 

Biotic services 




Social and 

Renewable 

resources 

Mining of 
reefs 

structure 

services 

Within 

ecosystems 

Between 

ecosystems 

Biogeochemical 

services 

Information 

services 

cultural 

services 

Sea food 
products 

Coral 
blocks, 
rubble and 
sand for 
building 

Shoreline 

protection 

Maintenance 
of habitats 

Biological 

support 

through 

"mobile 

links" 

Nitrogen fixation 

Monitoring 

and 

pollution 

record 

Support 

recreation 

Raw 

materials for 
medicines 

Raw 

materials 

for 

production 
of lime and 

cement 

Build up of 
land 

Maintenance 

of 

biodiversity 
and a genetic 
library 

Export of 
organic 
production, 
and plankton 
to pelagic 
food webs 

C0 2 /Ca budget 
control 

Climate 

record 

Aesthetic 
values and 
artistic 
inspiration 

Other raw 

materials 

(seaweed 

and algae 

for agar, 

manure, 

etc.) 

Mineral oil 
and gas 

Promoting 
growth of 
mangroves 
and sea 
grass beds 

Regulation of 
ecosystem 
processes 
and functions 


Waste 

assimilation 


Sustaining 

the 

livelihood of 
communities 

Curio and 
jewelery 


Generation 
of coral 
sand 

Biological 

maintenance 

of resilience 




Support of 
cultural, 
religious and 
spiritual 
values 

Live fish and 
coral 

collected for 
the 

aquarium 

trade 









Source: From [4] 


reef fish communities following decreases in coral 
cover as a result of climate change [39]. 

Natural Threats 

The recent literature highlights the phenomenon of 
global change, particularly change in the climate, as 
posing an increasing threat to the health of natural 
ecosystems, both terrestrial and aquatic [40]. 
The main culprit is commonly believed to be the 
large-scale release of carbon dioxide into the 
atmosphere due to the burning of fossil fuel which 


reached unprecedented levels since the beginning of 
the industrial revolution [41, 42]. Another cause 
for the release of large quantities of this gas is land 
use practices such as deforestation [42]. The presence 
of large amounts of carbon dioxide in the atmosphere 
has created the so-called greenhouse effect, 
whereby long-wave radiation emitted by the earth 
system is trapped instead of being released into outer 
space. The result is a net warming of the planet 
over time. Temperature has been determined to 
follow a significant upward trend since the start of the 
industrial era in the 1800s. Thus, although variations in 
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Coral Reef Ecosystems. Table 3 Families of reef-building 
coral species that are near threatened (NT) and/or 
threatened (Thr) according to Red List categories of the 
International Union for the Conservation of Nature (IUCN). 
Criteria are defined by population size reduction and 
changes in geographic range 


Family 

Total species 

NT + Thr 

Thr 

Acroporidae 

271 

71.6% 

49.5% 

Agariciidae 

45 

38.1% 

26.2% 

Astrocoeniidae 

15 

18.2% 

9.1% 

Caryophylliidae 

3 

0.0% 

0.0% 

Dendrophylliidae 

15 

71.4% 

50.0% 

Euphylliidae 

17 

100.0% 

64.3% 

Faviidae 

130 

65.6% 

20.0% 

Fungiidae 

46 

27.3% 

15.9% 

Helioporidae 

1 

100.0% 

100.0% 

Meandrinidae 

10 

42.9% 

42.9% 

Merulinidae 

12 

36.4% 

9.1% 

Milleporidae 

16 

42.9% 

35.7% 

Mussidae 

52 

53.3% 

26.7% 

Oculinidae 

16 

70.0% 

40.0% 

Pectiniidae 

29 

50.0% 

25.0% 

Pocilloporidae 

31 

48.3% 

31.0% 

Poritidae 

101 

56.0% 

34.1% 

Rhizangiidae 

1 

0.0% 

0.0% 

Siderastreidae 

32 

42.3% 

19.2% 

Trachyphyliidae 

1 

100.0% 

0.0% 

Tubiporidae 

1 

100.0% 

0.0% 

Total 

845 




Source: Modified from [38] 


temperature are a natural feature of the environment, 
and occur on a range of time scales ranging from 
diurnal and seasonal all the way to millions of years 
(corresponding to the geological epochs), the specific 
causes as to why it is now perceived to be a problem are 
human-related. 

Monitoring long-term trends in sea surface 
temperature (SST) has been aided greatly by advances 
in technology, as exemplified by the suite of satellite 


products developed by the National Oceanic and 
Atmospheric Administrations (NOOA)- Coral Reef 
Watch [26]. Specifically, services have been derived 
from the 0.5°C (approximately 50 km) resolution 
twice-weekly product delivered by the Advanced Very 
High Resolution Radiometer (AVHRR). This product 
has been effective in detecting large-scale thermal stress 
in the ocean, and in relating this to coral bleaching 
events, for example. In the Indo-Pacific region, 
particularly the islands of the Philippines, Indonesia, 
and Papua New Guinea also known as the “Coral 
Triangle,” an upward trend in SST has been established 
for the period 1985-2006, with an average rate of 
increase of 0.2°C per decade [26]. However, it should 
be noted that satellites detect the temperature of what is 
essentially the oceans “skin,” so that satellite-derived 
data should, wherever possible, be complemented by 
actual temperature measurements made on site at 
different depths of the reef [43]. 

With respect to estimates of warming of the 
entire global ocean, reliable data exist for the upper 
300 m [44] . The majority of in situ measurements have 
been contributed by expendable bathythermograph 
(XBT) data. After accounting for multiple sources of 
uncertainty, a statistically significant linear warming 
trend has been obtained for the period 1993-2008 
amounting to 0.64 Wm“ 2 (reflecting the entire surface 
area of the Earth) with a 90% confidence interval of 
0.53-0.75 Wm“ 2 [44]. 

The increased levels of carbon dioxide in the 
atmosphere have resulted in another phenomenon 
that is detrimental to coral reefs. Because of increased 
partial pressure of the gas (PC0 2 ) which results in 
steeper diffusion gradients, there is greater drawdown 
into the ocean from the atmosphere. An international 
survey effort consisting of two research programs, the 
World Ocean Circulation Experiment and the Joint 
Global Ocean Flux Study, yielded an estimated oceanic 
uptake of anthropogenically produced C0 2 for the 
period 1800-1994 of 118 zb 19 petagrams of carbon 
[42]. This amount represents about 48% of the total 
emissions from fossil fuel burning and cement 
manufacturing, indicating the magnitude of the 
oceanic sink. Furthermore, this value is projected to 
be about one third of the long-term potential of the 
ocean for carbon storage. A more recent estimate made 
in 2008 puts the value at 140 ± 25 petagrams of carbon, 
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with the Southern Ocean being the primary conduit 
for entry [41]. 

Carbon dioxide reacts with seawater to form 
carbonic acid, decreasing oceanic pH [45]. If the 
PC0 2 concentrations in the ocean surface continue to 
increase in proportion to the increase in atmospheric 
concentrations of this gas, it is estimated that a 
doubling of the latter from preindustrial levels will 
result in a 30% decrease in carbonate ion concentration 
and a 60% increase in hydrogen ion concentration [42] . 
The result is increasing acidification of the oceans 
which is believed to pose a threat to calcifying 
organisms, including the hard corals [46]. The higher 
acidity values depress aragonite saturation state which 
is crucial in the net carbonate balance, and which can 
spell the difference between net accretion and net 
dissolution of calcium carbonate. Not only corals but 
other calcifying organisms such as shellfish, foraminif- 
erans, and echinoderms are at risk [47], with significant 
consequences for the rest of the ecosystem. For example, 
trophic pathways will be altered if certain species 
decrease in abundance as a result of reduction in calci¬ 
fication rates, which imply diminution or weakening of 
protective shells or armor, thus compromising growth, 
reproduction and survival from predation. 

A long-term study of growth rates of juvenile coral 
in the Caribbean suggests that corals <40 mm have 
actually grown more slowly than previously thought 
[48]. The period of decline in growth rate corresponds 
with observations of rising seawater temperature and 
depressed aragonite saturation state, implying effects of 
climate change. This finding does not bode well for the 
future viability of coral communities in the region 
because juvenile stages are critical for the successful 
establishment of populations of any species. On the 
other side of the world, in the Great Barrier Reef, 
experimental manipulations of temperature directly 
affected the metabolism of coral larvae, with higher 
temperatures increasing rates of development, and 
thus the onset of competency [49]. Thus, expected 
increases in sea surface temperatures will likely cause 
changes in larval dispersal ranges. 

Another natural phenomenon that is perceived to be 
growing in severity is the occurrence of diseases in corals 
and other marine species [50, 51]. Although diverse 
communities of microbes normally inhabit the surfaces 
of coral colonies and all other types of surfaces on a 


reef [13] increasing experimental evidence combined 
with modeling indicates that environmental factors 
such as increasing water temperatures might promote 
a shift in dominance from antibiotic-producing 
beneficial microbes to pathogenic forms such as certain 
species of Vibrio. The loss of antibiotic protection 
would render corals more susceptible to disease and 
increase their risk of mortality [52]. Other causative 
agents of coral disease that have been identified 
include Aspergillus sydowii (that attacks sea fans) and 
Sphingomonas sp. [51]. 

Predation on corals has long been considered an 
important natural threat to the survival of corals [1]. 
Two notable examples of this phenomenon involve the 
crown-of-thorns (COT) starfish, Acanthaster planci , 
and the gastropod Drupella. Damage on a fairly large 
scale, up to hundreds of square meters on a number of 
reefs, has been documented due to active grazing by 
these two invertebrate predators. Cyclic fluctuations in 
populations of many species are considered natural, 
and are connected with changes in climate, food supply 
or a combination of these. With respect to COT 
outbreaks, however, the causes are presumed to relate 
to unnatural factors such as human overharvesting of 
natural predators and increases in planktonic food 
supply of the larvae [18, 53]. Recent evidence from 
the Great Barrier Reef, for example, suggests that 
nutrient enrichment of the waters (probably due to 
land-derived pollution) has triggered blooms of 
phytoplankton that serve as food for COT larvae, thus 
enhancing their survival and growth to adulthood [53] . 

Anthropogenic Threats 

Up till recent times, the most severe impacts inflicted 
on coral reefs have been due to human activities, 
directly or indirectly [22, 54]. A recent review of 
anthropogenic threats to coral reefs worldwide is that 
of Wilkinson [36]. 

Water quality is generally influenced by levels of 
nutrients such as nitrogen and phosphorus, concentra¬ 
tions of total organic carbon, turbidity, and light 
attenuation [55]. Water quality, in general, has been 
documented to correlate directly with species richness 
of hard corals and of phototrophic octocorals [56]. 
Clarity of the water, as commonly measured in terms 
of Secchi depth for example, is a good indicator of 
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water quality. Low water quality has been observed 
to be associated with proliferation of macroalgae. 
The most common cause of diminution in water 
quality is effluent from the land. 

Land-derived impacts have been in existence ever 
since humans started to modify the land extensively for 
their use. In terms of documented effects on reefs, 
the most important activities are agriculture and 
deforestation. These activities involve the release of 
huge amounts of sediment which eventually find their 
way into coastal waters via run-off. Rivers, for instance, 
transport large amounts of sediment into the ocean. 
The sediment settles directly on the bottom, physically 
smothering benthic life-forms. 

Another land-derived impact is pollution from 
both point and nonpoint sources. It can take the 
form of organic pollution, such as from sewage, and 
inorganic pollution - for example, heavy metals. 
Sewage in the water is ultimately degraded by bacteria 
to yield inorganic nutrients, primarily nitrogen and 
phosphorus. These elements fuel photosynthesis by 
both phytoplankton and benthic plants, resulting in 
high rates of primary production and an increase 
in biomass of the primary producers. The latter 
are effective competitors with hard corals for space 
and light. 

In coral reef habitats, the nutrients of interest are 
nitrate, ammonium, and soluble reactive phosphate 
assimilated by plants [57]. In addition, dissolved 
organic forms of nitrogen and phosphorus are signifi¬ 
cant as well. They are remineralized into the inorganic 
forms that are utilized by primary producers. Coral 
reefs that are close to sites of dense human habitation 
are susceptible to detrimental effects of nutrient 
pollution. The effects are aggravated if there is limited 
flushing, such as within embayments and lagoons. 

Destructive fishing practices are another cause of 
major disturbances to coral reef communities [58]. 
In many parts of the developing world, the use of 
explosives to catch fish is still a significant problem. 
The use of the metabolic poison, sodium cyanide, to 
catch ornamental fish for the aquarium trade, also 
continues to be widespread [28]. Both categories of 
disturbance inflict direct harm on the habitat. 
Explosives cause catastrophic destruction of the reef 
substrate and all living organisms within the effective 
radius of the explosion. They leave behind telltale 


craters of varying size, depending on the blast, and 
expanses of rubble and broken coral. 

Cyanide is usually dissolved in sea water and 
squirted directly into crevices and other places where 
the target fish shelter. It kills virtually all living 
organisms with which it comes into contact, including 
coral colonies which are common hiding places for fish. 
The dose of the poison applied is meant to merely stun 
the fish for ease of capture, because these are intended 
to be shipped live to prospective customers. 

Another major impact is simply the excessive 
harvest or overfishing of organisms, be they finfish, 
invertebrates, or economically important species of 
algae. For coastal ecosystems in general, the ecological 
extinction of species due to excessive exploitation 
precedes the effects of other human-related impacts 
such as pollution and anthropogenic-related climate 
change [25]. Jackson et al. [25] hypothesize that it is 
the depletion of entire species groups, or even trophic 
levels, that paved the way for the occurrence of subse¬ 
quent disturbances such as eutrophication, outbreaks 
of disease and the invasion of species. 

Impacts on Reef-Associated Organisms 

Decreases in diversity and/or abundance of hard corals 
have been accompanied by declines in other reef- 
associated species, notably the fish [30]. The families 
that are directly dependent on corals are most at risk. 
These include the butterflyfish (Chaetodontidae), the 
cardinalfish (Apogonidae), and the gobiids (Gobiidae). 
A recent empirical study demonstrated that live coral 
provides effective shelter to both adult and juvenile 
stages of the damselfish Pomacentrus amboinensis [59]. 
In contrast, juveniles attempting to shelter in bleached 
and dead coral were driven away by larger fish, thus 
exposing them to greater predation. They subsequently 
suffered higher mortality rates. In addition, factors asso¬ 
ciated with climate change and various human impacts 
will affect the biology and ecology of the fishes them¬ 
selves. Temperature, for example, exerts a direct effect on 
fish physiology, and abnormally high levels can impair 
reproduction. The aggregate effects of perturbations to 
the coral reef ecosystem will be manifested in changes in 
the diversity, abundance, and distribution of fish. Cli¬ 
mate change per se can exert a direct influence by caus¬ 
ing shifts in the geographic ranges of various fish species. 
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One of the most widely discussed phenomena in the 
scientific literature is that of the coral-algal phase 
transition or “shift” meaning that reef communities 
once dominated by scleractinian corals are now largely 
populated by macroalgae or turf algae [22, 60] or some 
other reef-associated species [61]. This phenomenon 
gained widespread attention in the Caribbean [25, 62] 
with the occurrence of mass mortalities of animals that 
normally graze on algae. Such grazers include sea 
urchins and herbivorous reef fish (scarids and 
acanthurids), and incidents of mortality have been 
attributed to disease or overfishing of these functional 
groups. The resulting depletion in populations of 
herbivores has been observed to correlate with 
explosions in abundance of macroalgae, which are 
competitors of hard corals for light and space. 
A recent study appears to confirm this belief for some 
Caribbean locations, where recovery of the sea urchin 
Diadema antillarum has occurred [63]. In sea urchin 
zones where there is significant grazing on algae, 
the survival of juvenile scleractinian corals was 
observed to be higher. 

For coral reefs in other parts of the world, however, 
the shift from coral to macroalgal dominance does not 
appear to be as widespread as previously assumed [64] . 
Bruno et al. [64] compiled empirical data from reef 
surveys in the Indo-Pacific, the Great Barrier Reef, the 
Caribbean, and the Florida Keys. The results of their 
analysis indicate that the replacement of corals by 
macroalgae as the dominant benthic component on 
reefs is neither common nor geographically extensive 
as previously thought. Apart from a depletion of 
populations of grazers, nutrient pollution also contrib¬ 
utes to algal blooms [65, 66]. Some evidence suggests 
that the effects of nutrient enrichment tend to be local¬ 
ized to reefs located near centers of dense human 
populations, and in areas where there is limited 
flushing of water, such as lagoons and embayments [57] . 

Management 

Given the problems facing coral reefs enumerated 
above, it is clear that management action should 
operate on a range of scales, from highly localized to 
regional and global. It will have to encompass the whole 
gamut from reducing destructive fishing practices, to 
implementation of land use schemes that control 


sedimentation and pollution, and to activities such as 
restoring herbivore populations [37]. The global policy 
arena will also need to be involved as this is the effective 
scale at which global climate change can be addressed. 

The field of coral reef management has seen uneven 
progress over the past decades [54, 62, 67] . Like in most 
other areas involving natural resources, the effective¬ 
ness of management has been diminished by the fact 
that responsibility for what is essentially a common 
domain has been fragmented and distributed over 
a number of sectors, such as agriculture, fisheries, 
tourism, human settlements, and industry. The result 
of this is the often conflicting mandates of different 
government departments and agencies. 

An emerging consensus is that human population 
growth has begun to far outstrip the capacity of the 
natural resource base to provide goods and services, 
and to absorb wastes [67]. This reality manifests itself 
more clearly in developing countries, where the 
inability of many citizens to maintain an adequate 
livelihood results in chronic want, hunger, and 
disease. In the case of developed countries, the better 
managed economic sectors (including manufacturing 
and trade), the superior infrastructure, and the more 
efficient social delivery system tend to compensate for 
the diminishing returns from natural resources. 
Nevertheless, the heavy burden of human demands 
on natural resources is true for coral reefs in both the 
developed and developing world. 

For coral reef ecosystems, the most pressing goal of 
management action should be the protection of 
whatever levels of biodiversity are remaining. 
Ecological science has demonstrated the advantages of 
maintaining high species numbers in any natural 
habitat [61]. One benefit of high species diversity is 
that a number of different species can play similar 
ecological roles, a prime example in coral reefs being 
herbivory or grazing. This aspect is sometimes referred 
to as “redundancy.” If one species becomes locally 
extirpated because of overfishing, for example, another 
species can continue its ecological function, thus 
maintaining the overall balance in the system in terms 
of fluxes of matter and energy. Examples of reef organ¬ 
isms with similar grazing functions are sea urchins 
(belonging to the genera Diadema , Tripneustes and 
Salmacis) and fish belonging to the families 
Acanthuridae, Chaetodontidae and Scaridae. 
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Protecting, and then enhancing, existing levels of 
diversity would necessarily require limits on the 
extraction of living resources. This would require strict 
regulation of fishing. If at all possible, fishing for 
subsistence should be discouraged, and alternative 
livelihoods developed for traditional fishers. All 
manner of destructive fishing should be banned, with 
the imposition of severe penalties. 

Marine Protected Areas 

The designation of marine protected areas (MPAs), 
where no extractive activity whatsoever is allowed, is 
by far the most widely advocated management tool for 
coral reefs [54, 67, 68] . The core argument for this type 
of intervention is compelling: If carefully selected areas 
are protected from human activities, particularly 
resource extraction, then natural ecological processes 
will be allowed to proceed without interruption. 
The expected result is a healthy, thriving community 
with increased abundance and diversity of many 
reef-associated species [69]. 

The natural behavior of the motile species, partic¬ 
ularly the finfish, would cause them to move beyond 
the boundaries of an MPA into areas that are open for 
fishing. This so-called spillover should be of benefit to 
subsistence and, where applicable, commercial fishers. 
An additional benefit of functioning MPAs is in 
the form of “recruitment subsidy” whereby spawning 
populations of reef species that are under protection 
produce adequate quantities of gametes or larvae which 
are then carried beyond the MPA boundaries to 
replenish depleted stocks in openly exploited areas. 

These two expected benefits of MPAs require 
further research and experimental testing [70], with 
available supporting evidence still somewhat sparse 
and ambiguous [68]. One study providing possible 
evidence of export of fish from a coral reef reserve is 
that of Abesamis and Russ [71] in the Philippines. 
The reserve has been maintained for over two decades, 
and the observed movement into surrounding areas 
was attributed to behavior of adult fish as they 
increased in size and density. 

One documented effect of MPAs is increase in the 
populations of herbivores as a result of protection from 
fishing [72, 73]. Herbivores include invertebrates such 
as sea urchins, and fish such as parrotfish (family 


Scaridae) and surgeonfish (family Acanthuridae). 
In studies in the Philippines and the Bahamas, 
adequate populations of herbivores were found to be 
effective in keeping macroalgal cover in check [72, 73]. 
It is strongly advocated that the maintenance or 
restoration of herbivore populations within reef areas 
should be part of management, in addition to control¬ 
ling nutrient pollution (see section “Improving Water 
Quality”). A recent study of a Caribbean marine reserve 
focused on coral population dynamics after the 
occurrence of two types of major disturbances, namely, 
bleaching and hurricane impacts. There was more 
effective recovery of corals within as compared to 
outside the reserve [72]. Coral cover increased 
significantly along with a corresponding reduction in 
macroalgal cover brought about by grazing of 
herbivorous parrotfishes. The latter had presumably 
increased in abundance due to effective protection. 

The effects of marine reserves have not always 
been consistent, however, particularly in the face of 
large-scale disturbances such as those related to climate 
change. In a large-scale study of no-take areas (NTAs) 
spanning several countries across the Indian Ocean, 
there was little difference in the decline in coral cover 
between NTAs and areas open to fishing [39]. This 
finding suggests that management efforts should 
encompass spatial scales larger than the patchwork of 
MPAs along with their intervening, openly exploited 
areas. More attention should be given to reef areas 
that serve as regional refugia, and which act as viable 
sources of recruits even in the face of large-scale 
impacts. 

To be effective in the long run, fisheries manage¬ 
ment must take a comprehensive approach, and com¬ 
bine the designation of MPAs with other interventions 
such as restrictions on gear types and catch size, as well 
as the provision of alternative forms of livelihood to 
fishers [74]. 

Improving Water Quality 

Effective management action to maintain the health 
and diversity of coral reef communities must include 
the maintenance of good water quality [56]. Coral 
reef communities are known to thrive best under 
conditions of clear water, low nutrient levels, adequate 
sunlight, and moderate temperatures. The key 
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intervention in this regard is the strict control of pol¬ 
lutant loading in coastal waters known to harbor sen¬ 
sitive ecosystems such as coral reefs, sea grass beds, and 
mangrove forests. Thus, all countries that possess these 
ecosystems in their territorial waters should enforce 
strict antipollution legislation. 

Since another major land-derived stress is sediment 
loading, management must also address land-based 
activities that cause the mobilization of significant 
quantities of sediment and their release into coastal 
waters. The main sectors of concern would be agricul¬ 
ture (which also employs the use of chemicals in the 
form of fertilizers and pesticides), deforestation, and 
the building of coastal infrastructure (for urbanization 
and industry). 

Restoration 

The field of coral restoration has, as its current focus, 
the hard corals themselves, belonging to the order 
Scleractinia. These are the main structural components 
of present-day reefs, both as framework-builders and 
as fillers of the reef matrix (section “Biology and 
Ecology”). The science of coral restoration is still fairly 
young and very much at an experimental stage [75, 76] . 
Perhaps one of the earliest works alluding to the 
possibility of restoring damaged reef habitats by 
means of the transplantation of either branches or 
whole colonies of hard corals is that of Maragos [77] 
in Hawaii. Examples after that include investigations 
conducted in the Philippines on the survival and 
growth of transplanted coral fragments [78]. At the 
present time, the field has seen the entry of new 
experimental techniques geared toward augmenting 
live coral cover by increasing the abundance of either 
adults or recruits [75]. 

The main rationale for intervention in the form of 
restoration is if a damaged reef community has not 
recovered naturally over appreciable amounts of time. 
The causes of reef degradation are enumerated in 
section “Threats to Coral Reefs.” Unfortunately, many 
reefs throughout the world are presently in a degraded 
state, showing little signs of natural recovery. The 
dominant cover in such reefs typically consists of 
macroalgae, turf algae, sponges, soft corals, hydroids, 
or any combination of these in various proportions. 
Coral restoration, by definition, is the attempt to bring 


a reef community back to a dominance by scleractinian 
corals within a relatively short period of time 
(as opposed to the decades it would take for a reef to 
recover naturally, if at all). Some experimental 
evidence indicates that the establishment of live coral 
cover can increase the local diversity of reef-associated 
species [79]. 

Before any attempt at restoration is initiated, 
a thorough assessment of the candidate site should be 
made. The causes of damage, both proximal and long 
term, should be identified and their current degrees of 
severity evaluated. It would be ideal if the causes 
of destruction are mitigated prior to intervention. 
For example, if causes of reef degradation involve poor 
water quality due to land-based pollution or sedimen¬ 
tation, then these issues should be resolved first. Simi¬ 
larly, if damage is caused by harmful fishing practices, 
then these should be controlled or eliminated. A further 
assessment of the candidate site for restoration should 
consider the prevailing environmental conditions such 
as suitability of the substrate, water clarity, salinity, and 
nutrient levels. These are factors that directly affect coral 
physiology and should be determined to be suitable for 
coral survival and growth before juveniles or trans¬ 
plants are introduced. 

The most common approach in the past has been to 
transfer live whole colonies, or fragments of colonies 
from healthy coral populations to degraded sites within 
the same reef [76]. It is necessary to source material 
from within the same general area because long periods 
of transportation would inflict significant stress on the 
coral. This would potentially lead to unacceptable mor¬ 
tality rates of the transplants. The whole coral colony or 
transplant is translocated to what is essentially a new 
environment. One essential criterion for site selection 
is that the recipient site should show evidence of the 
existence of coral communities in the past, usually in 
the form of dead skeletons. This is an indication that 
the area could possibly support a new coral-dominated 
community again, provided that environmental condi¬ 
tions are not permanently inimical to the survival and 
growth of introduced corals. 

Since the use of whole colonies or fragments of 
colonies usually causes some collateral damage to the 
source population, other strategies have been devel¬ 
oped in recent years to avoid this. One strategy is to 
collect fragments found on the reef substrate and “rear” 
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these under special conditions of high maintenance 
(“nurseries”) until they reach a size suitable for 
reestablishment on the natural substrate. Another 
strategy being tested is to induce spawning of coral 
colonies under laboratory or hatchery conditions and 
collect the resulting larvae which can number in the 
tens of thousands. The larvae are induced to settle on 
material of a convenient size, such as plastic pins or 
concrete tiles. When the larvae metamorphose into 
juveniles, the latter are reared up to a size considered 
suitable for “outplanting” to the natural reef. 

A major problem of coral restoration efforts is the 
relatively high mortality suffered by either recruits or 
transplants introduced into a degraded reef [80]. As 
mentioned above, a degraded reef surface is typically 
covered by a variety of organisms including turf algae, 
macroalgae, soft corals, sponges, and hydroids. These 
are forms that have proven to be able to adapt and 
thrive under the altered conditions that probably have 
led to the demise of the original hard coral inhabitants 
in the first place. Examples of such environmental 
conditions are increased turbidity and nutrient loading 
in the water. In addition to being tolerant of the new 
conditions, these organisms effectively compete with 
the hard coral recruits and transplants for space. 

Coral transplants and recruits introduced into 
degraded reef environments have also been observed 
to suffer from predation (pers. obs.). Common preda¬ 
tors that occur in the reefs of the Philippines and other 
similar geographic areas are the crown-of-thorns star¬ 
fish, Acanthaster planci , and the gastropod Drupella. 
There are possibly other causes of coral mortality that 
are as yet undiscovered, and that have hampered 
progress in restoration. 

Efforts at the present time are focused on developing 
cost-effective techniques that would minimize expendi¬ 
ture in terms of finances, manpower, and resources, and 
maximize output in terms of live coral material 
(juveniles or adults) that can be used in restoration 
interventions. 

A parallel activity to restoration of the hard corals is 
restocking of other important reef invertebrates. 
A notable example is the giant clam (Family 
Tridacnidae) which is a major inhabitant of coral 
reefs, can grow to appreciable sizes, and can contribute 
to the carbonate budget of a reef by accreting fairly 
massive shells. Many species of giant clams have 


become rare or locally extinct in many reef areas 
throughout the world, primarily due to overharvesting. 
Scientific efforts have yielded a fair degree of success in 
terms of producing large numbers of juveniles under 
hatchery conditions and then reintroducing these into 
suitable reef sites [81]. Such established populations 
have been thriving for several decades in the 
Philippines, for example. A recent experiment has 
demonstrated that restocking denuded reef areas with 
live coral together with giant clams can significantly 
enhance fish abundance and diversity [82]. 

Future Directions 

Coral reef ecosystems, particularly in the developing 
world, are clearly no longer able to support major 
extractive activities, including fishing, on a sustained 
basis. They are confronted with the twin challenges 
of limited biological productivity and demands 
from a relentlessly increasing human population. It is 
imperative that, as far as possible, coral reef areas all 
over the world be put under protection to conserve 
whatever is remaining of their biodiversity and 
ecological function. However, equally urgent attention 
should be paid to providing alternative livelihood to 
the numerous populations of coastal fishers who will be 
displaced once their traditional fishing grounds are 
declared off limits for protection. In addition to 
protecting nature, the fundamental obligation of 
resource management is to help ensure a decent quality 
of life for the human inhabitants. 

A scrutiny of the existing global economic order 
would not be unwarranted in this respect, since many 
coastal populations in developing countries have been 
reduced to economic desperation. In the management 
of the use of coastal resources, the perspective should 
be extended beyond the limited confines of the coastal 
zone to the national, regional, and even global arenas. 
For example, it is about time that, in the spirit of 
interdisciplinary research, fiscal and trade policies 
also be examined along with traditional ecological 
investigations. An inherent injustice may still be 
embedded in legal international structures that tend 
to disenfranchise the poor of the control over their 
own natural resources (not just fisheries) and over 
the generation of economic wealth from the proper 
use of these resources (including forest, agriculture, 
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and mineral assets). Unless this aspect is properly 
addressed, impoverished populations all over the 
world will continue to deplete natural resources at 
unsustainable levels simply to ensure their own sur¬ 
vival. The related issue of controlling human popula¬ 
tion growth rate would be a crucial consideration. 

Alongside this basic initiative, vigorous efforts 
should be put in place to control or abate pollution, 
and to improve water quality. Coral reef communities 
are extremely sensitive to factors that diminish 
light availability, such as turbidity, and to abnormally 
high nutrient levels that promote the proliferation of 
weedy species such as phytoplankton and benthic algae. 

The field of coral restoration is still very much at an 
experimental stage, with relatively few documented 
reports of success. It is worthwhile to continue research 
in this area, as certain coral species have proven to 
be resilient to variations in climatic factors such as light 
and temperature, and the enhancement of topographic 
complexity by coral transplantation has been demon¬ 
strated to increase species diversity on local scales. 
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Glossary 

Climate model A numerical model consisting of four 
components: atmosphere, ocean, land, and sea ice. 

Earth system model A climate model with additional 
components, which must include a carbon cycle in 
the land, atmosphere, and ocean components. 

Troposphere The lower part of the atmosphere where 
most of the weather occurs. 

Stratosphere The region of the atmosphere above the 
troposphere, and is the location of the ozone layer. 

Carbon cycle The processes by which carbon in all its 
forms interacts and moves around in the land, 
atmosphere, and ocean components of the climate 
system. 

Positive feedback A set of processes whereby a small 
perturbation in the climate system amplifies and 
increases in size. 

Negative feedback A set of processes whereby a small 
perturbation in the climate system decays and 
reduces in size. 

Control simulation A run of a climate model or earth 
system model where the forcing is kept constant in 
time. 

Ensemble simulations A set of runs which have the 
identical forcing, but start from slightly different 
initial conditions. 

Chaotic system A system of equations with the prop¬ 
erty that two runs starting from slightly different 
initial conditions diverge from each other, often 
quite quickly. 

Climate projection A simulation of the climate system 
into the future with prescribed forcing, where the 
model has not been initialized to the observed 
climate. 

Climate forecast A simulation of the climate system 
into the future with prescribed forcing, where the 
model has been initialized to the observed climate. 

Equilibrium climate sensitivity The increase in the 
globally averaged surface temperature in a model 
when the atmosphere concentration of carbon 
dioxide is doubled. 


Atmosphere Model Intercomparison Project 

A standard simulation of the atmosphere compo¬ 
nent of a climate or earth system model, which 
allows different models to be compared to each 
other. 

El Nino-Southern Oscillation The largest interannual 
signal in the climate system, which occurs primarily 
in the tropical region of the Pacific Ocean. 
Thermohaline circulation The overturning circula¬ 
tion in the global oceans where water sinks at very 
high latitudes, spreads very slowly horizontally to 
all the ocean basins, and then slowly returns toward 
the surface. 

Conveyor belt Another popular name for the thermo- 
haline circulation. 

Deep water formation The process by which very 
dense water near the surface sinks to near the 
ocean bottom at high latitudes, which forms the 
sinking part of the thermohaline circulation. 

Definition of the Subject 

We are all familiar with weather forecasts that predict 
the local weather for the next few days. These are made 
using a high-resolution numerical model of the atmo¬ 
sphere, and sometimes extend out as far as 10 days. 
Most meteorological centers also produce seasonal out¬ 
looks, which give probabilities of the average tempera¬ 
ture and precipitation being above, near, or below 
normal. These outlooks do not forecast the weather 
for a particular day, but give predictions of the seasonal 
averages. Seasonal outlooks are also made with an 
atmosphere model, but use climatological observed 
values for the evolving state of the surface ocean, 
land, and sea ice conditions. However, if forecasts are 
to be made more than a season ahead, then using just 
an atmosphere model is not sufficient, and the evolu¬ 
tion of the ocean, land, and sea ice states must also be 
made using numerical models for these components of 
the climate system. The reason is that the surface ocean, 
land, and sea ice states interact strongly with the atmo¬ 
sphere and influence its future evolution because they 
change on a much slower timescale than the 
atmosphere. 

A climate model is used to understand how the 
climate system works, and how the various compo¬ 
nents interact with each other. It is used to simulate 
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the present day climate, the recent past climate, and the 
climates of different paleoclimatic epochs. It can also be 
used to simulate the future statistical state of the atmo¬ 
sphere a decade or a century into the future, but does 
not predict the local weather on particular days. The 
atmosphere resolution of a climate model is much 
reduced compared to that used in a weather forecast, 
so that climate information is given on regional to 
global scales, and not on local scales. The climate state 
a long time ahead depends on the future levels of 
quantities that force the climate system, such as the 
concentrations of carbon dioxide and other greenhouse 
gases, several different atmospheric aerosols, and the 
levels of solar and volcanic activity. Therefore, these 
climate projections depend on many future choices to 
be made by mankind, which will determine the con¬ 
centrations of greenhouse gases and aerosols over the 
next century. Each climate projection needs a scenario 
for the future concentrations of greenhouse gases and 
aerosols before it can be carried out. 

Thus, a physical climate model consists of four 
components; atmosphere, ocean, land, and sea ice. 
These components are used to calculate the future 
state of the component given an initial state and the 
various quantities that force the component. These 
four basic components have to interact with each 
other, so that most climate models have a fifth compo¬ 
nent, often called the coupler, see Fig. 1, which has two 
main functions. The first function is to start, oversee 
the time evolution, and finish each model simulation. 
The second is to receive all the information from each 
component that is required by the other components 
and to send back to each component all the 
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Configuration of a climate model 


information that it requires to continue its simulation 
forward in time. For example, the ocean component 
needs the atmosphere-ocean wind stress that drives the 
ocean currents, the net heat flux and net fresh water 
flux (precipitation plus river runoff and sea ice melt 
minus evaporation) going from the atmosphere, ice, 
and land into the ocean. These are most often calcu¬ 
lated in the coupler, and depend on the atmosphere 
surface wind, temperature, and humidity, etc., and the 
ocean sea surface temperature and currents, which are 
fields that are sent to the coupler. 

There is another reason why the set up using 
a coupler shown in Fig. 1 is extremely useful. Only 
a relatively small fraction of climate model runs are in 
fully coupled mode, and there is a large number of 
different ways to run the model components. In runs 
described in more detail later, one or more of the 
components is replaced by its data equivalent, which 
provides the observed data required by the coupler to 
force the active components. For example, in an Atmo¬ 
sphere Model Intercomparison Project (AMIP) run, 
the numerical ocean and sea ice components are turned 
off and replaced by simple data components that pro¬ 
vide observed time series of surface ocean and sea ice 
temperatures to the coupler. The coupler framework 
shown in Fig. 1 then ensures that the fluxes exchanged 
between various components are always calculated 
consistently, whether using observations or predicted 
model fields. 

There is no universally accepted definition of an 
Earth System Model (ESM), but it must have more 
components than the four in a climate model. The 
usual additional components are a model for the dis¬ 
tribution of carbon on the land surface, and an ocean 
ecosystem component, which are required if the ESM is 
to simulate the earth’s carbon cycle. However, an ESM 
will often have additional components as well. The 
commonest of these is an atmospheric chemistry com¬ 
ponent, but some ESMs have an atmosphere compo¬ 
nent that simulates the upper levels of the atmosphere, 
including the stratosphere, not just the troposphere, 
which is the lowest layer of the atmosphere where 
most of the weather takes place. Finally, several ESMs 
will soon include a component that simulates the 
Greenland and Antarctic ice sheets, in order to estimate 
the future rate of ice loss that will raise the level of the 
earth’s oceans. 


c 

















2512 


c 


Coupled Climate and Earth System Models 


Introduction 

Numerical model simulation of the atmosphere has 
a long history that goes back over 60 years. The first 
integrations were done on the ENIAC machine at the 
Advanced Study Institute in Princeton by 1950 [1]. It 
took another 10 years for this to develop into weather 
forecasts that used models that had vertical structure 
and could be initialized using atmospheric observa¬ 
tions. The first numerical ocean models were developed 
in the mid 1960s by Kirk Bryan at the Geophysical Fluid 
Dynamics Laboratory (GFDL) in Princeton [2], which 
used simplified sector geometry for the ocean basins. 
The first coupled atmosphere/ocean model was devel¬ 
oped at GFDL when the global atmosphere model of 
Syukuro Manabe was coupled to Bryans ocean model, 
and the results were published in 1969 [3]. However, 
the first real coupled climate model that had realistic 
geometry for the ocean basins and very elementary 
components for the land and sea ice was developed 
over the first half of the 1970s. The first results were 
published in two landmark papers by Manabe, Bryan, 
and coworkers in 1975 [4, 5]. The horizontal grid¬ 
spacing of this model was 5° x 5°, and there were 
nine vertical levels in the atmosphere component, and 
five levels in the ocean component. Even this coarse 
resolution was sufficient that the climate model ran 
slowly on GFDL’s supercomputer of the early 1970s. 
Other meteorological and weather centers in several 
countries followed the GFDL lead and produced simi¬ 
lar climate models of their own over the 1980s. As 
supercomputers became faster and larger, so the four 
components became more sophisticated, and the reso¬ 
lution of climate models improved. 

However, there was a serious problem with all cli¬ 
mate models when trying to obtain a control run for 
the present day climate. All the components would be 
initialized using the best set of observations available. It 
is most important to initialize the ocean component 
because it has by far the largest heat capacity, and its 
evolution is governed by much longer time scales than 
the other components. The problem was that, as the 
control run continued in time, the ocean and sea ice 
solutions would drift away from the realistic initial 
conditions. The drift was fast enough that rather 
quickly the model climate became significantly differ¬ 
ent than that of the present day earth. 


The cause of this problem was diagnosed as follows. 
When the atmosphere and ocean components were run 
in standalone mode with the other component replaced 
by a data component that provides observations, the 
fluxes of heat and fresh water at the air-sea interface 
can be calculated. These fluxes from the atmosphere 
and ocean were very different, so that they were incom¬ 
patible when coupled together. The problem was over¬ 
come by a very unphysical fix called flux correction [6] . 
The diagnosed heat and fresh water fluxes from atmo¬ 
sphere and ocean stand alone runs were differenced, 
and this difference was added to the fluxes exchanged 
between the atmosphere and ocean every time step of 
the coupled run. This enabled a climate model to 
maintain a non-drifting solution in a present day con¬ 
trol run. However, it disguised the fact that the climate 
model components needed further development work 
to improve the simulations and make their fluxes of 
heat and fresh water across the air-sea interface com¬ 
patible with each other. Use of flux corrections in 
climate models remained the standard method of run¬ 
ning until the late 1990s. 

The first model that could maintain the present day 
climate in a control run without the use of flux correc¬ 
tions was the first version of the Community Climate 
System Model (CCSM) developed at the National Cen¬ 
ter for Atmospheric Research (NCAR). A 300 year 
present day control simulation that showed virtually 
zero drift was run during the second half of 1996 and 
documented in 1998 [7]. The reason for this success 
was further refinement of the atmosphere, and espe¬ 
cially the ocean [8] components, so that the surface 
heat and fresh water fluxes produced by the two com¬ 
ponents were compatible. Quite quickly, the climate 
centers in Australia and the UK implemented two of 
the new ocean parameterization improvements from 
the CCSM and were also able to run their models 
without flux corrections [9, 10]. Now, a large majority 
of climate models run without flux corrections, 
although some of the coarser resolution models still 
use this technique. Coarse horizontal resolution now 
means a grid-spacing of about 3° x 3°, whereas many 
climate models currently use about 1° x 1° grid¬ 
spacing, or slightly higher, for their standard runs. 

The number of climate models maintained around 
the world has steadily increased over the last decade, so 
that results from 18 different models were submitted to 
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the 4th Assessment Report of the Intergovernmental 
Panel on Climate Change (IPCC), which was published 
in February 2007. This 4th Assessment Report [11] was 
the joint recipient of the 2007 Nobel Peace Prize. 

Earth System Models 

All ESMs contain components that enable the carbon 
cycle in the land, ocean, and atmosphere to be 
predicted, rather than being passive in simulations of 
the earth’s climate, for the following reason. Only about 
half the carbon dioxide (C0 2 ) emitted into the atmo¬ 
sphere over the past 150 years has stayed in the 
atmosphere; the other half has been taken up by the 
land and oceans in about equal measure. Climate 
models need past and future concentrations of C0 2 
and other greenhouse gases in order to simulate the 
past and future climates. For future climate projec¬ 
tions, it is currently assumed that the land and oceans 
sinks will continue to be as effective as in the past in 
taking up C0 2 , so that future atmosphere concentra¬ 
tions will be based on about half of the future emissions 
staying in the atmosphere. However, there are real 
concerns that in the future, the ocean especially will 
not be able to take up the same fraction of C0 2 emis¬ 
sions because it is becoming warmer and more satu¬ 
rated with C0 2 [ 12] . Whether the land will continue to 
take up the same fraction of C0 2 is also not obvious 
and strongly depends on future land use practices. Over 
the last 30 years, deforestation of tropical forests has 
rapidly increased, which results in less C0 2 taken up by 
the land and more emitted into the atmosphere if the 
wood is burnt. This is now the cause of a substantial 
fraction of the recent increase in atmospheric C0 2 
concentration. In contrast, there has been reforestation 
at some locations in the northern hemisphere mid¬ 
latitudes, such as the eastern part of the USA. Rather 
than assuming how much of the emitted C0 2 stays in 
the atmosphere, this fraction is predicted by an ESM 
with a carbon cycle. Thus, if the model predicts that the 
ocean will take up less C0 2 in the future, then a larger 
fraction will stay in the atmosphere to act as 
a greenhouse gas. This is a positive feedback in the 
climate system that is in ESMs, but not in climate 
models. Interactive carbon cycles have been put into 
a number of climate models around the world, and 
there has been an intercomparison project that 


compares their results [ 13] . The strength of the positive 
feedback from the carbon cycle is quite different in 
these various models, so the strength of this positive 
feedback is presently quite uncertain and needs to be 
constrained better. 

There is some evidence that the stratospheric circu¬ 
lation can affect phenomena such as the Arctic and 
Antarctic Oscillations [14, 15] and will be important 
in how quickly the observed “ozone hole” in the south¬ 
ern hemisphere stratosphere will recover over the first 
half of the twenty-first century. If these processes are to 
be included in an ESM, then the atmosphere compo¬ 
nent needs to include all of the stratosphere, which is 
located above the troposphere. The region usually 
modeled by the atmosphere component of a climate 
model is the troposphere and just the lower part of the 
stratosphere. How important these processes are to the 
future trajectory of climate change has not been fully 
evaluated at present. In addition, an atmospheric 
chemistry component may be important to model the 
future levels of atmospheric aerosols. These are impor¬ 
tant in reflecting incoming solar radiation and in the 
formation of clouds, which are extremely important in 
the radiation budget of the atmosphere. A chemistry 
component is also needed if an ESM is to evaluate 
future levels of natural and man-made pollution in 
the very large cities of the future. 

Another component of the earth system that has 
recently taken on more importance is the role of the 
Greenland and Antarctic ice sheets. There is growing 
evidence that the Greenland ice sheet has lost mass 
more quickly in the first decade of the twenty-first 
century than previously [16, 17], and there are changes 
in how quickly it is moving [18]. There are also obser¬ 
vations of accelerations in Antarctic glaciers, especially 
after small ice shelves have collapsed [19, 20]. This has 
two important effects. The first is that the fresh water 
input to the ocean from these ice sheets increases the 
mean sea level [21], although it is important to note 
that this increase is not uniform over the ocean. The 
second is that fresh water input from the Greenland ice 
sheet can possibly cause a future weakening of the so- 
called thermohaline circulation in the North Atlantic 
Ocean [22, 23]. This circulation carries a lot of heat 
northward and certainly affects the climate of Western 
Europe, and is discussed in more detail in the next 
section. These possible future effects are not included 
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in climate models at present. A new ice sheet compo¬ 
nent to evaluate these future climate change possibili¬ 
ties will be a vital component of ESMs over the next few 
years. 

Climate Model Simulations 

One or Two Active Components 

As a climate model is being built and assembled, the 
first type of simulation that is performed and analyzed 
is runs using either one or two of the components in 
active mode, with the other components being replaced 
by simple data components that provide observed time 
series of the required fields. The best known of this type 
of run is when the atmosphere and land components 
are active, and the ocean and sea ice are replaced by 
observations of sea and sea ice surface temperature. 
When the observations are over the period 1960- 
2005, this is called an AMIP run, which is named after 
the Atmosphere Model Intercomparison Project, which 
first formalized this type of run. Results from AMIP 
runs made with the atmosphere and land components 
of many different climate models have been compared 
in this type of intercomparison for 20 years or more 
[24]. These comparisons have given, and continue to 
give, insight into the validity of the parameterizations 
used to simulate the many important processes in the 
atmosphere component of different climate models. 

Scientists developing the land component of 
a climate model use these AMIP runs to validate their 
component. However, in order to isolate parameteri¬ 
zations in the land component, they frequently make 
simulations with just the land component active. In 
this type of run, the land is forced by a time series of 
observations from 1960 to 2005 of all the surface atmo¬ 
sphere variables that are required to force the land 
component. 

This same time series of surface atmosphere vari¬ 
ables, but over the oceans, is very frequently used to 
force the ocean component of climate models run in 
standalone mode. This type of run is done to validate 
the ocean component because the ocean observations 
available for comparison are mainly from the period 
1960-2005. One of the difficulties in setting up this 
type of run is how to force the ocean under sea ice. 
The interaction between the ocean and sea ice is 
very important, especially when ice is being formed. 


Sea ice is formed with a salinity of about 5 parts per 
1000 from sea water with a salinity of about 35 parts per 
1000. Therefore, this process rejects brine into the 
surface water, and at cold temperatures, the ocean 
salinity is more important than temperature in deter¬ 
mining its density. Thus, sea ice formation produces 
very dense surface water, and when this is denser than 
the water below, the water column overturns down to 
a depth of 2 km or more, resulting in what is called 
“deep water formation.” This only occurs in winter in 
a very few locations in the world oceans. Off Antarctica, 
it occurs in the Weddell and Ross Seas, producing 
Antarctic Bottom Water, which is the densest water 
mass in the oceans. It also occurs in the North Atlantic 
Ocean in the Greenland-Iceland-Norwegian Seas 
north of Iceland and in the Labrador Sea between 
Canada and western Greenland. This forms North 
Atlantic Deep Water, which flows south at 2-3 km 
depth, and is the return flow of the North Atlantic 
thermohaline circulation. This overturning circulation 
is often called the “Conveyor Belt,” following Broecker 
[25] , and a schematic is shown in Fig. 2. The deep water 
formation regions in the North Atlantic and off Ant¬ 
arctica are the sinking branches of the Conveyor Belt. 
The dense water near the bottom of the ocean very 
slowly makes its way into the Indian and Pacific 
Oceans, and then slowly rises toward the surface in all 
the oceans. It has been estimated from ocean observa¬ 
tions and models that deep water formed near Antarc¬ 
tica, which then goes into the Pacific Ocean, will take 
between 800 and 1,000 years before it returns to the 
ocean surface. It is also interesting to note that deep 
water formation does not occur in the North Pacific 
Ocean. The main reason is that the salinity there is 
much less than in the North Atlantic, and the surface 
water is never dense enough to overturn. 

In order to overcome the difficulty of how to force 
the ocean under ice, the ocean and sea ice components 
are sometimes run together in active mode, forced by 
the time series of atmospheric surface observations. 
Often the scientists developing the sea ice component 
wish to isolate that component, and make stand alone 
sea ice runs forced by atmospheric observations, and 
allowing the sea ice to interact with a much simpler 
ocean component called a slab ocean. A slab ocean 
component only models the upper mixed layer near 
the ocean surface. This is needed because there are no 
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Schematic of the thermohaline circulation, or the "Conveyor Belt," after Broecker [25] 
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observations of the surface ocean under ice, so that 
a slab ocean component is used which exchanges heat 
and salt with the sea ice above. As stated earlier, given 
this very large variety of ways required to run the 
climate model components, it becomes obvious why 
the setup using a coupler shown in Fig. 1 is extremely 
useful. 

Fully Coupled Simulations 

The first fully coupled simulation performed with 
a new version of a climate model is a present day 
control run. The model is given the year 2000 values 
of C0 2 and other greenhouse gases, the observed levels 
of natural and man-made aerosols, and the level of 
solar radiation. As discussed in the Introduction, the 
first requirement of the model is that the drift in this 
control run is small, so that the model does not drift 
very far from the present day initial conditions. Once 
that is established by a run of at least 100 years, then the 
simulation is continued for a longer period, sometimes 
for as long as 1,000 years, and carefully examined for its 


variability. There is variability on all time scales, such as 
the diurnal cycle, seasonal variability, the annual cycle, 
interannual variability, for example, the El Nino- 
Southern Oscillation (ENSO), and decadal variability. 
There is also plenty of data for comparison, see the next 
section. However, this control run assumes that the 
climate forcings are fixed, and the earth’s present day 
climate is in a statistical equilibrium, which means that 
the climate is not in a truly steady equilibrium state, but 
has variability on all time scales around a steady state 
climate. This is clearly not the case in 2000, as the levels 
of C0 2 and other greenhouse gases have been increas¬ 
ing substantially over the twentieth century. 

The last time the earth’s climate was essentially in 
a statistically steady state was before mankind had 
started making large changes to the planet. This date 
can be argued over because man’s changes to how land 
was used and trees felled changed the earth’s climate 
somewhat. However, the date is usually taken to be 
before the atmospheric C0 2 level had increased signif¬ 
icantly over the level at the Industrial Revolution. In 
simulations to be submitted to the 5th IPCC 
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Assessment Report, this date has been chosen to be 
1850. Therefore, most climate models will run another 
control for 1850 conditions, forced by the C0 2 , aerosol, 
and solar values of that year. A very desirable outcome 
of this control run is that the simulated climate system 
does not lose or gain heat and fresh water over the 
duration of the control run. In practice, it is extremely 
difficult to balance these budgets precisely to zero, 
especially for heat, and all climate models lose or gain 
some heat from the ocean during any control run. 
However, in modern climate models this drift is very 
small, and is not a substantial problem. The real prob¬ 
lem is that we do not have observations of the climate 
system in 1850 to compare with the model results. For 
example, we do not know the extent or thickness of 
Arctic and Antarctic sea ice in 1850. 

The real purpose of an 1850 control run is to pro¬ 
vide initial conditions for runs that simulate the period 
from 1850 to 2005, which are often called twentieth 
century runs. Time series over this period of four 
quantities are needed to force this type of run. They 
are the atmospheric concentrations of C0 2 and other 
greenhouse gases, the levels of natural and man-made 
aerosols, the level of solar output, and the level of 
aerosols in the atmosphere from volcanic eruptions. 
The last quantity is determined from the observed 
levels of aerosols from recent eruptions, such as El 
Chichon in 1982 and Pinatubo in 1991, and then scaled 
by the size of significant eruptions earlier in the 1850- 
2005 period. Very often an ensemble of these twentieth 
century simulations is run, where the initial conditions 
are taken from different times in the 1850 control run. 
If a climate model is to be useful, then its twentieth 
century runs must reproduce well many of the observed 
changes in the earth’s climate over the last 150 years. 
Most of the comparisons with observations will use the 
last 50 years of these runs, which is when virtually all of 
the observations were made. 

Note that for ESMs, which have an active carbon 
cycle, the twentieth century runs will be forced by C0 2 
emissions, rather than atmospheric concentrations. 
A severe test for twentieth century ESM simulations 
will be to reproduce the time history of atmospheric 
C0 2 concentration over the time period 1850-2005. 
The reason is that to accomplish this, the ocean and 
land components of the ESM will have to take up the 
correct fraction of C0 2 emitted into the atmosphere. 


This nicely illustrates the fact that as a climate model or 
ESM becomes more complicated with more compo¬ 
nents, then it is required to perform at a higher level. 
The reason is that very important quantities, such as 
the atmospheric C0 2 concentration, are now being 
predicted by the model, instead of being prescribed 
from observations. 

The ensemble of twentieth century runs will then be 
continued to make projections of future climate 
changes over the rest of the twenty-first century. In 
order to make a future climate projection, time series 
of two quantities are required: the atmospheric C0 2 
concentrations (for climate models) or emissions (for 
ESMs) and other greenhouse gases, and the levels of 
natural and man-made aerosols. In these projections, 
the solar output is kept constant at its 2005 level, and 
only a background level of volcanic aerosols is used to 
account for future small volcanic eruptions. In all cli¬ 
mate models, the magnitude of future climate change 
depends crucially on the concentrations of C0 2 and 
other greenhouse gases in the future, and to a smaller 
extent on the future levels of man-made aerosols, 
which are expected to keep reducing, as they have 
done over the last 30 years. For the 4th IPCC Assess¬ 
ment Report, three scenarios for the future concentra¬ 
tions of C0 2 were used, which all had C0 2 levels 
strongly increasing until 2100. For the 5th Assessment 
Report, scenarios will be used where the C0 2 concen¬ 
trations increase at a much slower rate during the 
second half of the twenty-first century because it has 
been assumed that emissions will be much reduced 
over that period. 

The second crucial factor that determines the mag¬ 
nitude of a model’s future climate change over the 
twenty-first century is its climate sensitivity. Equilib¬ 
rium climate sensitivity (ECS) is defined as the increase 
in the globally averaged surface temperature that 
results from a doubling of C0 2 in the atmosphere 
component when it is coupled to a slab ocean model. 
This setup of a climate model only takes about 30 years 
to come into equilibrium, whereas the full depth ocean 
component takes about 3,000 years. However, it has 
recently been shown [26] that the ECS using a full 
depth ocean is not very different than that obtained 
using a slab ocean model. Transient climate sensitivity 
(TCS) is defined as the increase in globally averaged 
surface temperature that occurs when C0 2 has doubled 
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after 70 years of a transient simulation where C0 2 
concentration increases at the rate of 1% per year. In 
general, a model with a small (large) ECS will also have 
a small (large) TCS, but the relationship is not one-to- 
one because models differ in the rate of heat uptake 
into the ocean and the timescales of other feedbacks. It 
is interesting to note that the ECS of every climate 
model ever developed has been positive, which is 
a very strong indication that the equilibrium climate 
is warmer when there is an increased concentration of 
atmospheric C0 2 . Almost all models used in the IPCC 
4th Assessment Report have an ECS in the range of 
2°C-4.5°C. Despite dramatic improvements in climate 
models over the last 20 years, this range of ECS is the 
same as in the IPCC 1st Assessment Report [27]. It can 
be viewed as a disappointment that the range of ECS in 
climate models has not been reduced over this time 
period, but it reflects the fact that climate models still 
have to parameterize several important processes that 
affect climate sensitivity, the most important of which 
is clouds. The earth’s climate sensitivity has also been 
estimated using observations [28] , but this estimate has 
also not reduced the possible spread in its value. This 
brings up the subject of how climate models are 
validated. 

Model Validation 

The atmosphere component is the easiest to validate 
because there is a whole host of observations to com¬ 
pare its results against. These include observations 
taken by instruments, including satellites, and the so- 
called atmospheric reanalyses, which use a numerical 
model to assimilate many different observations to 
provide a time history of the state of the global atmo¬ 
sphere. These observations and reanalyses are com¬ 
pared with the results from AMIP simulations, which 
are described in the previous section. AMIP runs use 
a time history of observed sea surface temperature 
(SST), which is a relatively accurately observed quan¬ 
tity, especially since the start of the satellite era. There 
are a large number of variables that can be compared, 
which include temperature, winds, pressure, cloud 
amount, precipitation, shortwave solar radiation, and 
long-wave radiation emitted by the earth. These quan¬ 
tities can also be compared on many timescales from 
diurnal, seasonal, annual to interannual variability. 


In general, most of these comparisons are quite good, 
with cloud amount and precipitation being two of the 
more difficult variables for the atmosphere component 
to simulate well. As an example, Fig. 3a shows the mean 
annual cycle of precipitation from an AMIP simulation 
using the CCSM4 atmosphere component compared to 
long-term observations made at the Southern Great 
Plains site in Oklahoma. The difference between the 
model and observations is plotted in Fig. 3b, and shows 
that the model has too little precipitation during the 
fall and winter, but has too much precipitation in the 
late summer. Overall, the comparison is reasonable 
because the annual mean values from the model and 
observations are quite close. Literally hundreds of such 
comparisons can be made, but what is a lot more 
difficult is how to synthesize and interpret the compar¬ 
ison results in order to produce better parameteriza- 
tions for the clouds and precipitation in the 
atmosphere component. 

It is a different story for the ocean component 
because there are far fewer observations to compare 
to ocean alone simulations. There is a compendium 
of temperature and salinity observations at prescribed 
depths [29] that can be used to compare to average 
conditions in the late twentieth century. In the best 
observed oceans, these observations can be split into 
the four seasons, so that the annual cycle in the upper 
ocean can be verified. It should be pointed out that 
satellites can only measure surface ocean quantities, so 
that their observations do not give information about 
the ocean vertical structure, unlike the atmosphere. 
However, there are direct observations in a few regions 
of the ocean, such as the upper, tropical Pacific Ocean, 
which can be used to make comparisons. Figure 4 
shows the zonal current along the equator in the 
upper 400 m of the Pacific Ocean from an ocean 
alone simulation of the CCSM4 and observations 
[30]. It shows that the component does quite a good 
job in reproducing the westward surface current, and 
the very strong eastward equatorial undercurrent, 
which is one of the fastest ocean currents with 
a maximum speed of about 100 cm/s. The model sim¬ 
ulation depends on the atmosphere winds used to force 
it, as well as some of the model parameterizations, and 
it is frequently difficult to decide whether a poor com¬ 
parison with observations is the result of poor forcing 
fields or a problem with the model parameterizations. 
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Coupled Climate and Earth System Models. Figure 3 

Mean annual cycle of precipitation from an AMIP run of the 
Southern Great Plains of the USA 
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CCSM4 atmosphere component and observations in the 


The situation is worse for sea ice because there are 
even fewer observations. Sea ice extent and concentra¬ 
tion were not well known until they began to be 
observed from satellites in 1979. Sea ice thickness is 
still not well observed, although the general spatial 
patterns are known from accumulating point measure¬ 
ments over the years. However, there are many pro¬ 
cesses that affect sea ice, such as ridging, the formation 
of polynas, and melt ponds, and how snow aging affects 
the albedo that have to be parameterized, although 
there are few observations of them. There are also not 
too many measurements to compare with the variables 
in the land component, although more than for sea ice. 
The measurements of quantities such as soil tempera¬ 
ture and moisture, albedo, and the leaf area amount 
have to be taken in areas with natural vegetation, as well 
as in man-made areas such as croplands. Again, there 
has been a large increase in observations over the past 
20 years or so during the satellite era, and from land 
based observations at several specific sites. 

As mentioned in the previous section, there are 
difficulties comparing both present day and 1850 con¬ 
trol simulations with observations because the present 
climate is not in equilibrium, and there are not many 
observations from 1850. The only quantity from that 


time that can be estimated directly from observations is 
the globally averaged surface temperature. Also, the 
global SST pattern from 1850 to the present has been 
estimated in the HadlSST dataset [31] by determining 
the principle variation patterns from the period when 
SST has been well measured and using these patterns to 
produce global data in the early part of the period when 
there were only a few measurements. However, the best 
simulations to compare with observations are the 
ensemble of twentieth century runs from 1850 to 2005. 

There are a very large number of variables that can 
be compared to observations from the second half of 
the twentieth century, but some of the most important 
are large-scale patterns of interannual variability, such 
as ENSO and the North Atlantic Oscillation [32]. 
ENSO is the largest interannual signal in the earth’s 
climate and much about it has been learned from 
observations over the last 25 years. The variable that 
is most often used to characterize ENSO is called the 
nino3 SST, which is the SST averaged over the area 
90°W-150°W, 5°S-5°N in the central Pacific Ocean. 
Figure 5 shows the nino3 monthly SST anomalies, and 
a wavelet analysis, which is a method to plot the time 
variation of the amplitude of the anomalies as 
a function of the frequency content. The three smaller 
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Coupled Climate and Earth System Models. Figure 4 

Comparison of zonal velocity along the equator in the upper 
and the observations in [30] 

boxes show the power spectrum (variance against 
period in years), the autocorrelation against lag time 
in months, and the annual cycle of the variance 
amplitude. 

Figure 5 shows that the amplitude of nino3 SST 
anomalies in the CCSM4 is a little smaller than in the 
HadlSST observations, especially the warm events 
which have a maximum amplitude of just over 3°C in 


> acific Ocean between the ocean component of the CCSM4 

the data, but are only 2.5°C in the model. This means 
that the wavelet and power spectrum are also a little 
weak in the CCSM4. However, the amplitude of nino3 
SST anomalies from earlier periods of the twentieth 
century run is larger than the HadlSST data, which 
shows there is strong decadal variability in the 
CCSM4 ENSO amplitude. The CCSM4 power spec¬ 
trum peaks at a period of 3-4 years compared to 
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Coupled Climate and Earth System Models. Figure 5 

(Continued) 


4-5 years in the data, the autocorrelation compares 
quite well, and the annual cycle of variance is quite 
good with a minimum in May compared to April in 
the HadiSST data. This good comparison is indepen¬ 
dent of the period of the run examined and is a very 
important improvement over all the previous versions 
of the CCSM, which had ENSO spectra that had 
a dominant peak at 2 years. This improvement was 
due to two changes made to the convection parameter¬ 
ization scheme in the atmosphere component [33] . The 
CCSM was one of many climate models that had a poor 
ENSO simulation for a long time [34], which was not 
a good situation given that ENSO is the largest 
inter annual signal in the earth’s climate. 

Probably the only well-measured variable that can 
be compared to the model over the whole period of 


a twentieth century run is the globally averaged surface 
temperature. Figure 6 shows this comparison over 
1890-2000 between the HadiSST data and an ensemble 
of twentieth century runs using the CCSM version 3. 
The red line is the mean value from the ensemble, and 
the shading indicates the standard deviation across the 
eight member ensemble. This comparison is not per¬ 
fect, but the data is not too often outside the shading. 
The model was then integrated forward to make an 
ensemble of projections for the twenty-first century 
[35] that were submitted to the IPCC 4th Assessment 
Report. 

Another quantity that has been given a lot more 
attention in recent years is the sea ice extent in the 
Arctic Ocean. In order to give realistic projections of 
the future state of Arctic sea ice, a climate model must 
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Coupled Climate and Earth System Models. Figure 5 

Nino3 monthly SST anomalies between 1966 and 2005, wavelet power, power spectrum, autocorrelation, and variance 
from (a) HadlSST observations, and (b) a twentieth century simulation using the CCSM4 


simulate it well at the end of the twentieth century. 
Arctic sea ice has a minimum extent in September, and 
this has been well measured by satellite since 1979. 
Figure 7 shows the observed September Arctic sea ice 
extent from observations and the latest two versions of 
the CCSM. For the CCSM3, the twentieth century run 
forced by observed concentrations of C0 2 ends in 2000, 
and the model then used a scenario for the future levels 
of C0 2 . The projected decline in the CCSM3 ice extent 
between 2000 and 2009 is not quite as large as has been 
observed. However, the actual rise in C0 2 concentra¬ 
tion in the earth’s atmosphere in the decade since 2000 
has been somewhat larger than in the forcing scenario 
used in the CCSM3 projection shown in Fig. 7. It is 
important to remember that results from future 


projections strongly depend on the forcing scenario 
used. This same projection suggests that the Arctic 
Ocean will become virtually ice free in September by 
2040 [36], but again the actual year when this might 
occur will depend on the concentrations of C0 2 and 
other greenhouse gases over the next 30 years. The 
result from the CCSM4 shown in Fig. 7 is just from a 
twentieth century run, which goes to the beginning of 
2005. Again the comparison is good, and the CCSM4 
will be used to make future projections for the IPCC 
5th Assessment Report. 

Other comparisons to validate the ability of a climate 
model to simulate the historical evolution of the earth’s 
climate can be made, but the observations are probably 
not as accurate as for surface temperature and sea ice 
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Global surface temperature anomalies 
from 1890-1919 average 



Coupled Climate and Earth System Models. Figure 6 

Globally averaged surface temperature anomalies from HadlSST data plotted against the results of an ensemble of 
twentieth century runs using the CCSM3 



Coupled Climate and Earth System Models. Figure 7 

Arctic September sea ice extent from observations, CCSM3, and CCSM4 
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extent. Two examples are ocean heat content and the 
distribution of chlorofluorocarbon-11 (CFC-11) in 
the ocean. The time history of CFC-11 concentration 
in the atmosphere is well known, so this can be used as 
an input to the ocean component during a twentieth 
century simulation. Observations of both ocean heat 
content and CFC-11 are sparse in both time and space, 
but estimates of their changes can be made and com¬ 
pared to model results [37]. This comparison helps to 
determine whether the ocean component is taking up 
quantities at the correct rate. It is very important in an 
ESM that the ocean takes up the correct fraction of the 
C0 2 that is emitted into the atmosphere. It is more 
difficult to make comparisons of changes in the land 
component because the largest changes in land use over 
the twentieth century are man-made and not changes 
in the natural vegetation. Changes in how land has 
been, and might be, used are often imposed in the 
land component during twentieth and twenty-first cen¬ 
tury simulations, which allows an assessment of how 
these changes have affected the past climate and might 
affect future climate changes [38]. On the global scale, 
these changes are much smaller than changes due to 
increases in C0 2 and other greenhouse gases, but they 
can be important in affecting the climate locally. 

Climate Forecasts 

First, the difference between a forecast and a projection 
needs to be explained. When a weather forecast is 
made, there are two separate factors that determine 
the quality of the forecast. The first is the quality of 
the atmosphere model used; all models are not perfect, 
but some are better than others. However, just as 
important is the quality of the analysis of the current 
state of the atmosphere that is used as the initial con¬ 
dition for the forecast. Even if the model were perfect, if 
the initial condition is slightly incorrect, then the 
model forecast and the real evolution of the atmo¬ 
sphere will diverge. The reason is that both the real 
atmosphere and the forecast model are examples of 
a chaotic system. What defines a chaotic system of 
equations is that, if they are integrated forward from 
two very slightly different initial conditions, then the 
two future solutions will diverge from each other, often 
quite quickly. Chaos theory was founded by a famous 
meteorologist, Edward Lorenz, who published a classic 


paper in 1963 [39]. He made drastic approximations to 
the equations that represent the atmosphere to produce 
a set of three, quite simple ordinary differential equa¬ 
tions. When he integrated them forward in time from 
two slightly different initial states, the solutions 
diverged, which is the characteristic of what came to 
be called a chaotic system. In practice, it is very difficult 
to separate these two sources of error in a weather 
forecast because the model is also used to create the 
initial conditions on the model grid using all the latest 
meteorological observations from around the world. In 
order to reduce the likelihood of a bad weather forecast, 
an ensemble of forecasts is made using a set of slightly 
different initial conditions. If all the ensemble forecasts 
predict that something will occur, then it is forecast 
with a high probability; whereas if the ensemble fore¬ 
casts differ markedly, then it is forecast with a low 
probability. 

Virtually all long climate model simulations of the 
future done so far are projections, not forecasts. The 
reason is that most twenty-first century runs are just 
a continuation of a twentieth century run of the model, 
and no attempt is made to initialize the climate model 
to the observed climate in 2005, or whatever year the 
twenty-first century run starts. All results submitted to 
the IPCC 4th Assessment Report were from future 
climate projections. If something, such as a large 
ENSO event or a sudden reduction in the September 
extent of Arctic sea ice for example, occurs in 2015 in 
a projection, then this is not a forecast that it will 
actually happen in 2015, but a strong indication that 
this type of event might very well occur in the years 
around 2015. The case of ENSO events is interesting 
and instructive because ENSO forecasts up to a year in 
advance are now regularly made by a number of centers 
around the world using climate models [40]. 

For a weather forecast using an atmosphere model, 
it is important to start with the correct initial state of 
the atmosphere. However, for a seasonal or ENSO 
forecast, a full climate model must be used because 
the land, ocean, and sea ice evolve on these time scales. 
For these forecasts, therefore, initial conditions for the 
climate model are needed, and the most important 
component to initialize correctly is the ocean because 
it has the slowest time scales and by far the largest heat 
capacity. For an ENSO forecast, what is needed is the 
correct thermal state of the upper 300-400 m of the 
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tropical Pacific Ocean between about 15° north and 
south. ENSO forecasts could not become a reality until 
there was an observing system in the tropical Pacific to 
continuously measure and report upper ocean temper¬ 
atures [41]. An analysis is performed on these observa¬ 
tions to produce a temperature field on the model grid, 
and this is used as the ocean component initial condi¬ 
tion. As the forward integration starts, the tropical 
atmospheric circulation comes into balance with the 
sea surface temperature field in about a week, which is 
why it is not necessary to initialize the atmosphere 
component. As always, an ensemble of ENSO forecasts 
is made by slightly changing the initial conditions used 
in the ocean component. For an ENSO forecast, it is 
important to initialize correctly the upper tropical 
Pacific Ocean, but for a climate forecast over 
a decade, there are many more aspects of the climate 
model that need to be initialized correctly: the ocean 
deeper than the upper 400 m, especially in the North 
Atlantic Ocean where the thermohaline circulation 
occurs, the sea ice distribution in both the Arctic and 
Antarctic, and some aspects of the land component, 
such as where the soil moisture content is above or 
below normal. We do not know precisely all the quan¬ 
tities that need to be initialized correctly, but we are 
absolutely certain that there are not adequate observa¬ 
tions of all these quantities. 

Forecasts of climate changes over the next decade 
on a regional basis are what would be most helpful in 
planning for the future. A few preliminary decadal 
forecasts have been made by centers in the UK and 
Germany [42, 43], and many centers will submit 
a suite of decadal forecasts to the IPCC 5th Assessment 
Report. However, the science of decadal climate fore¬ 
casts is in its infancy [44], and there is a very large 
amount of research to be done before they will become 
reliable. Decadal forecasts are now where weather fore¬ 
casts were 50 years ago, but they have another disad¬ 
vantage. Weather forecasts are made and verified every 
day, so that there is a very large number of realizations 
that can be used to make improvements. By their very 
nature, decadal forecasts are only verified after 10 years, 
so that the number of opportunities to compare model 
predictions to observations is reduced enormously. 
However, the outlook for decadal forecasts has 
improved over the last few years. First, there is now 
an ocean observing system called ARGO floats (named 


after the mythical Greek ship used by Jason and the 
Argonauts to seek the Golden Fleece), that since about 
2003 has been giving nearly global coverage of temper¬ 
ature and salinity down to a depth of 2 km, which has 
improved enormously our ability to correctly initialize 
the ocean component [45]. However, no decadal fore¬ 
cast initialized using ARGO data, which can start at the 
beginning of 2005 at the earliest, has yet had enough 
time to be verified. There are also satellite observations 
of Arctic and Antarctic sea ice extent and the soil 
moisture content over the continents, which could 
potentially be used in the initialization. Second, as the 
computing capacity continues to increase, then the 
resolution of climate models used for predictions will 
continue to improve, which will enable decadal fore¬ 
casts to be more accurate on the regional scales that are 
required for future planning. 

Future Directions 

The computational power available to climate modelers 
will continue to increase in the future, so how should it 
be used? Should it be used to increase the resolution of 
present day climate models or used to increase the range 
of components in ESMs? This is an extremely difficult 
question to answer definitively. Increased resolution will 
undoubtedly improve some aspects of climate model 
simulations, but omitting an additional component 
may well leave out feedbacks that are potentially impor¬ 
tant. The answer will almost certainly be to push for¬ 
ward in both directions because scientists with different 
interests will lead the work in the two different direc¬ 
tions. Another possibility is to increase the ensemble 
size used in future projections and predictions, which 
will give more reliability to simulated changes in 
extreme events [46], for example, which is a very 
important factor in planning for the future. 

As mentioned above, clouds have to be parameter¬ 
ized in the atmosphere, and the way this is done can 
change the ECS of a climate model. Clouds also have to 
be parameterized in weather forecast models, but are 
often done so in a different way because the weather 
forecast model is run at a much higher resolution than 
the climate model. Over the last few years, there has 
been the suggestion, called Seamless Prediction, that 
the same atmosphere component should be used in 
both weather and climate prediction. In this situation, 
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the cloud parameterization used would have to work 
well across all the scales involved in both weather and 
climate predictions. This is not as easy as it sounds 
because both groups have developed their own param- 
eterizations over past years, which make rather differ¬ 
ent assumptions. There have even been suggestions that 
both models should use extremely fine resolution on 
the order of 1 km, so that clouds can be resolved rather 
than parameterized, but running climate models at this 
resolution is still many years away. Seamless Prediction 
is a long-term goal, but it will probably not be realized 
over the next few years. 

Another example of a phenomenon that is not 
resolved in present day climate models is mesoscale 
eddies in the ocean. These are the equivalent of atmo¬ 
spheric highs and lows, but occur at a range of scales 
from 200 to 300 km near the equator to 20-30 km in 
the very high latitude oceans. Only the equatorial 
eddies are partially resolved if the ocean component 
has a grid-spacing of about 1°. So, the effect of these 
energetic eddies on the large-scale mean flow has to be 
parameterized in present day climate models. However, 
it has been shown that a majority of these eddies can be 
resolved when using a grid-spacing of 1/10° in the 
ocean component [47, 48]. Diagnosis of these simula¬ 
tions has shown that the eddy parameterization used in 
the 1° simulations works quite well, but still the ques¬ 
tion remains: will future climate change projections in 
models that resolve the mesoscale eddies give very 
similar answers to future projections where they are 
parameterized? The answer to this important question 
should be found in the next few years because some 
climate change runs with resolved eddies are now pos¬ 
sible with the available computer time. 

Examples of new components that are currently 
being incorporated into ESMs have been discussed 
earlier, and include chemistry-air quality, hydrological, 
dynamic vegetation, and crop model components. The 
new component to simulate the Greenland and 
Antarctic ice caps is a very nice example of an impor¬ 
tant new component. However, there is a long list of 
possible feedbacks that have not been included in any 
ESM so far. Good examples are the increased release of 
methane, which is a very potent greenhouse gas, from 
Arctic tundra as the Arctic region warms [49], the 
possible release of methane from ocean clathrates 
[50, 51], and the possible fast breakup of the West 


Antarctic Ice Sheet [52]. All these are examples of 
possible abrupt climate changes that could result in 
large future changes that would have very far reaching 
consequences. However, all are very difficult to simu¬ 
late accurately in an ESM and to assess quantitatively 
the possibilities that they will occur. 

The science of decadal forecasts will also be 
advanced in the near future, both by new ideas and 
experience in how they should be initialized, and by 
increasing the resolution of the components used that 
will give more regional information. As explained in 
the previous section, there is a lot to learn and much 
experience needs to be gained before decadal forecasts 
become reliable. However, they will produce the most 
useful kind of information that is required by people 
planning for the future. 

Finally, what motivates the scientists working to 
develop climate models and ESMs? First, it is a very 
stimulating intellectual challenge to understand what 
controls the earth’s past, present and future climates, 
and to build an ESM that gives a faithful representation 
of this. This requires the expertise of many scientists 
across a large and diverse set of sciences ranging from 
several earth sciences to computer science. It is a real 
challenge to make these models run correctly and effi¬ 
ciently on several of today’s massively parallel super¬ 
computers. I also know from experience, that 
managing an ESM project is very challenging because 
it is such a diverse scientific enterprise. A second moti¬ 
vating factor is also very important to many scientists 
working on ESMs. It is that they believe these models 
are the best means we have available to anticipate 
possible future changes to the earth’s climate, and that 
their results should be made freely and widely available 
to anyone who wants to see them. 
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Glossary 

Association mapping Association mapping is a high- 
resolution method for mapping quantitative trait 
loci (QTLs) or gene(s) for traits of interest based on 
linkage disequilibrium (LD) and holds great prom¬ 
ise for the dissection of complex genetic traits. 

Back cross (BC) Back cross is a cross of the F : with 
either of the parental genotype and the resultant 
progeny is called BQ. The progeny of the cross 
between BQ and the recurrent parent is called as 
BC 2 . 

Gene pyramiding Gene pyramiding is a process of 
accumulating the favorable genes/alleles from dif¬ 
ferent genotypes into an elite/commercial cultivar. 
Gene pyramiding is often performed through 
marker-assisted selection (MAS). 

Genome-wide selection or genomic selection (GS) 
Genome-wide selection or genomic selection is 
a concept for accelerating genetic gain especially 
for complex traits in elite genotypes by utilizing 
genomic information and estimating their breeding 
values in breeding strategies. GS is becoming very 
popular over marker-assisted selection that was 
focused on few individual genes or few QTLs to 
improve genotypes, especially when recent 
advances in genomic technologies have drastically 
reduced the cost on marker genotyping. 

Genomics-assisted breeding (GAB) Genomics- 
assisted breeding is a holistic approach, where 
genomics technologies including molecular 
markers, trasncriptomics, metabolomics, proteo- 
mics, bioinformatics, and phenomics are integrated 
with conventional breeding strategies for breeding 
crop plants resistant/tolerant to biotic and abiotic 
stresses or improved for quality and yield. 

Haplotype Haplotype is a set of alleles of closely linked 
loci on a chromosome that tend to be inherited 
together. 


Linkage disequilibrium (LD) Linkage disequilibrium 
is a nonrandom association of alleles at different 
loci, describing the condition with non-equal 
(increased or reduced) frequency of the haplotypes 
in a population at random combination of alleles at 
different loci. LD is not the same as linkage, 
although tight linkage may generate high levels of 
LD between alleles. 

Marker-assisted selection (MAS) Marker-assisted 
selection is a process of indirect selection for 
improving the traits of interest by employing mor¬ 
phological, biochemical, or DNA-based markers. 
DNA-based markers/molecular markers, in the 
recent past, were proven to be the markers of choice 
for MAS. 

Narrow genetic base Narrow genetic base does fre¬ 
quently exists in modern crop cultivars or breeding 
lines due to the continuous use of small number of 
elite genotypes in breeding programs. In fact, it is 
a serious obstacle to sustain and improve crop 
productivity due to rapid vulnerability of geneti¬ 
cally uniform cultivars to emerging biotic and 
abiotic stresses. 

Next-generation sequencing (NGS) technologies 

Next-generation sequencing (NGS) technologies 
include various novel sequencing technologies for 
example 454/FLX (Roche Inc.), ABI SOLiD 
(Applied Biosystems), Solexa (Illumina Inc.), etc., 
that have surpassed traditional Sanger sequencing 
in through-put and in cost-effectiveness for gener¬ 
ating large-scale sequence data. 

Polygenes Polygenes are a group of non-allelic genes, 
each having a small quantitative effect, that together 
produce a wide range of phenotypic variation. 

Quantitative trait loci (QTLs) Quantitative trait loci 
are the loci or regions in the genome that contribute 
towards conferring tolerance to abiotic stresses 
(e.g., drought, salinity) or resistance to biotic 
stresses (e.g., fungal, bacterial, viral diseases) or 
improving agronomic traits (e.g., yield, quality) 
which are generally controlled by polygenes and 
greatly depend on gene x environmental (G x E) 
interactions. 

Sustainable agriculture Sustainable agriculture refers 
to efficient agricultural production while 
maintaining the environment, farm profitability, 
and prosperity of farming communities. 
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Sustainable development Sustainable development is 
defined as balancing the fulfillment of human needs 
with the protection of the environment so that 
these needs can be met not only at the present 
time, but also in the future. 

Definition of the Subject 

There has been significant improvement in production 
and productivity of important cereal crops globally as 
a consequence of the “Green Revolution” and other 
initiatives [1]. However, today the stage has reached 
that the available traditional methods of crop improve¬ 
ment are not sufficient to provide enough and staple 
food grains to the constantly growing world population 
[2]. This situation is projected to be worse by the year 
2050, especially in context of climate change [3], In 
other words, the conventional plant breeding practices 
may not be able to achieve the sustainability in today’s 
agriculture. 

It is under such circumstances that advances in 
plant genomics research are opening up a new era in 
plant breeding, where the linkage of genes to specific 
traits will lead to more efficient and predictable breed¬ 
ing programs in future. Several initiatives have been 
started towards use of genomics technologies in num¬ 
ber of crop plants to ensure the sustainable production 
of healthy and safe crops and the results are encourag¬ 
ing. It is therefore expected that the genomics will be 
the integral part of the agricultural/plant breeding 
practices in future for improving crop productivity 
leading to achieve food security and sustainable 
production. 

Introduction and Importance of Sustainable 
Agriculture 

The goal of agricultural science is to increase crop 
productivity coupled with the quality of the products, 
and maintain the environment [1]. Food security is 
a growing concern worldwide, and more than 1 billion 
people are estimated to lack sufficient dietary energy 
availability [2] . The issue of “food security” has become 
so important that prominent scientific journals, 
including Science, have also published a special issue 
on this subject recently (February 12, 2010 issue). With 
the current rate of growth, the global population is 
likely to plateau at some 9 billion people by roughly 


the middle of this century [3]. With this ever- 
increasing human population and amidst the fear of 
shrinking resources in terms of cultivable area, irriga¬ 
tion resources, newly emerging insect pests, stagnated 
yields, etc., it has become difficult to maintain agricul¬ 
tural sustainability. In order to make today’s agricul¬ 
ture sustainable, it is necessary that plant breeders 
adopt innovative technologies that can increase 
the efficiency of selection with more precision [4]. 
Under such circumstances, molecular approaches 
including modern genomics and genetic engineering 
technologies have emerged as powerful tools to assure 
rapid and precise selection for the trait(s) of interest. 
Maintaining effective and environmentally friendly 
agricultural practices is a necessary prerequisite for 
maintaining sustainability. 

Plant genomics is a rapidly developing field, which 
is radically improving our understanding of plant biol¬ 
ogy by making available novel tools for the improve¬ 
ment of plant properties relevant to sustainable 
agricultural production. Recent advances in high- 
throughput genomics technologies, including that of 
next-generation sequencing and high-throughput 
genotyping, have helped immensely in understanding 
the functions and regulation of genes in crop plants [5] . 
The ever-increasing availability of genome sequences in 
crop plants have facilitated greatly the development of 
genomic resources that will allow us to address biolog¬ 
ical functions and a number of basic processes relevant 
to crop production leading to sustainable agriculture. 

One of the myths linked to sustainable agriculture 
means going back to past techniques/farm practices, 
which were followed by our ancestors. In fact, sustain¬ 
able agriculture can be achieved by combining some of 
the wisdom of past practices with careful use of current 
technology, including the vast array of information 
technologies now available. Sustainable agriculture is 
a key element of sustainable development and is essen¬ 
tial to the future well-being of the human race and the 
planet. A compelling need exists for restorative and 
sustainable agriculture to help address the pressing 
trends of population, climate, energy, water, soil, and 
food. Sustainable agriculture needs to be economically 
viable, environmentally sound, and socially acceptable. 
In other words, it is a system of agricultural production 
that, over the long term, will: (1) satisfy human food, 
feed, and fiber needs; (2) enhance the environmental 
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quality and the natural resource base upon which the 
agricultural economy depends; (3) make the most effi¬ 
cient use of available technologies, nonrenewable 
resources, and on-farm resources, and integrate, 
where appropriate, natural biological cycles and con¬ 
trols; (4) sustain the economic viability of farm opera¬ 
tions; and (5) enhance the quality of life for farmers 
and society as a whole. 

There are various components of sustainable agri¬ 
culture, which include technological interventions, and 
environmental and socio-economic factors. As the fac¬ 
tors related to socio-economics and environments have 
been discussed in a number of reviews earlier, in this 
article, we focus on the interventions of plant genomics 
technologies in crop breeding. 

Contribution of Plant Genomics Technologies 
to Crop Breeding 

Plant genomics technologies have contributed 
immensely in today’s agriculture which has led to 
better understanding of how plants function, and 
how they respond to the environment. This has also 
helped in achieving targeted objectives in breeding pro¬ 
grams to improve the performance and productivity of 
crops. The DNA-based molecular markers have facili¬ 
tated smarter and knowledge-based breeding, by 
enabling early-generation selection for key traits, thus 
reducing the need for extensive field selection. Besides 
this, the molecular tools can effectively be used for the 
characterization, conservation, and use of genetic 
resources. 

Recent advances made in the area of molecular 
biology and bioinformatics offer substantial opportu¬ 
nities for enhancing the effectiveness of classical plant 
breeding programs. These tools can be integrated into 
breeding work in order to analyze efficiently high num¬ 
bers of crosses at the early seedling stage. This approach 
is known as “genomics-assisted breeding” [6] . Through 
this approach, both the phenotype and the genotype of 
new varieties can be analyzed and the performance of 
new specific introgressed traits can be predicted. The 
goals of the integration of these technologies in classical 
breeding are to create genotype-to-phenotype trait 
knowledge for breeding objectives and to use this 
knowledge in product development and deployment 
for the resource poor farmer. 


For successful utilization of genomics-assisted 
breeding approach in a crop, availability of basic 
molecular tools, such as molecular markers, genetic 
maps, etc., is a prerequisite. Among molecular markers, 
though a variety of molecular markers such as restric¬ 
tion fragment length polymorphism (RFLP), randomly 
amplified polymorphic DNA (RAPD), microsatellite 
or simple sequence repeat (SSR), amplified fragment 
length polymorphism (AFLP), and single nucleotide 
polymorphism (SNP) markers have been developed 
in a range of crops, SSR and SNP markers have emerged 
as the markers of choice [7, 8]. Because of advent of 
NGS technologies [5] and high-throughput 
genotyping platforms, SNP marker system and array- 
based genotyping platforms are becoming more popu¬ 
lar [9, 10]. An overview on availability of genomic 
resources in some selected important crop species is 
shown in Table 1. It is evident that cereal crops, espe¬ 
cially rice, maize, wheat, barley, etc., are on top in terms 
of availability of genomic resources (see [11]). Genome 
sequences have already become available for several 
crop species, including rice (http://rgp.dna.affrc.go.jp/ 
IRGSP/), sorghum [12], and maize (http://gbrowse. 
maizegdb.org/cgi-bin/gbrowse/maize/). Recent invest¬ 
ments coupled with advances in genomics technologies 
have contributed towards developing a good resource 
of genomics tools in legumes as well [13, 14]. 

Some Modem Breeding Approaches 

The availability of genomic resources in almost all 
important crops combined with information on 
pedigrees as well as optimized methods of precise 
phenotyping make it possible to undertake genomics- 
assisted breeding approaches for crop improvement. 
In fact, some molecular breeding approaches like 
advanced backcross QTL (AB-QTL) analysis and 
marker-assisted selection (MAS) have been successfully 
employed in several crops, leading to improved culti- 
vars, some other approaches such as marker-assisted 
recurrent selection (MARS) or genomics selection (GS) 
are being used in several crops [15, 16]. 

Marker-Assisted Selection (MAS) 

There are three major steps involved in MAS: (1) identi¬ 
fication of molecular marker(s) associated with trait(s) 
of interest to breeders, (2) validation of identified 
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cereals and legumes 


Crop plant 

Molecular markers 
(SSRs and SNPs) 

Molecular maps (Genetic/QTL map/ 
comparative/physical maps) 

Transcript data and 
expression profiling 

Genome 

sequence 

data 

Rice 

++++ a 

++++ b ' c 

++++ d 

++++ e 

Maize 

++++ f 

++++ b,9,h 

++++ d 

++++ 1 

Wheat 

+++ j 

+++b ,k,i 

+++ d 

++ m 

Sorghum 

+++ n 

++++ b '° 

+++ d 

++++ p 

Barley 

+++ q 

+++ r 

+++ d ' s 

+++ i 

Soybean 

++++ u ' v 

+++ w,x,y 

+++ z 

++++ aa 

Groundnut 

++ bb 

+ cc 

+ z 


Cowpea 

+++ bb 

++ cc 

+ z 


Common 

bean 

++ bb 

++ cc 

+ z 


Chickpea 

+++ bb 

++ cc 

+ z 


Pigeonpea 

+++ bb 





+, Very few; ++, Few; +++, Moderate; ++++, Abundant 

a http://www.gramene.org/markers/index.html 

b http://www.gramene.org/cmap/ 

c http://www.gramene.org/db/qtl/qtl_display?query=&search_field=&species=Oryza+sativa&submit=Submit 

d http://www.ncbi.nlm.nih.gov/dbEST/dbEST_summary.html; http://compbio.dfci.harvard.edu/tgi/plant.html 

e http://www.g ramene.org/Oryza_sativa/lnfo/lndex 

f http://www.maizegdb.org/probe.php 

9 http://www.maizegdb.org/map.php 

h http://www.gramene.org/db/qtl/qtl_display?query=*&search_field=trait_name&species=Zea+mays+subsp.+mays&submit=Submit 

'http://www.maizesequence.org/Zea_mays/lnfo/lndex 

j http://wheat.pw.usda.gov/cgi-bin/graingenes/browse.cgi?class=marker 

k http://wheat.pw.usda.gov/GG2/maps.shtml#wheat 

'http://wheat.pw.usda.gov/cgi-bin/graingenes/quickquery.cgi?query=qtls&arg1=* 

m http://wheat.pw.usda.gov/cgi-bin/graingenes/search.cgi?class=sequence 

n http://www.gramene.org/db/markers/marker_view? 

marker_name=*&marker_type_id=&taxonomy=sorghum&action=marker_search&x=0&y=0 
O http://www.gramene.org/db/cmap/map_set_info?species_acc=sorghum&map_type_acc=-1 
p Paterson et al. [12] 

q http://wheat.pw.usda.gov/GG2/Barley/ 

r http://wheat.pw.usda.gov/GG2/maps.shtml#barley 

s http://wheat.pw.usda.gov/cgi-bin/graingenes/browse.cgi?class=sequence&query=barley1_* 

t http://www.public.iastate.edu/~imagefpc/IBSC%20Webpage/IBSC%20Template-home.html 

u http://soybeanbreederstoolbox.org/ 

v http://soybase.org/BARCSOYSSR/index.php 

w http://lis.comparative-legumes.org/cgi-bin/cmap/viewer?changeMenu=1 

x http://soybeanbreederstoolbox.org/search/search_results.php?category=QTLName&search_term= 

y http://soybeanphysicalmap.org/ 

z http://lis.comparative4egumes.org/lis/lis_summary.html?page_type=transcript 
aa http://www.phytozome.net/cgi-bin/gbrowse/soybean/?name=Gm09 
bb See Varshney et al. [13] 

cc See Varshney et al. [14] marker assisted recurrent selection (MARS) or genomic selection (GS) are being used in several crops [15, 16]. 
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marker(s) in the genetic background of the targeted 
genotypes to be improved, and (3) marker-assisted 
backcrossing (MABC) to transfer the QTL/gene from 
the donor genotype into the targeted genotype. In 
context of marker-trait association, linkage mapping 
has been extensively used for identifying the markers 
associated with a trait of interest in a range of crops 
including cereals, legumes, horticultural crops, 
etc. These studies have been reviewed in detail in sev¬ 
eral reviews [14, 17] and books [18]. Although hun¬ 
dreds of studies have been undertaken, only a few studies 
were taken further to marker validation and MABC. 
This may be attributed to (1) identification of few 
markers associated with small-effect QTLs, (2) non¬ 
validation of markers in elite genotypes, and (3) slow 
adoption of markers by breeders in their breeding pro¬ 
grams. Recent advances in association genetics, however, 
offer opportunities to overcome the first two constraints. 

Association mapping (AM) is considered an alter¬ 
native strategy to linkage mapping for identifying 
marker-trait associations and has been used extensively 
in human and animal systems. AM has a number of 
advantages over linkage mapping, including the poten¬ 
tial for increased QTL resolution and an increased sam¬ 
pling of molecular variation (for reviews see [19, 20]). 
AM involves studying a natural population rather than 
the offspring of crosses, and associations in natural 
populations are typically on a much finer scale because 
they reflect historical recombination events. Several 
examples on marker-trait association using AM are 
available [21]; however, there is a need for optimization 
of more advanced analytical tools in the area of associ¬ 
ation genetics [22]. It is anticipated that because of 
reduction in costs on marker genotyping [10], AM 
will be extensively used for trait mapping in the future. 

Once the markers associated with a trait of interest 
are identified through linkage mapping or AM, the next 
step is to use these markers in the breeding programs. 
In this context, the selection of one or a few genes 
(QTLs) through molecular markers using backcrossing 
is a very efficient technique [23, 24]. Important advan¬ 
tages of MAS are that it can be effectively utilized for 
traits with low heritability; for gene pyramiding, selec¬ 
tion can be made at seedling stage; and, above all, there 
are no issues involving GE crops [25]. Although use of 
markers in breeding programs through MABC is 
a common practice in the private sector [26], MAS is 


in routine use in wheat and barley breeding programs 
in Australia [8, 27-29] and USA (www.maswheat. 
ucdavis.edu; http://barleycap.cfans.umn.edu/). Never¬ 
theless, there are several success stories in many crops 
including wheat, rice, barley, maize, soybean, etc., where 
MAS has successfully been utilized to develop superior 
lines/varieties/hybrids for improving quality, resistance to 
diseases or tolerance to abiotic stresses. For example, 
Gupta et al. [17] has recently summarized success stories 
of molecular breeding in wheat. 

A widely discussed success story of molecular 
breeding is the introgression of the FR13A Subl locus 
conferring resistance against submergence in an Asian 
rice cultivar, Swarna [30, 31] that can confer tolerance 
up to 2 weeks of complete submergence. This has 
offered great relief to the large number of Asian farmers 
whose rice land is located in deltas and low-lying areas 
that are at risk from flooding during the monsoon 
season every year. Some selected examples of molecular 
breeding in rice and wheat (adopted from [17]) are 
summarized in Table 2. 

Advanced Backcross QTL (AB-QTL) Analysis 

Although MAS has been quite successful, it has always 
been a difficult task to tackle linkage drag, especially 
when a QTL or a gene is to be introgressed from wild/ 
exotic species. Furthermore, in MAS, QTL/gene discov¬ 
ery and variety development are two separate processes. 
To deal with this problem and to harness the potential of 
the wild/unadapted germplasm in breeding programs, 
a new approach referred as advanced backcross QTL 
(AB-QTL) analysis was proposed by Tanksley and Nel¬ 
son [32]. AB-QTL aims at simultaneous detection and 
transfer of useful QTLs from the wild/unadapted rela¬ 
tives to a popular cultivar for improvement of a trait. In 
this context, a superior cultivar/variety is crossed with 
a wild species leading to the production of a backcross 
population (BC 2 , BC 3 ), and molecular markers are used 
to monitor the transfer of QTLs by conventional 
backcrossing. The advanced backcross approach has 
already been successfully utilized in different crops, 
including tomato [59], rice [60, 61], barley [62], and 
wheat [63] . It is anticipated that the use of AB-QTL will 
be accelerated in a range of crops for improving impor¬ 
tant traits such as disease resistance as well as yield 
traits. 
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Crop Breeding for Sustainable Agriculture, Genomics Interventions in. Table 2 Some examples of improved cultivars 
or varieties developed through marker-assisted introgression of important genes/QTLs in rice and wheat 


Crop 

Genes/QTL 

introgressed 

Function 

Variety 

developed/ 

released 

Reference 

Rice 

GBSS 

Unique cooking and processing 
quality traits including amylose 
content 

Cadet and 
Jacinto 

[33] 


Xa33t 

Bacterial blight resistance 

BC 3 F 2 

[34] 


Xa21 

Bacterial blight resistance 

Zhongyou 6 
and Zhongyou 
1176 

[35] 


Subl 

Submergence tolerance 

BC 3 F 2 

[30] 


Sub1 

Submergence tolerance 

Samba 

Mashuri-Sub7 

IR64-Sub7 

TDK1-Sub 7 

CR1009-Sub 7 

BR11-Sub 7 

[36] 


SUB1QTL 

Submergence tolerance 

Subl 

introgression 

lines 

[37] 


Piz-5+Xa2 1 

Blast and bacterial blight 
resistance 

bc 4 f 2 

[38] 


XaA+xaS and 
Xa4+Xa7 

Bacterial blight resistance 

Angke and 
Conde 

[39] 


xq7 and Xa21 

Bacterial blight resistance 

Zhenshan97 x 
Minghui 63 

[40] 


xa13 and 
xa21 

Bacterial blight resistance, strong 
aroma 

Pusa1460, 

IET18990 

[41] 


xa13 and 

Xa21 

Bacterial blight resistance 

Improved Pusa 
RH1 

[42] 


xa5,xal3 and 
Xa21 

Bacterial blight resistance 

Improved 

PR106 

[43] 


Xa5,xa13 and 
xa21 

Bacterial blight resistance 

BC 3 F 2 

[44] 


XaS, Xa'\3 
and Xa21 

Bacterial blight resistance 

IET19046 

http://www.drricar.org/four_varieites.htm 


Xa4, xa8, 
xa13 and 

Xa21 

Bacterial blight resistance 

bc^ 

[45] 


Xa4, XaS, 

XaU and 

Xa2 1 

Improved bacterial blight 
resistance 

Pusa 1526-04- 

25 

http://www.iari.res.in/?q = node/233 
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Crop 

Genes/QTL 

introgressed 

Function 

Variety 

developed/ 

released 

Reference 


- 

Bacterial blight resistance 

Xieyou218 

[46] 


QTL 

Drought-tolerant aerobic rice 

MAS946-1 

www.hindu.com/2007/11 /17/stories/ 

2007111752560500.htm 


q SALTOL and 
qSUBl 

Enhanced salt and submergence 
tolerance 

F 6 

http://open.irri.org/sabrao/images/ 

stories/conference/site/papers/ 

apb09final00098.pdf 


QTL 

Improved performance under 
drought 

Birsa 

VikasDhanl 11 
(PY 84) 

http://claria13.securesites.net/News/ 
releases/2009/may/26018.htm 

Wheat 

QPhs.ccsu- 
3A.1 and Lr24 
+Lr28 

Preharvest sprouting tolerance 
and leaf rust resistance 

bc 3 f 3 

[47] 


Lr47 

Resistance to leaf rust 

BIOINTA2004 

[48] 


Gpc-B\ 

High grain protein content 

Lillian 

[49] 


Qfhs.ndsu- 

3AS 

Resistance to fusarium head blight 

Bena 

[50] 


Sml 

Resistance to the insect orange 
blossom wheat midge 

Goodeve 

[51] 


Stb4 

Resistance to Septoria 

Kern 

Cited from [17] 


Wsm-1 

Resistance to wheat streak mosaic 
virus (WSMV) 

Mace 

[52] 


Yr15 

Seedling stripe rust 

BC 3 F 2:3 

[53] 


Qss.msub-3BL 

Resistance to wheat stem sawfly 

McNeal, 

Reeder, Hank 

http://www.wheatworld.org/pdf/ 

dubcovsky.pdf 


Bdv2 

Resistance to yellow dwarf virus 

Above, 

Avalanche, 

Ankor 

http://www.wheatworld.org/pdf/ 

dubcovsky.pdf 


Yr17 and Lr37 

Stripe rust and leaf rust resistance 

Patwin 

[54] 


CreX and CreY 

Cyst nematode resistance 

F 3 progenies 

[55] 


Yr36 and Gpc- 
B-\ 

Resistant to stripe rust and high 
grain protein content 

Westmore 

[56] 


Yr17 and Yr36 

Resistance to stripe rust 

Lassik 

Cited from [17] 


Lr19 and Sr25 

Resistant to stem rust race UG99 

UC1113 

(PI638741) 

[57] 


Yr36 and Gpc- 
B1 

Resistance to stripe rust, high grain 
protein content 

Farnum 

(WA7975) 

h tt p://www.a rs-g r i n .g ov/cg i- b i n/n pg s/ 
acc/display.pl?1671746 


Yrl5 and Gpc- 
B 1 

Resistance to stripe rust and high 
grain protein content 

Scarlet 

(WA7994) 

http://css.wsu.edu/Proceedings/2005/ 

2005_Proceedings.pdf 


Lrl Lr9, Lr24, 
Lr47 

Leaf rust resistance 

BQF, 

[58] 
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Marker-Assisted Recurrent Selection (MARS) 

In the majority of traits of interest, quantitative varia¬ 
tion is controlled by many QTLs, each with minor 
effect. Moreover, minor QTLs show an inconsistent 
QTL effect in different environments and over different 
seasons. Even when the effect of these minor QTLs is 
consistent, their introgression into the desired geno¬ 
type through MABC becomes extremely difficult as 
a larger number of progenies are required to select 
appropriate lines. In such cases, MARS has been pro¬ 
posed for pyramiding of superior alleles at different 
loci/QTLs in a single genotype [64, 65]. 

It was demonstrated in recent studies that the 
response of MARS is larger when prior knowledge of 
the QTLs exists and the response decreases as the knowl¬ 
edge of the number of minor QTL associated with the 
trait decreases [66] . In sweet corn, MARS was employed 
to fix six marker loci in two different F 2 populations 
which showed an increase in the frequency of marker 
allele from 0.50 to 0.80 [64]. Similarly, in a separate 
study, enrichment of rust resistance gene ( Lr34/Yrl8 ) 
with an increase in frequency from 0.25 to 0.60 was 
reported in wheat BCi through MARS [28]. MARS can 
be utilized effectively for selection of traits associated 
with multiple QTLs by increasing the frequency of 
favorable QTLs or marker alleles. Several companies 
are using MARS in their maize, soybean, etc., breeding 
programs [66, 67]. Recently, some institutes like Interna¬ 
tional Crops Research Institute for the Semi-Arid Tropics 
(ICRISAT), the French Centre for International Agricul¬ 
tural Research (CIRAD), and the University of California- 
Riverside, USA, have also initiated MARS programs in 
chickpea (PM Gaur, personal communication 2010), sor¬ 
ghum (J-F Rami, personal communication 2010), cowpea 
(J Ehlers, personal communication 2010), etc., for 
pyramiding favorable drought-tolerant alleles. 

Genome-Wide or Genomic Selection (GS) 

Although MAS has been practiced for the improve¬ 
ment of quantitative traits, it has its own limitations. 
Therefore, in addition to MARS, Genomic Selection 
can be used to pyramid favorable alleles for minor 
effect QTLs at the whole genome level [68, 69]. Geno¬ 
mic selection predicts the breeding values of lines in 
a population by analyzing their phenotypes and high- 
density marker scores. A key to the success of GS is that, 


unlike MABC or MARS, it calculates the marker effects 
across the entire genome that explains the entire pheno¬ 
typic variation. In simple terms, genome-wide selection 
refers to marker-based selection without significance 
testing and without identifying of a subset of markers 
associated with the trait [68] . The genome-wide marker 
data (marker loci or haplotypes) available or generated 
on the progeny lines, therefore, are used to calculate 
genomic estimated breeding values (GEBV) as the sum 
of the effects of all QTLs across the genome, thereby 
potentially exploiting all the genetic variance for a trait 
[68, 69]. The GEBVs are calculated for every individual 
of the progeny based on genotyping data using a model 
that was “trained” from the individuals of another 
training populations having both phenotyping and 
genotyping data. These GEBVs are then used to select 
the progeny lines for advancement in the breeding cycle. 
Thus GS provides a strategy for selection of an individ¬ 
ual without phenotypic data by using a model to predict 
the individuaFs breeding value [69]. 

Recently, Wong and Bernardo [70] simulated the 
comparative responses of phenotypic selection (PS), 
MARS, and GS with small population sizes in oil 
palm, and assessed the efficiency of each method in 
terms of years and cost per unit gain (i.e., the time and 
cost saved by these different methods over each other for 
making selection). They used markers significantly 
associated with the trait to calculate the marker scores 
in MARS, whereas all markers (without significance 
tests) to calculate the marker scores in GS. Responses 
to PS and GS were consistently greater than the response 
to MARS. Furthermore, with population sizes of N = 50 
or 70, responses to GS were 4-25% larger than the 
corresponding responses to PS, depending on the heri- 
tability and number of QTLs. In terms of economics, 
cost per unit gain was 26-57% lower with GS than with 
PS when markers cost US $1.50 per data point, and 
35-65% lower when markers cost $0.15 per data point. 
Reduction in costs in sequencing and high-throughput 
marker genotyping may enhance uptake of GS for crop 
improvement in the future. 

Challenges in Adoption of Genomics Technologies 

Developing sustainable approaches to agriculture is 
one of the most difficult challenges facing growers 
and scientists today. Agricultural sustainability involves 
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successful management of resources for agriculture to 
satisfy changing human needs, while maintaining or 
enhancing the quality of the environment and conserv¬ 
ing natural resources [71]. However, sustainable pro¬ 
duction is hampered by the decline in land and soil 
productivity as a result of inappropriate soil and water 
management and other agricultural practices, as well as 
misguided policies and frequent opposition to techno¬ 
logical advances that have the potential to improve the 
quality of life of billions of people worldwide. This is in 
addition to the postulated challenges of climate change, 
the number of hazardous chemicals (pesticides and 
fungicides) that are constantly being released into the 
environment and are becoming increasingly toxic to 
human and animal life [71]. In recent years, the use of 
promising biotechnology tools like genetic engineering 
(GE) has offered potential solutions to the above prob¬ 
lems. However, the adoption of any new technique, 
particularly related to genetic engineering, remains 
a policy matter and as mentioned above faces stiff 
opposition many a times. In a recent review, Farre 
et al. [25] addressed several of these issues and advo¬ 
cated to overcome on the major barriers to adoption, 
which are political rather than technical, for realization 
of the potential of GE crops in developing countries. 

It is thus obvious that the challenges facing agricul¬ 
ture are massive, particularly with the controversies over 
GE crops world over. It is clear that current methods of 
food production, in both the developing as well as the 
developed countries, are neither sufficient nor sustain¬ 
able [72]. Under these circumstances, genomics inter¬ 
ventions have great role to contribute to sustainable 
agriculture. As mentioned in this article, genomics 
approaches are very powerful to predict the phenotype, 
with higher precision and efficiency, based on the geno¬ 
type. A variety of approaches ranging from MAS to GS 
are available to become integral part of plant breeding. 
While in the past, plant breeders were hesitant to use 
genetic variation existing in wild relatives of crop spe¬ 
cies in commercial breeding programs due to the long 
time it takes to recover desired phenotypes because of 
linkage drag, approaches like AB-QTL, in addition to 
MAS, can be successfully utilized. Availability of NGS 
technologies, associated with low costs and high- 
throughput, offers the opportunity to sequence either 
entire or major proportion of the germplasm collection 
for a species present in the gene banks around the word 


to understand genome variation. In case, the genome 
variation can be associated with the phenotype, which 
is not trivial, it will be possible to develop the ideotype, 
based on haplotype, of the variety to be developed. 

Future Directions 

While success stories of genomics-assisted breeding are 
available in several crops, it must also be recognized 
that much of the genome information generated is not 
being routinely used by plant breeders, especially in 
public breeding programs [26]. This may be due to 
shortage of trained personnel, inadequate access to 
genotyping, inappropriate phenotyping infrastructure, 
unaffordable bioinformatics systems, and a lack of 
experience of integrating these new technologies with 
traditional breeding [4, 26]. However, recently, several 
international initiatives such as the Integrated Breeding 
Platform (formerly Molecular Breeding Platform, 
www.mbp.genertaioncp.org), a joint initiative of The 
Bill and Melinda Gates Foundation and The Genera¬ 
tion Challenge Program, have been started so that plant 
breeders especially from developing countries can have 
access to many genotyping, phenotyping, as well as 
information technologies to integrate their breeding 
programs with modern genomics approaches. 

We believe that integration of modern genomics in 
combination with other cutting edge technologies in 
breeding programs is invaluable for crop improvement 
(Fig. 1) and will lead to sustainable agriculture for food 
security, especially in developing countries. 

A vital task facing the plant breeding community 
today is to enhance food security in an environmentally 
friendly and sustainable manner. Though genomics 
interventions will not solve all the problems associated 
with agricultural production leading to sustainability, 
they have the potential, especially when they are used 
in an integrated manner as described in Fig. 1, to 
improve the breeding efficiency to address specific prob¬ 
lems. These include increasing crop productivity, diver¬ 
sification of crops, enhancing nutritional value of food 
(biofortification), and reducing environmental impacts 
of agricultural production. However, only through judi¬ 
cious, rational, and science- and need-based exploitation 
of genetic resources through genomic technologies 
coupled with conventional plant breeding and genetic 
engineering will lead to sustainable agriculture. 
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Crop Breeding for Sustainable Agriculture, Genomics Interventions in. Figure 1 

Schematic representation of genomics technologies for crop improvement and sustainable agriculture 
In general, traditional crop improvement programs (shown in the box on the right hand side in Fig. 1) employ different 
breeding strategies integrated with physiology, pathology, entomology, etc., and generate superior lines or improved 
crop varieties. These approaches, however, take more time, and sometimes such breeding is referred as "chance breeding" 
due to uncertainty in successes predicted in these approaches. On the other hand, genomics technologies (shown in the 
box on the left hand side) such as a number of next-generation sequencing (NGS) technologies, availability of high- 
throughput genotyping such as capillary electrophoresis for large-scale SSR genotyping, microarray-based DArTs, 
GoldenGate/Infinium/BeadXpress assays for large-scale SNP genotyping, and a range of-omics technologies provide 
candidate markers, gene(s), and QTLs to be integrated into the breeding programs by using high-throughput genomics 
platforms. Integrated breeding approaches (shown in the box in the middle) such as marker-assisted selection (MAS), 
marker-assisted recurrent selection (MARS), and genome-wide selection (GS) offer "precision breeding" with a great 
potential, versus "chance breeding" to contribute to sustainable crop improvement. 
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Glossary 

Base temperature Lower temperature threshold 
below which development ceases. 

Epigenetics Genetic information other than DNA 
sequence information. 

Phenology Study of the sequence of developmental 
stages of a plant and how it relates to climate. 
Photoperiod sensitivity Requirement for a minimum 
(or maximum) day length for reproductive phase 
induction. 

Phytomer Fundamental building block of plant cano¬ 
pies. A vegetative phytomer is comprised of leaf, 
node, internode, and axillary bud. 

Phyllochron Rate of appearance of leaves on a shoot. 
Shoot apex The tip of the shoot where usually there is 
meristematic tissue producing new organs. 
Thermal time Temperature response curve used to 
estimate development rate. 

Vernalization sensitivity Requirement for a period of 
low temperatures for reproductive induction. 

Definition of the Subject 

Plant development, or the progression of plants 
through their life cycle, has been of great interest in 
human history because of the need to know and predict 
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when the harvested part of the plant was at the opti¬ 
mum stage. This knowledge was especially important 
(even vital) in medicinal plants, where the timing of 
harvesting defines the medicinal value of the product. 
This interest increased as groups moved from hunting 
and gathering to agrarian societies. 

Crop development can be defined with the number 
and rate of appearance, growth, and senescence of 
phytomers. However, that definition lacks information 
about when the switch of vegetative to reproductive 
phytomers occurs, which is defined by the phenology of 
the crop. Crop development is of great importance in 
agriculture because it is the main mechanism for plants 
to escape both biotic and abiotic stresses, and adapt to 
the environment. At a more practical level, it affects the 
management of the crop because cultural practices are 
more effective at specific stages of crop development. 

Introduction 

Major food, feed, and industrial crops were domesti¬ 
cated in a few centers of origin, including the “Fertile 
Crescent,” the Americas, and China. The wild relatives 
of modern crops were adapted to survival in the envi¬ 
ronment prevalent in the center of origin. Those origi¬ 
nal crops were locally grown in the region of origin, but 
some species showed a significant ability to adapt to new 
environments and were spread globally with human 
migrations and trade. For instance, wheat 
was domesticated in the Middle East (the “Fertile Cres¬ 
cent”) [1, 2], around 32°N and is currently cultivated 
from between 30°N and 60°N and from 27°S to 40°S. 
However, it can grow beyond those limits, in lower 
latitudes, in high altitudes, or even in the arctic circle. 
This range of environments where wheat can grow 
makes wheat one of the most plastic crops currently 
grown. It requires a large range of developmental mech¬ 
anisms to adapt to such different environments, where 
photoperiods vary from 13-14 h to nearly 20 h. One of 
the major accomplishments of the “Green Revolution” 
was the discovery of photoperiod insensitive mutants 
in wheat that could be grown at lower latitudes. 

Plasticity in development is key for the adoption 
of crops in a wide range of environments. Although 
growth and development are related, they are different 
processes. Development is the initiation and differen¬ 
tiation of organs and the progression of stages through 


which cells, organs, and plants go during their life cycle, 
whereas growth is the change in size or weight of 
the initiated organs. Biotic (e.g., genetics, weeds, and 
diseases) and abiotic (e.g., temperature, light, water, 
and nutrients) factors influence the initiation, growth, 
and senescence of plant organs. 

Since the “The Metamorphosis of Plants” by 
Johann Wolfgang von Goethe, originally published in 
1790 [3], there has been extensive research on how 
plants proceed from germination to maturity in an 
orderly and predictable manner. This research has led 
to an extensive conceptual framework of plant develop¬ 
ment resulting in many tools to predict plant develop¬ 
ment. Plant shoots develop by forming a series of nearly 
identical building blocks, called phytomers [4, 5], The 
vegetative phytomer is associated to a leaf, and 
phytomers are produced in an orderly manner on 
a shoot; for example, the phytomer of leaf 2 is formed 
after the phytomer of leaf 1. 

Phenology is the study of the plant (or animal) life 
cycle and how it is influenced by seasonal and inter¬ 
annual variations in climate. The phenology of a crop is 
defined by the sequence of stages, which, in turn, define 
phases. Identification of certain stages may require 
examination of the shoot apex. For example, after 
germination, the apical meristem produces vegetative 
structures such as leaf primordia. When temperature 
and photoperiod requirements are met, the shoot apex 
will start initiating reproductive structures (e.g., spike- 
let and floret primordia). 

The rate of appearance of phytomers and changes 
at the shoot apex are regulated by the genetics of the 
plant, the environment, and often an interaction of 
both. The main environmental drivers of plant devel¬ 
opment are temperature and photoperiod; their effects 
on phenology interact with the genetics of the plant 
responses by photoperiod sensitivity genes, vernaliza¬ 
tion genes, and earliness per se. 

This entry covers how plant development is regu¬ 
lated by temperature and photoperiod. Temperate 
cereals are emphasized because they are adapted to 
a wide array of environments and show a diverse set of 
adaptive mechanisms regulating their development. 
However, defining plant parts and how canopies are 
built as well as the sequence of events throughout the 
crop cycle is required before their regulation can be 
explained. This entry starts by defining the canopy 
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structure and the sequence of events that define the 
developmental processes of a crop, followed by the cur¬ 
rent knowledge on how temperature (including vernal¬ 
ization) and photoperiod regulate crop development. 
This entry finalizes with two important parts: modeling 
crop development and future directions. Modeling crop 
development is twofold important; first to understand 
the physiology and genetic basis of crop development, 
and second to predict when key developmental events 
are likely to happen as accurately as possible. 

Canopy Structure: Phytomers, Phyllochron, 
and Plastochron 

The phytomer is considered the basic building block 
of plant canopies and is most commonly defined as 
the leaf, node, internode above the node, and the 
axillary bud [6]. Therefore, canopy architecture and 
development is determined by the dynamic appear¬ 
ance, growth, and abortion/senescence of phytomers 
(and components of the phytomer). Phytomers origi¬ 
nate at the shoot apex with the initiation of a leaf, and 
the potential for a new shoot is formed with the pres¬ 
ence of the axillary bud. The growth and differentiation 
of each component of a phytomer will lead to their 
visual appearance. For example, the internode can con¬ 
tinue differentiating and growing, resulting in its 
appearance from the leaf sheath. Tillers may appear 
when the axillary bud differentiates and grows. 

Naming Plant Parts 

Because plant development is an orderly process, accu¬ 
rately identifying plant parts aids in describing the 
process and quantifying the developmental rate. 

Several naming systems of plant parts have been 
proposed, but most are quite similar. For example, 
true leaves can be numbered acropetally for each 
shoot with an L [7-9] (Fig. 1) beginning with the first 
foliar leaf, LI [9]. Similarly, Jewiss [7] proposed 
a system for naming tillers that has been modified 
and extended by many, but the modified system pro¬ 
posed by Klepper et al. [8, 9] is increasingly being 
adopted. This system uses the leaf axil number of the 
parent shoot to name the tiller. The first shoot to 
appear from the seed is the main stem (MS). Tillers 
appearing from the axils of leaves on the main stem are 
primary tillers and are named with a T and a digit 


corresponding to the leaf number. For example, the 
tiller appearing from the first leaf (LI) on the main 
stem is called Tl. This system is however limited by the 
potential production of tillers from the axil of leaf 10 
and above, which rarely happens. Primary tillers can 
produce tillers that are called secondary tillers. Second¬ 
ary tillers can arise from the axil of the prophyll of 
primary tillers and their second digit is a zero (Fig. 1). 

Haun [10] proposed a numerical leaf staging system 
to quantify the number of leaves appearing on the main 
stem (which can be extended to any shoot): 

Haunstage = (n — 1) + —, (o < < 1^ (1) 

where n is the number of leaves that have appeared on 
a shoot, L n _ 1 is the blade length of the penultimate leaf, 
and L n is the blade length of the youngest visible leaf 
extending from the sheath of the penultimate leaf. 
Therefore, when a shoot is identified, it can be further 
characterized using the Haun system. For example, 
the Haun stage for shoots on the plant shown in 
Fig. 1 is: MS (5.3), TO (1.5), Tl (2.4), T2 (1.4), and 
T3 (0.7). The Haun stage of a tiller with one leaf not 
fully unfolded is arbitrarily assigned as 0.1 or 0.9 when 
only the tip of the leaf is visible or most of the leaf blade 
has appeared, respectively. 

Similarly, the leaf and tiller naming scheme 
has been extended to the wheat inflorescence. 
Klepper et al. [11] defined a numerical index for the 
development of the inflorescence and Wilhelm and 
McMaster [12] extended it to uniquely identify each 
plant part. Spikelets are named with an S followed by 
the position from the base of the spike. Then, SI is the 
basal spikelet and S2 is the second spikelet from the 
peduncle (Fig. 2). Florets are designated with an F and 
numbered acropetally from the base of the rachilla. 
After fertilization, the letter F designating a floret is 
changed to a C for caryopsis. This system allows nam¬ 
ing reproductive structures in grasses with one spikelet 
per rachis node such as wheat. 

These naming systems of leaves, tillers, spikelets, 
and florets or caryopsis allow the accurate identifica¬ 
tion of each plant part. For example, the second cary¬ 
opsis on the third spikelet on the primary tiller from 
the axil of the second leaf of a wheat plant would be 
T2S3C2. 
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Naming leaves and tillers of a winter wheat plant 


The systems for naming individual plant organs can 
be easily extended to name vegetative and reproductive 
wheat phytomers [13]. A phytomer would be denoted 
with a “P” followed by an £C L ” if it is a phytomer associ¬ 
ated with a leaf or an “S,” if it is a reproductive phytomer 
and the leaf or spikelet number. The name of the shoot 
can be added to identify the tiller being described. For 
example, MS PL2 is the second phytomer on the main 
stem and T1 PS1 is the basal spikelet phytomer in the 
spike of the first tiller. 

Dynamic Appearance of Plant Organs: The 
Plastochron and Phyllochron 

The naming systems of plant parts described earlier are 
the landmarks to describe the development and struc¬ 
ture of a wheat plant. However, plant development is 


a dynamic process that follows the formation, growth, 
and senescence of phytomers and their components 
resulting in a continually changing architecture. The 
creation of the vegetative phytomer is dependent on the 
initiation of the leaf primordium. Therefore, the rate of 
leaf primordia initiation controls the timing of 
phytomer formation. The plastochron was first defined 
as the interval between the formation of two successive 
internode cells of the green alga Nitella flexilis [14], as 
cited in [15]. Milthorpe [16] and Esau [17] defined the 
plastochron as the interval between the formations of 
successive leaf primordia at the shoot apex, which is 
now commonly accepted. Similarly, the phyllochron 
was defined as the interval between appearances of 
consecutive leaves on a shoot [18] as cited in [6]. 
Wilhelm and McMaster [6] further refine the definition 
of phyllochron by defining appearance as “visible 
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Spike 


Spikelet 7 (S7) 



Crop Development Related to Temperature and Photoperiod. Figure 2 

Naming scheme for reproductive organs of spike inflorescence. Spikelet positions are denoted by the letter S and 
numbered acropetally along the rachis. Florets/caryopsis positions are denoted by the letters F/C and numbered 
acropetally along the rachilla (From Wilhelm and McMaster [12]) 


without magnification, dissection or changing leaf dis¬ 
play” The inverse of the phyllochron is termed devel¬ 
opment rate (DR), which can be generalized as the 
inverse of the time interval between two developmental 
events. The relationship between the plastochron and 
the phyllochron depends on the species. In wheat, leaf 
primordia are produced more rapidly than they appear, 
suggesting that different mechanisms are involved in 
regulating each process. Leaf primordia are initiated at 
the meristem on the shoot apex, where new cells are 
produced very quickly. After a leaf primordium is 
formed, it continues to grow in cell number and cell 
size, but this growth does not happen at the shoot apex 
meristem, rather at the intercalary meristem at the base 
of the leaf. The amount a leaf grows until it appears 
through the curl of leaves is much larger than the 
growth involved in forming a leaf primordia, therefore 
it is more dependent on the available resources (e.g., 
water, carbohydrates, and nutrients). 


Developmental Stages and Phases: Phenology 

Plants develop by the repetition of elementary building 
blocks (i.e., phytomers), whose morphological, dimen¬ 
sional, functional, and anatomical features change dur¬ 
ing ontogeny and according to several processes called 
heteroblasty, phase change, life stages, maturation, 
aging, age states, or morphogenetic progression [19]. 
In this entry, those changes will be referred as develop¬ 
mental stages (or simply stages) and the time between 
stages will be referred as developmental phases (or 
phases). It is common in the literature referring to 
these stages as “growth stages.” However, there is little 
growth in some developmental events such as anthesis. 
The phenology of a crop is the ordered succession of 
stages and phases that can have different lengths deter¬ 
mined by internal factors (e.g., the genetics of a variety 
or species) or biotic and abiotic external factors (e.g., 
diseases, temperature, light, and nutrients). 
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Plant development can broadly be divided into 
vegetative and reproductive phases that often overlap. 
The switch from vegetative to reproductive phase hap¬ 
pens at the meristem level, which stops producing 
vegetative phytomers (i.e., leaves, nodes, and inter¬ 
nodes) to start producing reproductive phytomers 
(rachis, spikelets, and florets). 

Phenological Scales 

Developmental stages occur in a consistent pattern in 
a crop each year and numerous approaches exist that 
characterize crop phenology. The most widely used 
phenological scales for temperate cereals are Feekes 
[20], Zadoks [21], Haun [10], and BBCH [22]. The 
Feekes scale is shown in Fig. 3 and described, jointly 
with the Zadoks 5 decimal code, in Table 1. The BBCH 


scale for cereals is mainly based on the Zadoks scale, 
and the Haun scale primarily describes the develop¬ 
ment of shoots until the last leaf is fully expanded. 

Phenological scales consider basic developmental 
stages like germination, emergence, tillering, stem 
elongation, heading, flowering, grain filling, and phys¬ 
iological maturity, with differences among scales pri¬ 
marily in how much detail each stage is characterized. 
Some developmental stages are not well defined, lead¬ 
ing to confusion in measuring and reporting these 
stages. For example, the beginning of stem elongation 
is usually recorded as the jointing date or when the first 
node is visible above ground; however, the first node is 
formed when the apex is underground and is only 
visible after the stem has elongated sufficiently to ele¬ 
vate the apex and the node above ground. Likewise, 
physiological maturity is defined as when the 
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Feekes developmental scale with the approximate timing of some shoot apex developmental events (From [13]) 
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Crop Development Related to Temperature and Photoperiod. Table 1 Description of the main developmental stages 
according to Feekes [20] and Zadoks [21], and suggested measurements characteristics (Modified from [23]) 



Description 


Measurement 

Stage or phase 

Feekes 

Zadoks 

characteristics 

Germination 

No stage 

Stages: 00 - dry seed, 

01 - beginning of 
imbibition, 03 - imbibition 
complete, 05 - radicle 
emerged from caryopsis, 

07 - coleoptile emerged 
from caryopsis 

Beginning of imbibition: 
Seed begins to swell 

Emergence 

Stage 1 - main shoot only 

Stage 09 - leaf at the tip of 
the coleoptile 

Beginning of emergence: 
First true leaf emerges 
through the coleoptile 
and the tip is visible 
above the soil surface 

Tillering 

Stage 2 - beginning of tillering 

Stages 21-29 - main stem 
plus 1 to 9 tillers 

Beginning of tillering: 

The first tiller is visible 

Single ridge 

No stage 

No stage 

Shoot apex shape 
changes from dome to 
more elongated and leaf 
primordia begin to form 
a ridge around the apex 

Double ridge 

No stage 

No stage 

Formation of double 
ridges around the apex. 
Bottom ridge is leaf 
primordia and top ridge 
is spikelet primordia 

Terminal spikelet 

No stage 

No stage 

Apical spikelet 
primordium appears and 
noted by a 90° rotation 
from the plane of 
previous spikelets 

Jointing 

Stage 6 - first node visible 

Implicit in stage 31 - first 
node detectable and 
change of plant habit from 
prostrate to erect 

First node visible above 
the soil surface 

Stem elongation 

Stages: 4 - change of plant 
habit from prostrate to erect, 

5 - pseudo-stem clearly erect, 

6 - first node visible, 7 - second 
node visible, 8 - last leaf 
appearance, 9 - last leaf ligule 
visible, 10 - last leaf sheath 
swelling 

Stages: 30 - pseudo-stem 
erect, 31 to 36 first to sixth 
node detectable, 39 - flag 
leaf ligule visible, 43 to 

49 - flag leaf sheath 
swelling 

Beginning when the first 
node is formed, usually 
below soil surface, and 
pushed above the soil 
surface 

Flag leaf 

Stage 9 - last leaf ligule visible 

Stage 39 - Flag leaf ligule 
visible 

Ligule of last leaf is visible 
and no new leaf is 
emerging 
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Crop Development Related to Temperature and Photoperiod. Table 1 (Continued) 



Description 


Measurement 

Stage or phase 

Feekes 

Zadoks 

characteristics 

Booting 

Stage 10 - last leaf sheath 
visible 

Stages 43-49 - flag leaf 
sheath swelling 

Begins at flag leaf and 
ends at heading 

Heading 

Stages: 10.1 - first ears just 
visible, 10.2 - !4 of heading 
completed, 10.3 - Vi of heading 
completed, 10.4 - % of heading 
completed, 10.5 - all ears out of 
the sheath 

Stages: 51 - first spikelet 
just visible, 53 - !4 of 
the inflorescence visible, 

55 - Vi of the inflorescence 
visible, 5 7 - 3 A of the 
inflorescence visible, and 

59 - inflorescence 
completely emerged 

Begins when first spikelet 
is visible and ends when 
the inflorescence is fully 
emerged 

Anthesis 

Stages: 10.5.1 - anthesis starts, 
10.5.2 - flowering completed to 
the top of the ear, 10.5.3 - 
flowering completed to the 
bottom of the ear, 10.5.4 - 
flowering over/kernel watery 
ripe 

Stages: 61 - beginning of 
anthesis, 65 - mid-anthesis, 
69 - anthesis completed 

Starts when the first 
anther is visible on an 
inflorescence and ends 
when no more anthers 
appear on the 
inflorescence 

Grain filling 

Stages: 10.5.4, 11.1 - milky ripe, 

11.2 - mealy ripe 

Stages: 71 to 77 - milk 
grain, 83-87 - dough grain 

Begins at fertilization, 
usually considered 
anthesis and ends at 
physiological maturity 

Physiological maturity 


No Stage 

When all spike 
components, internodes, 
and leaves lose green 
color 

Ripening 

Stages: 11.3 - kernel hard, 11.4 - 
ripe for cutting 

Stages 91-99 

Ripening and dormancy 


maximum dry weight is reached. In wheat, determin¬ 
ing physiological maturity is somewhat difficult 
because there is not a clear morphological change in 
the plant morphology as it happens in maize (and 
sunflower). In maize, a black layer near the base of 
the kernel appears when the maximum dry weight is 
reached. The Feekes scale defines harvest maturity as 
when the grain is difficult to divide along the crease and 
Zadoks [21] defines the 90% of ripeness of rice (Oryza 
sativa L.) when the kernel cannot be dented with the 
fingernail. These definitions likely are not precisely 
correlated with maximum seed biomass. Therefore, 
it is now commonly accepted practice to assume phys¬ 
iological maturity for temperate cereals occurs when 
all green color has disappeared from the spike. This 
definition seems reasonable as leaves and internodes 


have long since lost all green color so that no photo¬ 
synthesis occurs, and there is no report showing 
retranslocation from carbohydrate reserves to the 
grain at this time. 

Regulation of Crop Development 

Plant development is highly dependent on tempera¬ 
ture, which controls the rate of development and 
the switch from vegetative to reproductive states. 
Both high and low temperatures may have a major 
effect on plant development, especially controlling the 
switch to the reproductive state. For example, winter 
wheat requires a period of low temperatures to start 
producing reproductive structures (vernalization). The 
length of the cold period varies with genotype. 
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Besides vernalization, day length (or photoperiod) 
modifies the temperature-controlled rate of develop¬ 
ment. Photoperiod refers to the number of daylight 
hours, which changes through the seasons and with lat¬ 
itude. Photoperiod increases after the winter solstice 
(December 21 in the northern hemisphere or June 21 in 
the southern hemisphere) and decreases after the summer 
solstice. Crops and genotypes vary in their sensitivity to 
photoperiod and the minimum amount of daylight 
required to switch from vegetative to reproductive phases. 

Developmental Response to Temperature 

The relationship between temperature and phenology 
has been long recognized. Temperature is a better pre¬ 
dictor of many developmental processes than calendar 


time. Reamur [24] formalized this relationship by cre¬ 
ating the concept of heat units, now referred to as 
thermal time. The relationship between temperature 
and developmental rate is a curve with a maximum 
development rate at the optimum temperature (T 0 ) 
and developmental rate reaching zero at temperatures 
below the base temperature (T b ) or above the maxi¬ 
mum temperature (T m ; Fig. 4). This nonlinear relation¬ 
ship is shown in several studies [25, 26]. 

Thermal time has two components: (1) the average 
temperature (T a ) over some time interval (e.g., hourly, 
daily), and (2) a temperature response curve describing 
the effectiveness of T a on the development rate for the 
process (e.g., phyllochron, phenology). T a is the inte¬ 
gral of temperature over the time period of interest; 






Crop Development Related to Temperature and Photoperiod. Figure 4 

Development rate as a function of temperature. Temperature response curves: (a) linear response, (b) extended linear 
response with an upper temperature threshold, (c) bilinear model with two zero development temperatures (base 
temperature, 7" b , and maximum temperature, 7" m ) and an optimal temperature (T 0 ), and (d) trilinear model with two zero 
development temperatures (7" b and 7" m ) and a range of optimal temperatures defined by optimal lower temperature (T 0 |) 
and optimal upper temperature (T ou ) 
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however, in practice the average of the maximum 
and minimum temperatures in the time interval is 
often used: 

rp frnax T ^min /~\ 


This approximation is fairly accurate, but its error 
increases with deviations of from 12 h photoperiods 
and if sudden changes in temperature happen within 
the time interval. The time interval mostly depends on 
data availability, but common intervals range from 
daily to hourly time intervals, with daily the most 
commonly used. There are many temperature- 
response curves, which greatly diversifies the calcula¬ 
tion of thermal time ( Tt). The most simple form is 
a linear relationship with the temperature at which 
there is some plant development (Fig. 4a), i.e., the 
difference between T a and the temperature at which 
development is zero or base temperature (T b ): 

Tt = T a — T b , (Tt > 0) (3) 

where thermal time is expressed as growing degree- 
days (GGD, °C-days). Modifying Eq. 2 to include 
a maximum development rate (Fig. 4b) is useful if 
plants are grown at higher temperatures. Given that 
the relationship between temperature and develop¬ 
ment rate is not linear [25-27], further refinements to 
the accumulation of thermal time can include changing 
the development rate based on certain cardinal 
temperatures (Fig. 4c, d). These approaches assume 
development rate increases with temperatures above 
T b until an optimum temperature (T 0 , or a range of 
optimal temperatures, T 0 i to 7^ u ) is reached, and then 
decreases until development stops at a maximum 
temperature (T m ). This can be approximated by two- 
or three-segmented linear models, or curvilinear 
models such as a quadratic curve [25] or beta distribu¬ 
tion [27, 28]. 

Vernalization 

The term vernalization was first used by Lysenko in 
1928, but research on the need of a cold period for 
winter cereals to flower started as early as 1857 [29]. 
Vernalization can be viewed as an adaptive mechanism 
to avoid unfavorable periods for development (e.g., 
winter) and ensure flower development and subse¬ 
quent seed growth occurs under favorable conditions 


(e.g., spring and summer). Hence, vernalization syn¬ 
chronizes plant development with seasonal climate 
changes. 

Commonly, genotypes requiring vernalization are 
referred to as “winter” wheat (or “winter” barley) and 
are normally planted late summer or early fall with 
vernalization occurring during the late fall or early 
winter. Conversely, “spring” genotypes are commonly 
viewed as not requiring vernalization, and normally are 
planted in the spring or in regions where temperatures 
are often above effective vernalizing temperatures. This 
well-entrenched distinction between “winter” and 
“spring” genotypes does not reflect that “spring” geno¬ 
types (1) often have at least some vernalization require¬ 
ments, (2) reach flowering faster if experiencing 
vernalizing temperatures, and (3) mask the continuum 
of vernalization requirements present among all wheat 
genotypes. 

Effective vernalizing temperatures range from 0° C 
to 10° C [30] , and a few weeks of cold are usually sufficient 
to promote the switch from vegetative to reproductive 
phases and longer periods of cold temperatures 
can shorten the time to flowering until the vernalization 
response is saturated [30] . Genotypes vary in the length of 
the cold period required to saturate the vernalization 
response. For instance, time to flowering in wheat was 
reduced in response to longer cold periods [31, 32], and 
interestingly, genotypes thought to require vernalization 
to flower eventually flowered without undergoing 
vernalization treatment [32]. Quantifying responses to 
varying periods of vernalizing temperatures in calendar 
time or thermal time does not reflect the biology of the 
response. A better method to quantify the effects of 
vernalization would be by counting the number of 
leaves produced at flowering time [33]. Using this 
quantification method, vernalization reduces the time 
to flowering by reducing the number of leaves being 
produced rather than the phyllochron [31, 33]. That is, 
the number of leaves produced at flowering increases 
with shorter vernalization periods, while the 
phyllochron is not affected by the duration of the 
vernalization period. 

Genetic Regulation of Vernalization Although the 
mechanisms by which plants sense cold and initiate the 
cellular signaling to induce flowering are not known, 
the genetic regulation of vernalization is fairly well 
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known in cereals like wheat and barley, which benefited 
of the research done on the model plant Arabidopsis 
thaliana. Four major genes are involved in the expres¬ 
sion of vernalization sensitivity in wheat and barley, 
VRN-1, VRN-2 , VRN-3, and VRN-4. The first three 
genes have been cloned and identified [34-36]. How¬ 
ever, these three genes do not explain the spring habit 
of all varieties [35] , suggesting that other genetic mech¬ 
anisms may be involved. 

The VRN-1 gene encodes the MADS box transcrip¬ 
tion factor similar to APETALA-1 , which is responsible 
for meristem identity in several plants [34] . This gene is 
up-regulated by vernalizing temperatures and the 
degree at which it is up-regulated depends on the 
length of the vernalization period. However, spring 
varieties also show an up-regulation of VRN-1 during 
the initiation of the reproductive phase and remain 
high throughout the reproductive phase, suggesting 
an involvement of VRN-1 in meristem identity but 
not limited to vernalization response. 

The VRN-2 gene encodes the ZCCT1 protein, 
which shows high similarity to the CCT domain of 
the Arabidopsis CONSTANS and CONSTANS-like 
genes. VRN-2 represses flowering and is down- 
regulated by vernalization [35]. The spring allele 
vrn-2 in wheat has a point mutation at the CCT 
domain that replaces an arginine with a tryptophan. 
It has been suggested that the CCT domain may be 
involved in protein-protein interactions [37], so 
a mutation in this domain can alter these interactions. 

The VRN-3 gene is a RAF kinase inhibitor like 
protein with high homology to the Arabidopsis 
FLOWERING LOCUS T(FT ) gene [36], which induces 
flowering when expressed. FT is a flowering signal that 
moves from leaves to stem apices. 

VRN-1 , VRN-2 , and VRN-3 interact with each 
other regulating flowering under the vernalization 
pathway (Fig. 5). Before vernalization occurs, VRN-2 
is expressed and represses the expression of VRN-3. 
When plants are vernalized, VRN-1 is induced and it 
represses the expression of VRN-2 , allowing VRN-3 to 
express and promote flowering. At the same time, there 
is a feedback mechanism by which VRN-3 up-regulates 
VRN-1. It seems that VRN-1 is the primary target 
of vernalization and is essential for flowering. 

VRN-4 has been recently fine mapped and its clon¬ 
ing is underway [38]. Identifying and including this 


VRN-3 I- VRN-2 



Crop Development Related to Temperature and 
Photoperiod. Figure 5 

Model of the vernalization pathway in temperate cereals. 
Arrows mean induction (or up-regulation) of gene 
expression; for example, vernalization induces the 
expression of VRN-1. Bar-headed lines mean repression 
(or down-regulation) of gene expression; for example, 
VRN-2 represses the expression of VRN-3 

gene in the vernalization pathway may increase the 
understanding of vernalization responses in wheat. 

Photoperiod 

Photoperiod can be defined as the number of hours of 
light in a 24-h period, which changes throughout the 
season depending on the latitude. It has been long 
recognized that plants, including crops, normally 
flower only when the length of the day was favorable 
[39]. Crops sense the amount of light they receive daily 
and respond to it by accelerating or slowing their 
development. There is, however, genetic variation of 
a quantitative nature in the response to photoperiod 
within a crop, meaning that different varieties respond 
differently to changes in the photoperiod, and some 
show no response to photoperiod (or are photoperiod 
insensitive). Photoperiod sensitivity is thought to be 
the wild type phenotype because it is wide spread in 
wild barley ( Hordeum spontaneum L.) and confers 
strong adaptive features in the center of origin of barley, 
where there might be late spring frosts. Photoperiod 
insensitivity, however, brings wide geographical adapt¬ 
ability because plants do not require long days to 
flower, and, hence, they are suitable for environments 
with short seasons or latitudes where long days do not 
occur. For example, in wheat, day length neutrality, 
jointly with semi-dwarfism and rust resistance traits, 
were used to develop the high yielding varieties of the 
“green revolution” [40]. Photoperiod insensitivity 







Crop Development Related to Temperature and Photoperiod 


2551 



promotes earliness thought to be a desirable adaptive 
trait for environments closer to the equator where high 
temperatures and drought can be expected at the end of 
the season. Therefore, earliness allows wheat and barley 
varieties to escape this terminal stress. 

Although the rate of development mainly responds 
to temperature, when temperature is fixed, longer pho¬ 
toperiod alters the development rate by shortening the 
phyllochron [41, 42]. In Fig. 6, the appearance of leaves 
over time follows a linear model when plants are grown 
at a constant temperature, and the rate of appearance 
(slope of the curve) increases with the photoperiod; 
however, as day length increases, fewer leaves are 
formed per hour of light, hence there is a reduction in 
photoperiod efficiency for leaf emergence. 

Different crops respond differently to photo¬ 
period; crops can be classified as long day (LD) and 
short day (SD) depending on which photoperiod 
accelerates development (promotes flowering). 
Temperate cereals are long-day crops because flowering 
is promoted by photoperiod longer than 12-14 h, while 
maize and rice are short-day crops, meaning that short 
days induce the switch to reproductive phase. In wheat 
and barley, increased photoperiod shortens the time to 
flowering by modifying the length from emergence to 
terminal spikelet initiation. The effects of photoperiod 


on the duration from terminal spikelet initiation to 
flowering depend on what environments they are mea¬ 
sured (field versus controlled environments). This 
reduction of time to flowering is not only due to an 
accelerated phyllochron, but also by the production of 
fewer leaves. 
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Genetic Regulation of Photoperiod Sensitivity In 

temperate cereals, such as wheat and barley, sensitivity 
to photoperiod is mainly regulated by Ppd genes. 
Homologous genes have been identified in wheat and 
barley and have the function of a pseudo response 
regulator ( PRR ), most similar to the Arabidopsis PRR7 
[43]. The PPR proteins are characterized by a pseudo 
receiver domain near the amino-terminus and a CCT 
domain near the carboxy-terminus [44], which makes 
them distantly related to other CCT domains impor¬ 
tant in regulation of flowering such as CONSTANS [45] 
and VRN-2 [35]. 

The photoperiod insensitive allele ( ppd-Hl ) in bar¬ 
ley slightly delays the gene expression of CONSTANS 
( HvCOl ) that follows a circadian pattern [43] and is 
a transcriptional regulator of the FT gene [46]. In 
contrast, photoperiod insensitivity in wheat is regu¬ 
lated by a series of three homoeologous genes located 
in the colinear region on chromosome 2 group, which 




Crop Development Related to Temperature and Photoperiod. Figure 6 

Development rate responses to changes in day length at constant temperature, (a) Leaf appearance of plants grown in 
different day lengths and (b) development rate as a function of day length (Based on [41]) 
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Crop Development Related to Temperature and 
Photoperiod. Figure 7 

Model of the photoperiod pathway of flowering time. 
Arrows mean induction (or up-regulation) of gene 
expression; for example, CONSTANS is induced by the 
expression of Ppd genes 

seem to be upstream of the CONSTANS gene in the 
photoperiod pathway (Fig. 7). The Ppd-1 Da allele 
confers insensitivity to photoperiod in a semidominant 
fashion, allowing wheat plants to flower regardless of 
the photoperiod [47]. Sequence analysis of wheat 
varieties known to have this mutation shows that 
there is a 2,089 bp deletion upstream of the coding 
region responsible for the photoperiod insensitivity 
phenotype. Other sequence variations producing 
nonfunctional proteins (null alleles) at the 2A and 2D 
genes in wheat have been observed; however their 
effects on photoperiod sensitivity are difficult to assess 
because they might be masked by functional proteins 
from other homoeologous genes. These, and other 
mutations, however, may have a quantitative effect in 
photoperiod sensitivity and flowering time [47]. 

Coordinated Temperature and Photoperiod 
Regulation of Crop Development 

The temperature and photoperiod regulation of crop 
development has been described in previous parts of 
this article, but both environmental factors act in coor¬ 
dination. It is clear that the phyllochron varies with 
planting date in temperate cereals and it has been 
suggested that the phyllochron is fixed by the rate of 
change of the photoperiod at crop emergence [48]. 
However, in general, temperature and photoperiod 
change together in the field. Results from experiments 
in controlled environments show an effect of photope¬ 
riod on the temperature-response curve [49, 50]. The 
term “thermo-photo ratio” (the degree-days divided 
by day length in hours) has been used to study the 
coordinated effect of temperature and photoperiod 


on the phyllochron [49]. A linear relationship was 
found between the phyllochron and the thermo¬ 
photo ratio under both controlled environments and 
field conditions [49]. Slafer and Rawson [51] 
partitioned the photoperiod sensitivity of wheat 
phenophases into different parameters, which were 
affected by temperature, describing an interaction 
between genotype, photoperiod, and temperature. 
The effect of planting date on crop development has 
been studied extensively, yet it is difficult to draw 
conclusions of the coordinated effect of photoperiod 
and temperature on crop development. This is because 
when planting date is changed, both photoperiod and 
temperature are changed and their effects are difficult 
to separate. 

Genetic Framework of Flowering Time Several 
pathways regulate time to flowering in crops, namely, 
vernalization, photoperiod, autonomous, and 
gibberellic acid. The vernalization and photoperiod 
pathways have been described in parts 5.2 and 5.3 and 
FT/VRN-3 gene is in both pathways, thus integrating 
the response to both vernalization and photoperiod 
factors (Figs. 5, 7, 8). 

In the ancestral form of wheat and barley, after 
germination in the fall VRN-2 is highly expressed by 
the long days and represses the expression of FT/ 
VRN-3. As winter progresses, the photoperiod 
decreases and low temperatures induce the expression 
of VRN-1 in the leaves, which represses the expression 
of VRN-2 , allowing FT to be expressed by long days in 
the spring, a process regulated by photoperiod genes 
Ppd and CO. Then, the protein encoded by FT/VRN-3 
is translocated to the shoot apex, where it up-regulates 
VRN-1 , which will induce the switch to reproductive 
phase. The HAP (HEME ACTIVATOR PROTEINS) 
complexes may mediate the transcriptional regulation 
of the CCT domain of CO and VRN-2. In plants, HAP 
subunits are encoded by multiple genes that, together 
with CCT domain proteins that can interact with HAP 
complexes, generate a large number of molecular com¬ 
binations. These combinations provide a flexible sig¬ 
naling system that can integrate responses to 
environmental cues as photoperiod, vernalization, or 
stress (reviewed in [52]). 

In environments where temperate cereals usually 
grow and were domesticated, vernalization requirements 
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Flowering 


Crop Development Related to Temperature and 
Photoperiod. Figure 8 

Model of flowering related to photoperiod and 
vernalization. Thick blue arrows mean induction (or up- 
regulation) of gene expression or developmental 
process; for instance, photoperiod induces the 
expression of VRN-2. Square headed lines indicate 
repression (or down-regulation) of gene expression; for 
instance, VRN-3 up-regulates VRN-1 at the apex 


(reviewed in [53]) in Arabidopsis. In Arabidopsis , 
the FT gene is repressed by FLQ which is down- 
regulated by vernalization. The stable repression of 
FLC involves de-acetylation of histone 3 (H3) upstream 
of FLC , methylation of H3 Lys9 and Lys27, which 
allows the binding of HP1 inducing the stable silencing 
of FLC 

Less known is the genetic basis of the quantitative 
response to temperature (thermal-response curve). 
The Earliness per se Al ( Eps m Al ) gene affects time to 
flowering by reducing the vegetative phase and it has 
been associated with responses to temperature, proba¬ 
bly by modifying the optimum temperature [54]. Two 
candidate genes for Eps m Al are located in the genomic 
region where this gene has been located, Motl and 
FtsH4 [55]. The Motl gene has features of the SNF2 
family of transcriptional regulators. Other members of 
this family have been related to regulation of flowering 
in Arabidopsis , but the gene expression data do 
not show differences between the two alternative 
alleles [55]. The FstH4 is a member of the FstH 
family of proteases and is homologous to the 
Arabidopsis FstH4 , which has been found highly 
expressed in seed, and mutants in this gene show 
delayed germination that is carried over the growth 
cycle [55]. 
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are met long before the photoperiod is inductive of the 
reproduction phase. This requires plants to “remember” 
they had been vernalized. The mechanism by which 
temperate crops “remember” vernalization is not 
known, but it has been studied in the model plant 
Arabidopsis. In addition to the genetic information car¬ 
ried on the DNA sequence (genes and alleles), chromatin 
structure is recognized as another source of genetic 
information. The chromatin can be highly condensed 
(heterochromatin) or more relaxed (euchromatin). 
The DNA is combined with proteins called histones 
and specific covalent histone modifications of histones 
favor the formation of chromatin structure that 
influences the level of gene expression. Other nongenetic 
mechanisms of gene expression are related to DNA 
methylation. The DNA methylation at the promoter 
region of genes is generally related to lower levels of 
gene expression, and even gene silencing. Of the two 
mechanisms, it seems that histone modifications are 
involved in regulating the memory of vernalization 


Modeling Approaches 

For centuries, people have wanted to understand and 
predict aspects of crop development, particularly phe¬ 
nology. To do this, different conceptual, statistical, and 
mathematical models have been developed. Beginning 
in the 1970s, a variety of digital technologies began to 
emerge, one of which was crop simulation models for 
predicting growth, development, and yield. This sec¬ 
tion presents a broad overview of these crop simulation 
models, emphasizing wheat. 

Many crop simulation models exist for simulating 
growth, development, and yield, and they cover scales 
from specific processes to the agroecosystem. Crop 
simulation models are a simplified mathematical 
representation of the plant. At the most fundamental 
level [56], crop simulation models generally simulate 
a trait, for example, yield (7), as the function of 
daily growth rate ( GR ) that is partitioned to the 
yield component (P) and integrated over a daily time 
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step from emergence ( emerge ) through physiological 
maturity ( maturity ): 

Y = J GR*P (4) 

Implementing Eq. 3 in a model usually begins 
assuming non-limiting conditions, thereby allowing 
for potential production to be estimated. The parame¬ 
ters can either be generic for a crop or adjusted to 
a specific genotype. By incorporating environmental 
variables (e.g., temperature, water, light, C0 2 , and 
nutrients), crop simulation models can examine crop 
or genotype responses across a broad environmental 
range of limiting conditions, avoiding a common lim¬ 
itation of statistical (or regression) models. Depending 
on the purposes, including the role of biotic factors or 
management practices may also be important. 

How a model implements Eq. 3 depends on model 
objectives and interests of the model developers. The 
earliest crop simulation models tended to focus on the 
scale of whole-plant growth and development, with 
little detail on processes at lower scales. These models 
use an energy- or light-driven approach to determine 
the growth rate, and this approach remains popular 
today. The basic approach simulates leaf area index 
on a daily time step, which is used to capture energy/ 
sunlight and produce biomass that is then distributed 
to basic plant components of leaves (providing the 
feedback to the cycle), stems, roots, and seeds. 
Partitioning coefficients are often used to allocate the 
biomass produced, and phenology sub-models are 
essential in accurately predicting the timing when 
sources and sinks are present, and changing 
partitioning coefficients based on developmental stage. 

As crop simulation modeling progressed, certain 
trends emerged. First, greater attention focused on 
representing plant processes below the whole-plant 
level. In general, energy- or light-driven modeling 
emphasized functional physiology, particularly for 
assessing energy balance and leaf functioning at the 
individual organ level (e.g., [57, 58]). Second, consid¬ 
erable research on crop development during the 1970s 
and 1980s undoubtedly spurred interest in including 
this new knowledge in the models. This led to alterna¬ 
tive modeling approaches based on more developmen- 
tally driven approaches that recognized that plant 
development is orderly and predictable based on basic 


units (i.e., the phytomer) that dynamically appear, 
grow, and senesce over time as discussed earlier in 
this entry and shown in Fig. 9. 

Early efforts beginning in the mid- 1980s focused on 
developmental concepts such as leaf appearance (the 
phyllochron) and tillering that led to more accurate 
representation of canopy architecture. For instance, 
the AFRCWHEAT1/2 model developed in Europe 
[60-62] contained detailed tillering and leaf dynamics 
sub-models (e.g., appearance, growth, and senescence/ 
abortion), and the resulting effect on canopy LAI 
was simulated and then used to estimate biomass. 
Simultaneously and independently, another effort was 
underway in the USA that resulted in the developmen- 
tally driven MODWht3 [63] and SHOOTGRO [http:// 
arsagsoftware.ars.usda.gov, [59, 64-67]] models. 

SHOOTGRO is slightly more developmentally detailed 
for canopy processes than MODWht3, but less detailed 
in the root system and simulating biomass production. 
SHOOTGRO provides the foundation to simulate the 
development and growth of individual phytomers (and 
phytomer components as shown in Fig. 9) on each 
morphologically identified shoot (main stem and til¬ 
lers) on the median plant of up to six age classes, or 
cohorts, based on time of seedling emergence. 

The Sirius model has one of the most developed 
and robust leaf appearance sub-models of any wheat 
simulation model [68]. As with the MODWht3 and 
SHOOTGRO models, the assumption used is that the 
developmental “clock” from emergence to anthesis is 
best represented by the rate of leaf appearance and final 
number of leaves, rather than thermal time. Based on 
vernalization requirement and photoperiod sensitivity 
of the variety being simulated and leaf ontogeny, the 
final leaf number is determined [69, 70] . This allows for 
an elegant quantitative description of both spring and 
winter wheat leaf appearance and integration with 
developmental events. 

Regardless of modeling approach and goals, the 
ability to simulate genotype phenology across a broad 
range of environments for major crops such as wheat 
has been quite reliable. Many alternative approaches 
exist for predicting phenology, and approaches differ in 
input requirements and number of developmental 
stages simulated. Essentially all models are based on 
the thermal time approach, reflecting the importance 
of temperature discussed in Section Developmental 
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Crop Development Related to Temperature and Photoperiod. Figure 9 

Developmental sequence diagram of a generic winter wheat for optimal conditions. Question marks refer to 
uncertainty, important cultivar variation, or conflicting reports in the literature. Time line legend is TT is the thermal 
time for the interval; #LVS is number of leaves for the interval; S, sowing date; G, germination; E, seedling emergence; 
Tl, tiller initiation/appearance; SR, single ridge stage; DR, double ridge stage; J, jointing; B, booting; H, heading; 

A, anthesis; and M, physiological maturity (From [59]) 


Response to Temperature. An alternative to a strict 
thermal time approach, particularly for small-grain 
cereals, has been to use leaf numbers to estimate the 
time interval between developmental stages. In phenol¬ 
ogy sub-models, temperature effects are well consid¬ 
ered, but rarely are the effects of water deficits (or 
nutrient availability) considered [71]. Exceptions 
include the SHOOTGRO model and PhenologyMMS 
(http://arsagsoftware/ars.usda.gov) . 

Determining plant parameters and how to address 
the genotype by environment interaction are 
common concerns for all models. With the explosion 
of genome mapping and molecular biology research, 


opportunities for understanding and resolving these 
issues are emerging [72-74]. For example, the presence 
or absence of known alleles influencing a trait can be 
used to determine the parameters used in the algorithm 
representing the process [75] or the response to envi¬ 
ronmental factors [76]. Clarification of gene networks 
controlling processes such as time of flowering has 
considerably advanced the understanding and simula¬ 
tion of these processes [77]. 

Crop simulation modeling is increasingly benefit¬ 
ing from the advent of object-oriented design and 
programming languages such as C++, C#, and Java, 
both in terms of developing and maintaining models as 
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well as providing greater flexibility in representing 
plant processes within models. Initial object-oriented 
designs tended to view the plant as a collection of 
objects that equate to leaf, stem, root, and seed com¬ 
ponents. Recent attempts have begun to incorporate 
the phytomer approach of building plant canopies into 
the object-oriented design that can also be scaled up, or 
aggregated, into lower levels of resolution, such as the 
seed component of earlier designs [78-80]. 

Future Directions 

Crop development is regulated by environmental factors 
that interact with the genetics of the plant. Many devel¬ 
opmental responses related to temperature and photope¬ 
riod are well known and it is possible to predict them 
reasonably well at the crop level. Similarly, rapidly emerg¬ 
ing knowledge from genomics research is helping to 
provide understanding of the genetic basis of certain 
aspects of crop development. Unfortunately, the quanti¬ 
tative integration of genetic and physiological knowledge 
is largely unknown, and both genetic and physiological 
models would benefit from better integration of knowl¬ 
edge. For example, the genetic basis of photoperiod and 
vernalization pathways is fairly well known and as new 
genomic studies are carried out, more complex models 
are being built [81] showing the complexity of flowering 
time. However, the genetic mechanisms described in 
this entry interact with responses to other environmen¬ 
tal factors defining a network of signaling a highly 
complex response that is not fully understood. 

Knowledge in crop development has greatly 
benefited from research in model organisms; however 
there are key differences, like the lack of the vernaliza¬ 
tion gene FLC in temperate cereals. For example, four 
vernalization genes have been described in wheat, but 
only three have been cloned and located in the vernal¬ 
ization pathway. Completing this pathway would 
greatly increase the understanding of how temperate 
crops respond to nonfreezing cold temperatures. At the 
same time, the physiological response to temperature 
has been extensively studied, and genetic differences 
are well documented; however, the basis of genetic 
effects is not known. It is encouraging that new genes 
(e.g., Eps-1) are being identified, but the quantitative 
variation of crop development, once major processes 
(photoperiod and vernalization requirements) are 


solved, requires the identification of new genes, 
probably of small effect. For example, a recent study 
in maize found no large effect QTL for flowering time 
in a nested mapping population [82]. Although no 
major epistatic or environmental interactions were 
found, the individual QTL effect varied across founder 
lines of the population [82]. QTL or gene effects need 
to be accurately determined to build quantitative 
genetic models. 
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Glossary 

Biotroph A plant pathogenic microorganism which 
requires living host tissue in order to complete its 
life cycle. Rust and powdery mildew fungi are 
examples of biotrophs, as are viruses. 

Oomycetes Also known as water molds, the 
Oomycetes are a large group of terrestrial and 
aquatic organisms. They superficially resemble 
fungi in mycelial growth and mode of nutrition, 
but molecular studies and distinct morphological 
characteristics place them in the kingdom 
Stramenopila (or Chromista) with brown and 
golden algae and diatoms. 

Phytoalexin Antimicrobial substances synthesized de 
novo by plants and which accumulate rapidly at 
areas of infection by an incompatible pathogen. 
Phytoalexins are broad spectrum in action and are 
chemically diverse with different types characteris¬ 
tic of particular plant species. They can be grouped 
into several classes including terpenoids, alkaloids, 
phenolics. 

Saprophyte An organism, e.g., a fungus or bacterium, 
that grows on and derives its nourishment from 
dead or decaying organic matter. 

Virulence Refers to the relative ability of a pathogenic 
organism to cause disease. 

Definition of the Subject 

Plant diseases cause substantial crop losses every year 
and, historically, have led to considerable economic 
damage and human suffering. Controlling plant dis¬ 
eases is therefore vital to maintaining crop productivity 
and feeding the ever expanding human population. 
Crop diseases can be controlled using a variety of 
methods, notably cultural approaches, the use of resis¬ 
tance in the plant, and the application of chemicals 
(fungicides). However, the organisms that cause plant 
disease (plant pathogens) are genetically adaptable and 
can overcome plant resistance, and the toxic effects of 
fungicides. Ensuring that crops are adequately 
protected depends therefore on continually keeping 
one step ahead of the pathogens by improving existing 
control measures and developing new approaches. This 
article provides an overview of the various methods, 
traditional and novel, used to control crop diseases. 


Introduction: The Need for Disease Control 
in Crops 

Plant disease has plagued mankind ever since the 
beginnings of agriculture. Today, despite the many 
advances in crop protection technology, crop diseases 
continue to wreck havoc on crops, because of the 
genetic adaptability of the pathogens which cause 
plant disease. Crop losses at farm level can have serious 
implications for growers, but crop disease can inflict 
much more serious damage on a larger scale. A good 
example of just how devastating crop disease can be is 
the potato blight epidemic of the 1840s in Europe. 
This disease, caused by the Oomycete pathogen 
Phytophthora infestans , decimated crops across Europe 
and in Ireland, led to the death of some one million 
people and the emigration of several million more 
[1, 2]. Astonishingly, today, more than 170 years later, 
potato blight still poses a major problem for potato 
growers across the globe. 

Crop losses as a result of disease can be expressed in 
various ways, such as potential losses and actual losses. 
Potential loss compares yields in a system without any 
form of crop protection treatment, with yields from 
a system with a similar intensity of crop production, 
but receiving crop protection treatments. Actual losses 
are those sustained despite the use of crop protection 

[3] . The efficacy of crop protection can be calculated as 
the percentage of potential losses prevented. Potential 
losses range from 8.5% for cotton to 21.2% for pota¬ 
toes, while actual losses range from 7.2% for cotton to 
14.5% for potatoes, highlighting the importance of 
crop protection in reducing potential losses in all of 
these crops (Table 1). 

Globally, agricultural production has grown faster 
than the human population over the past few decades 

[4] . In most parts of the world, this has been the result, 
not of increased area of cropped land, but of increased 
inputs, including pesticides [4] . In the period from 1963 
to 2002, cereal yields increased by 114% globally, 
although the annual rate of growth fell from 3.14% in 
the period 1963-1976 to 0.84% in the period 1989-2002 
[4]. In the period from 1960 to 2004, pesticide sales 
worldwide increased more than ten-fold to some $30 
billion [3]. However, despite this increased pesticide 
use, crop losses as a result of pests, diseases, and weeds 
have not fallen significantly in the past 40 years. 
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Estimated loss potential and actual losses due to patho¬ 
gens (fungi and bacteria) in six major crops worldwide in 
2001-2003 (Adapted from [3]) 



Crop losses (%) due to pathogens 

Crop 

Potential 

Actual 

Wheat 

15.6 [12-20] 

10.2 [5-14] 

Rice 

13.5 [10-15] 

10.8 [7-16] 

Maize 

9.4 [8-13] 

8.5 [4-14] 

Potatoes 

21.2 [20-23] 

14.5 [7-24] 

Soybeans 

11.0 [7-16] 

8.9 [3-16] 

Cotton 

8.5 [7-10] 

7.2 [5-13] 


Controlling Crop Diseases 

Crop disease can be controlled using a variety of 
approaches. The first line of defense is the exclusion 
of the pathogen through plant quarantine and, for 
example, the use of pathogen-free propagating mate¬ 
rial. The next line of defense is to exclude, eliminate, or 
reduce pathogen inoculum. This can be achieved in 
various ways, including cultural control, use of host 
plant resistance, and chemical control. Finally, several 
of these approaches might be used together in an inte¬ 
grated program of disease control. In the sections 
below, these disease control options will be examined 
in more detail. 

Cultural Control 

Cultural control aims to prevent contact with the path¬ 
ogen, to create environmental conditions unfavorable 
to the pathogen or at least to avoid favorable condi¬ 
tions, or to reduce the amount of pathogen inoculum 
available to infect crop plants. Methods used include 
host eradication, crop rotation, sanitation, irrigation, 
tillage, and improving crop growth conditions, for 
example, through appropriate fertilizer use. Cultural 
control provides the foundation for disease control in 
crops, and yet its important is often overlooked. 

Host Eradication Host eradication refers to the 
removal and disposal of whole infected plants. This 
method is used routinely in nurseries, greenhouses, 
and fields to prevent the spread of pathogens, since it 


eliminates the infected plants that act as a source of 
inoculum. In potato cultivation, pathogens can over¬ 
winter in infected tubers left in the field and give rise to 
infected plants (known as volunteers) in the spring. 
These volunteers can acts as sources of inoculum, and 
their removal from the field and subsequent destruc¬ 
tion will reduce levels of pathogen inoculum. 

If a pathogen requires two hosts to complete its life 
cycle, control is possible by eradication of the less 
important host. The wheat stem rust fungus, Puccinia 
graminis f.sp. tritici, is a case in point. It requires two 
hosts, wheat and barberry, to complete its life cycle and 
until the 1950s was the most important pathogen of 
wheat in the United States [5]. Since the 1950s, how¬ 
ever, stem rust has declined in importance in the 
United States, due in part to successful eradication of 
its alternate host, common barberry [6]. 

Sanitation Sanitation refers to eliminating or reduc¬ 
ing the amount of inoculum present by various means, 
including removal of infected plant parts and plant 
debris. Destroying crop residues is an important prac¬ 
tice, but how it is performed depends upon the type of 
crop and the type of pathogen. For example, burying 
crop debris can destroy certain pathogens, particularly 
if the residues are plowed in deeply enough, while 
burning crop residue is common practice for cereal 
crops in some parts of the world and will destroy 
many pathogens. However, burning has some draw¬ 
backs, particularly loss of nutrients and increased soil 
erosion. 

Crop Rotation Crop rotation is an ancient cultural 
practice and its benefits include maintenance of soil 
structure and organic matter, and a reduction in soil 
erosion that is often associated with continuous row 
crops [7]. The main purpose of rotating crops in con¬ 
ventional arable rotations is to reduce the incidence of 
diseases, pests, or weeds that are difficult to control 
with pesticides, and for this reason, short rotations of 
two to three crops are usually employed. In the United 
States, for example, the majority of the maize crop is 
grown on a 2-3 year rotation, while in the UK, barley 
and wheat usually form the main part of the rotation, 
with breaks of oilseed rape, beans, peas, or potatoes [8] . 

Continuous cropping with the same susceptible 
host plant will result in the establishment of a soil 
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population of pathogenic microbes. Crop rotation 
avoids this and is often associated with a reduction in 
crop diseases caused by soilborne pathogens [7]. Using 
nonhost or less susceptible crop plants in the rotation 
can lead to a decline in specific populations of plant 
pathogens in the soil and is best suited for biotrophs, 
since they require the presence of the specific living 
host for survival, or pathogens with low saprophytic 
ability [9, 10]. Crop rotation is less suitable for con¬ 
trolling root-inhabiting pathogens that survive sapro- 
phytically or can exist for long periods in soil, e.g., 
pathogens with tough survival structures such as 
Rhizoctonia solani , Sclerotinia sclerotiorum , and 
Pythium spp. [11, 12]. 

Tillage Tillage has indirect effects on pathogen 
spread and can also be used to reduce pathogen inoc¬ 
ulum in the soil. Conventional tillage uses primary and 
secondary cultivation to prepare a seed-bed for plant¬ 
ing and results in considerable soil disturbance, while 
reduced tillage uses a single cultivation, or even no 
cultivation (no-tillage, zero tillage, direct drilling), 
and as a result leads to minimal soil disturbance. 
Minimum tillage and no-tillage practices can be 
grouped together under the generic term: conservation 
tillage [13]. 

Tillage can bury pathogens deeper in the soil where 
they are less likely to become a problem. It can alter soil 
texture, aeration, temperature, moisture, and density, 
and can also influence nutrient release in the soil, with 
benefits to the crop [8]. Tillage also leads to clear 
fluctuations in microbial activity and biomass in the 
soil [14]. Reduced tillage or no-till is often associated 
with higher microbial biomass and activity in upper 
soil layers compared to regular tillage (plowing) [15]. 
This concentration of crop debris in the top layers of 
the soil can promote the overwintering and survival of 
numerous pathogens and has prompted concern that 
increased disease and decreased yields will be the inev¬ 
itable result of using conservation tillage practices. 
Although this has proved to be the case under some 
conditions, there have also been reports of decreases in 
the incidence of soilborne diseases. As suggested by 
Sturz et al. [13], such contradictory reports may reflect 
differences in root development and soil microbial 
biomass and activity under the different regimes. 
Thus, conservation tillage practices can lead to 


pathogen inoculum concentrations several orders of 
magnitude greater than those found under conven¬ 
tional tillage [16, 17] and, as a result, plant roots grow¬ 
ing in the upper soil layers might be more prone to 
pathogen infection [13]. In contrast, increased micro¬ 
bial biomass and activity in the top soil layers can give 
rise to greater root density and root activity [18, 19], 
which may offset the damaging effects of disease on yield, 
and might also provide a highly competitive soil envi¬ 
ronment with resulting disease suppressive effects [20]. 

Severity of tan spot in wheat was found to increase 
under no-till conditions, but was reduced following 
reduced tillage [21]. To control blackleg 
(Leptosphaeria maculans) on canola (oilseed rape), it 
is recommended that crop debris is buried in the 
autumn and a nonhost crop be direct seeded the fol¬ 
lowing spring to avoid reexposing the buried residue 
[22, 23]. Recent research suggests that inoculum pro¬ 
duction by L. maculans decreased with increasing dura¬ 
tion of stubble burial in the field over 10 months before 
stopping completely [24]. This effect maybe due to the 
mycobiota associated with the buried stubble, and 
these workers suggest that it might be possible to 
manipulate the population of saprophytic microbiota 
present on oilseed rape stubble to facilitate the decline 
of L. maculans [24]. 

Sowing Practices 

Time of Sowing Altering the time of sowing to avoid 
high levels of pathogen inoculum or conditions 
conducive for development of a particular disease can 
lead to reduced severity of several crop diseases. For 
example, in the UK, sowing winter oilseed rape in 
August rather than September exposes the earlier 
sown crop to inoculum from stubble of the previous 
crop, resulting in more severe Alternaria infection on 
pods. In contrast, the risk of infection is reduced in the 
later sown crop because the stubble is buried by tillage 
[25]. Late sowing may also be recommended for 
autumn-sown barley crops, in order to decrease 
exposure of newly emerging seedlings to inoculum of 
Rhynchosporium secalis produced on previous barley 
crops in the area [26]. 

Depth of Sowing Sowing depth can influence the risk 
of infection, since the preemergence stage of the 
seedling, which is usually more susceptible to pathogen 
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infection, is longer when seeds are sown deeper. In 
Brassica rapa , for example, rapid emergence of 
seedlings reduces preemergence damping-off because 
the period of contact between the emerging seedlings 
and R. solani in the soil is reduced [27]. Thus, 
significantly higher seedling emergence was reported 
for several cultivars of B. rapa sown at a depth of 1.5 cm 
compared to 3.0 cm [27]. 

Crop Density Crop density can exert considerable 
influence over disease incidence due to the ease with 
which pathogen inoculum can be transferred between 
closely spaced plants and alterations in crop 
microclimate. In densely planted crops, temperatures 
are more uniform, humidity is increased, and leaves are 
wet for longer, all of which provides favorable 
conditions for pathogen infection and subsequent 
development. Crop density can be manipulated in 
various ways, e.g., sowing, pruning, and fertilization. 

Soil Amendments: Mulching, Fertilizers, and 
Organic Amendments 

Mulches Mulches are used to conserve organic matter 
and moisture and to reduce soil erosion. A variety of 
materials can be used as mulches, including straw, 
manure, plastics, and paper. Mulching can lead to water 
retention and nutrient enrichment in the soil and can 
decrease soil temperature, all of which can influence 
pathogen infection and disease development in plants. 
Although mulching can reduce the spread of splash 
dispersed pathogens, by altering the environment, it 
could lead to increased severity of some diseases. 
Further, if crop residues are used in mulching, disease 
incidence could increase since the residues could be 
used as a food source by a range of pathogens. 

Fertilizers Adequate mineral nutrition is central to 
crop production, and can also exert considerable 
influence on disease development [28, 29]. Below, the 
influence of nitrogen, phosphorus, potassium, calcium, 
and silicon on plant disease will be dealt with briefly. 

Nitrogen Using nitrogen fertilizer above recommen¬ 
ded rates can lead to increased disease incidence and 
lesion area. This has been shown for biotrophic fungal 
pathogens such as powdery mildews and rusts [30, 31] 
and necrotrophic pathogens such as Magnaporthe 
grisea , the cause of rice blast [32]. It is commonly 


thought that application of nitrogen fertilizer can 
increase disease severity via effects on crop canopy 
development. Thus, large canopies with high shoot 
densities may be more conducive to spore transfer 
and pathogen infection than sparse canopies. For 
example, nitrogen has been shown to increase the 
severity of Fusarium head blight in wheat, and it has 
been suggested that this might be the result of 
a nitrogen-induced increase in canopy size, leading to 
an altered microclimate [33]. In contrast, work on 
yellow rust on winter wheat suggested that the impact 
of nitrogen on disease was the result of effects of 
nitrogenous substances in wheat leaves on pathogen 
growth, rather than effects on canopy growth and 
microclimate [34]. 

But nitrogen is not always associated with increased 
crop disease. Indeed, various studies have reported no 
effect of nitrogen on disease severity (e.g., [35, 36]), 
while Hoffland et al. [31] found that the effect of 
nitrogen depended on the type of pathogen. Thus, 
nitrogen increased susceptibility of tomato to the pow¬ 
dery mildew pathogen Oidium lycopersicum and the 
bacterial pathogen Pseudomonas syringae pv. tomato, 
while it had no effect on susceptibility to the vascular 
wilt pathogen Fusarium oxysporum f.sp. lycopersici [31 ]. 
In contrast, tomato plants were more susceptible to 
Botrytis cinerea when grown under low nitrogen con¬ 
ditions [37] . These results do not support the view that 
nutrient-limited plants are better defended [38, 39]. It 
seems that generalizing about the effects of nitrogen on 
plant disease is unwise, and practically, although 
manipulation or assessment of crop nitrogen status 
might be used as part of disease control strategies, the 
approach adopted will depend on the crop and the 
pathogens from which it is most at risk [29]. 

Phosphorus In general, phosphorus fertilization 
tended to improve plant health, with reductions in 
disease recorded in 65% of cases studied by Perrenoud 
[40]. Nevertheless, phosphorus fertilization increased 
disease and pest problems in 28% of the cases examined 
[40]. As with nitrogen, the effects of phosphorus on 
plant disease might be the result of direct effects on the 
pathogen, host plant metabolism, leading to effects on 
pathogen food supply, and effects on plant defenses 
[29]. Indeed, foliar application of phosphate salts has 
been shown to induce resistance to pathogens in 
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a range of crop plants, including cucumber [41], broad 
bean [42], grapevine [43], maize [44] and rice [45]. 

Clearly, an adequate phosphorus supply is impor¬ 
tant for crop growth and in turn, may well help to 
reduce disease. However, the regime of phosphorus 
fertilizer used will depend on a range of factors, includ¬ 
ing the crop and the pathogens likely to be important. 
Reuveni and Reuveni [46] suggested that foliar-applied 
phosphate might be used as part of an integrated dis¬ 
ease control program. However, grower adoption of 
such an approach will depend on the existence 
of other, effective disease control measures and the 
economics of disease control in the particular crop. 

Potassium There are many reports that potassium is 
associated with disease reductions [47]. However, as 
these authors point out, inadequate consideration has 
been given to the effects of associated anions, nutrient 
balance, and nutrient status, to allow the definitive role 
of potassium to be determined. Thus, it has been 
suggested that in some cases, the effects of potassium, 
applied as potassium chloride fertilizer, might be due to 
the chloride ion rather than potassium [48]. Further, 
chloride fertilization has been shown to suppress dis¬ 
ease in cereal crops [49]. 

There has been much interest in the application of 
fertilizers to crop foliage, including the effects of foliar 
fertilizer application for crop disease control [46, 50]. 
Foliar-applied potassium chloride has been shown to 
control Blumeria graminis and Septoria tritici on wheat 
in field studies [51, 52], probably due to osmotic effects 
on the fungal pathogens, disrupting pathogen develop¬ 
ment and subsequent infection [52, 53]. 

Application of potassium to deficient soils usually 
increases plant resistance to diseases [47]. This might 
be partly related to the effect of potassium in increasing 
epidermal cell wall thickness or disease escape as 
a result of vigorous crop growth [47], although the 
mechanisms by which potassium affects plant disease 
are not well understood. 

Calcium There are many reports that application of 
calcium to soils, foliage, and fruit reduces the incidence 
and severity of a range of diseases of crops, including 
cereals, vegetable crops, legumes, fruit trees, as well as 
postharvest diseases of tubers and fruits [54]. For 
example, calcium has been shown to inhibit anthrac- 
nose (caused by Colletotrichum gloeosporiodes or 


C. acutatum) in apples [55] and to decrease postharvest 
disease development on strawberry [56], while treat¬ 
ment of tomato with calcium carbonate reduced fusar- 
ium crown rot disease [57]. In contrast, Nam et al. [58] 
could find no effect of calcium on anthracnose on 
strawberry. Because calcium increases resistance of 
plant cell membranes and cell walls to microbial 
enzymes, increasing calcium concentrations in storage 
organs could lead to enhanced resistance to pathogens 
[59, 60]. However, the form in which the calcium is 
applied can influence the mechanism by which calcium 
affects disease. For example, the addition of lime 
can affect disease by altering pH, while calcium salts 
(e.g., propionate) can be directly inhibitory to patho¬ 
gens [54]. Making general recommendations for the 
use of calcium in plant disease control would be 
unwise, due to the range of crops and pathogens 
affected by calcium application. Instead, the appropri¬ 
ate amount and form of calcium to apply need to be 
determined for individual crop-pathogen interactions. 
The dwindling availability of fungicides, together with 
increasing public concern for the environment means 
that the use of calcium to control plant disease, espe¬ 
cially postharvest, is attracting increased attention. 

Silicon The effects of silicon in reducing disease 
severity have been known since 1940 [61]. However, it 
was not until the 1980s that more detailed work was 
carried out in this area. Thus, cucumbers grown in 
nutrient solutions supplemented with silicon were 
found to have significantly less powdery mildew infec¬ 
tion than plants not receiving silicon supplementation 
[62, 63]. Indeed, silicon has been shown to suppress 
both foliar and soilborne pathogens in cucurbits [64] 
and to reduce susceptibility of rice to various pathogens 
[65]. Wheat grown in soil amended with silicon 
showed reduced infection by several pathogens, includ¬ 
ing B. graminis f.sp. tritici, S. tritici , and Oculimacula 
yallundae (Fig. 1) [66, 67]. 

It has been suggested that the effects of silicon in 
providing disease control are due to the creation of 
a mechanical barrier to penetration [68]. However, 
this has been disputed by studies which could find no 
evidence for the creation of a physical barrier following 
silicon treatment in wheat inoculated with powdery 
mildew and bitter gourd and tomato inoculated with 
Pythium aphanidermatum [69, 70]. Rather, several 
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Crop Diseases, Management and Control of. Figure 1 

Changes in disease in plots with incorporated straw. Bars show the difference between straw-treated and control plots 
as a percentage of the average in the control plots. For Mycosphaerella graminicola, the data are for severity on the top 
three living leaves; for other diseases, they refer to incidence. For M.graminicola, errors bars were calculated on the 
transformed scale and back-transformed; for other diseases, no transformation was needed (From [66]) 


studies have suggested that silicon activates defenses in 
plants. For example, in wheat inoculated with 
B. graminis f.sp. tritici , epidermal cells of silicon-treated 
plants were shown to react to attempted infection with 
specific defenses, including papilla formation and 
callose production [71]. In the rice-M. grisea 
pathosystem - silicon-mediated resistance was found 
to be associated with accumulation of antimicrobial 
compounds at infection sites, including diterpenoid 
phytoalexins [72]. In fact, phytoalexin accumulation 
occurs in silicon-mediated resistance in both dicots 
and monocots and since phytoalexins are highly spe¬ 
cific to plant species, it has been suggested that silicon 
might be acting on mechanisms shared by all plant 
species, e.g., those resulting in activation of plant stress 
genes [73]. 

Organic amendments Organic amendments include 
animal manure, solid wastes, and composts. Such 
amendments are often used to improve soil quality, 
usually by contributing to general suppressiveness 
through enhanced microbial biomass and activity [7]. 
Organic amendments are rich in labile carbon fractions 
which are an energy source for microorganisms, and 
moreover, they can themselves contain antagonistic 
microbes. A substantial body of data indicates that 
organic materials can reduce incidence of diseases 
caused by a range of plant pathogens [74, 75]. 


Irrigation Although an adequate water supply is vital 
to crop production, irrigation can play a detrimental 
rather than a beneficial role in managing plant diseases. 
For example, irrigation water can spread pathogen 
propagules and under dry conditions, can prevent des¬ 
iccation of such propagules, thereby effectively increas¬ 
ing the level of inoculum in soil. Watering from 
overhead prolongs leaf wetness, thereby providing 
favorable conditions for germination and infection by 
fungal spores. Overhead watering also increases the risk 
of splash-dispersal of spores, thus increasing pathogen 
spread. However, irrigation can be used to reduce the 
level of pathogen inoculum. Thus, the activity of 
microbes that destroy fungal sclerotia can be increased 
by alternate wetting and drying of the soil. Generally, 
drip or trickle irrigation, which delivers water directly 
to the root zone at a rate insufficient to lead to patho¬ 
gen spread, is least likely to encourage disease 
development. 

Use of Host Resistance 

Most plants are resistant to most microbes. Thus, 
wheat plants are not affected by pathogens of tomato, 
and vice versa. This is known as nonhost resistance. 
However, every plant is attacked by its own pathogens, 
e.g., barley is attacked by the barley powdery mildew 
fungus, B. graminis f.sp. hordei, although it might not 
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be able to defend itself equally well against all patho¬ 
gens that are able to attack it. This ability of plants to 
resist attack by pathogens is genetically determined, but 
the diversity of types of genetic control of this resis¬ 
tance, and of conditions required for their expression, 
has led to a bewildering number of classifications of 
resistance types in plants. To complicate matters fur¬ 
ther, in some cases, different terms used by different 
authors are synonymous, while in other cases, the 
terms might be based on different criteria. 

Types of Resistance 

Seedling and Adult Plant Resistance Seedling 
resistance operates from the onset of plant growth 
and is effective throughout the life of the plant. It is 
generally controlled by single genes and is effective in 
the absence of matching virulence in the pathogen. In 
contrast, adult plant resistance (APR) covers a broad 
range of resistance types all of which are not effective at 
the plant’s seedling stage. APR tends to be controlled by 
a number of genes, which might operate through 
a wide variety of mechanisms. 

Polygenic and Oligogenic Resistance Resistance 
controlled by one, or at most two or three genes, is 
known as oligogenic. Here, a single gene can confer 
complete resistance. A gene conferring such resistance 
is often referred to as a major gene. Where resistance is 
controlled by a larger number of genes, it is known as 
polygenic. Genes conferring polygenic resistance are 
often referred to as minor genes. 

Race-specific and Race-nonspecific Resistance The 
terms race-specific and race-nonspecific sought to 
differentiate between resistance that was subject to 
loss of effectiveness with the appearance of new 
virulent strains of a pathogen and resistance that was 
thought would never be lost because the pathogen was 
not capable of developing virulence to it. A major 
problem with these terms has been a lack of evidence 
to suggest that any particular resistance was race- 
nonspecific. The assumption was generally made that 
genes of large effect (major or seedling genes) were 
race-specific and genes of small effect (minor genes or 
APR) were race-nonspecific. This has subsequently and 
comprehensively been shown not to be the case in 
many host-parasite systems. 


Vertical and Horizontal Resistance These terms were 
introduced to convey the difference between race- 
specific and race-nonspecific resistance [76]. In 
vertical resistance, there tends to be a high level of 
resistance by the host to some races of the pathogen 
and a low level of resistance to others, while in 
horizontal resistance, resistance tends to be exhibited 
equally to all races of the pathogen. Vertical resistance 
has also been denoted as specific resistance and 
horizontal resistance as general resistance. However, 
although the specific nature of some types of 
resistance can usually be established fairly easily, it is 
much more difficult to prove that resistance is truly 
general, since there is always the chance of a new 
pathogen race appearing with the ability to overcome 
the resistance. 

Qualitative and Quantitative Resistance These terms 
are a rough match for seedling/APR and vertical/ 
horizontal resistance, with qualitative resistance 
referring to a high level of resistance controlled by 
a single gene and quantitative resistance being 
controlled by several genes of smaller effect. The term 
“quantitative” subsequently came to be used to 
describe locations on chromosomes where genes of 
small effect were identified through mapping studies, 
hence “quantitative trait locus” or QTL. More recently 
the use of the term QTL has come to be used for major 
gene loci as well. 

Partial Resistance Partial resistance was originally 
used in studies of the resistance in barley to leaf rust 
caused by Puccinia hordei. It describes resistance that 
reduced the rate of epidemic development despite 
having a susceptible reaction type [77]. Resistance to 
leaf rust was found to be governed by up to 6-7 minor 
genes with additive effects and was correlated with 
increased latency period and reduced infection 
frequency, pustule size, infectious period, and spore 
production. The term is now used more generally to 
describe any resistance that is only partially effective in 
reducing disease expression and is usually synonymous 
with minor gene resistance. 

Durable Resistance Durable resistance was suggested 
for use by Johnson and Law [78] to refer to rust 
resistance in wheat that in practice had provided 
stable resistance in varieties that had been grown over 
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a large area for many years. The term specifically 
avoided identifying the resistance with particular 
phenotypes or suggesting that the resistance would 
never be lost to a change in pathogen virulence. 
Specific well-studied examples are the Sr2 and Rpgl 
genes for partial stem rust in wheat and barley, 
respectively, the genes Lr34 and Yrl8 which provide 
partial resistance to leaf and stripe rust in wheat and 
which may in fact be the same gene and the mlo locus 
which provides resistance to powdery mildew in barley. 

Genetic Engineering for Disease Resistance Using the 
techniques of genetic manipulation (GM), it is now 
possible to sequence and clone resistance genes from 
distantly related species. These genes could then be 
transferred into crop species, providing varieties with 
enhanced disease resistance. For example, the gene for 
resistance to bacterial blight of rice, Xa21> was 
sequenced, cloned, and transformed into rice, 
providing resistance against a range of pathogens 
[79]. This is a good example of how transgenic 
technologies could be used in a major crop and 
indeed, transgenic rice lines expressing Xa21 are 
currently being tested in the field. The vision is to use 
GM technology to pyramid several different resistance 
genes in a rice variety, providing what is hoped will be 
durable resistance. However, unless there is 
coordination among plant breeders and the industry 
to prevent varieties being grown together, thereby 
allowing the pathogen to mutate one gene at a time, 
this strategy is unlikely to be successful. 

One option for improving resistance to virus infec¬ 
tion is to transform the gene that expresses the viral 
coat protein into the host plant. This approach was 
used to develop a line of papaya resistant to the very 
damaging papaya ringspot virus (PRSV). As a result of 
this work, two papaya varieties, Rainbow and SunUp, 
were commercialized. Rainbow was widely planted in 
Hawaii and was important in preventing the complete 
destruction of the papaya industry by PRSV [80]. 

Deployment of Resistance in Practice Many new 
crop varieties have been bred where resistance to 
a particular pathogen is based on the introduction of 
one gene. All too often history has shown that the 
introduction of such varieties is followed, within just 
a few years, by the appearance of a new race of the 


pathogen, able to overcome the “new” resistance. Such 
rapid breakdown of host resistance is favored by mod¬ 
ern intensive agriculture, where crops are planted in 
monoculture covering huge areas. This favors any 
genetic variants of the pathogen with the ability to 
infect these new varieties. Breaking this “boom-and- 
bust” cycle of introducing a new resistant variety, 
followed by the rapid breakdown of resistance, can be 
achieved in various ways. One approach is to find 
durable sources of resistance. Another is to breed for 
combinations of race-specific genes in one crop variety. 
This is known as pyramiding and should be quite 
durable. However, it is controversial, since it might 
select for complex races of the pathogen possessing 
several matching virulences. Another approach is to 
diversify the deployment of resistance genes. This 
could involve spatial diversity across regions or fields, 
or the use of multilines or mixtures of cultivars. 

A multiline is a series of near-isogenic (genetically 
identical) plant lines each differing in a single character, 
e.g., disease resistance. The plant lines can be grown 
together like a conventional crop, thereby retaining 
agronomic advantages such as crop uniformity, while 
confronting the pathogen with the problem of over¬ 
coming several different genes for resistance. A number 
of mechanisms have been proposed to account for 
disease control in multilines: (1) The high proportion 
of resistant plants in the multiline grown in the field 
reduces the amount of pathogen inoculums per unit 
area of crop, thereby reducing the amount of infection 
in the succeeding generation and the amount of inoc¬ 
ulums produced by that generation. (2) Movement of 
pathogen inoculums between susceptible plants might 
be hampered by the presence of intervening resistant 
plants. (3) The average distance which the pathogen 
inoculums must travel in order to reach another sus¬ 
ceptible plant is increased. (4) There might be induc¬ 
tion of resistance, i.e., pre-inoculation with a race of the 
pathogen to which the line is resistant might protect it 
from a race to which it is normally susceptible [81]. 
However, the breeding program required for the devel¬ 
opment of a multiline is extensive. Similar in concept, 
but easier to put into practice, is the use of variety 
mixtures. Here, seed of a number of genetically distinct 
varieties of the crop are mixed and grown together as 
a single crop. Each variety possesses a different resis¬ 
tance gene(s), again presenting the pathogen with 
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a genetic puzzle. But the use of mixtures requires care¬ 
ful choice of varieties, since they must possess similar 
characteristics, such as crop height and time to 
flowering. Nevertheless, mixtures have been shown to 
develop less disease than would be expected in compo¬ 
nent varieties grown alone. Mixtures can also provide 
yield increases, and the magnitude of disease control 
and effects on yield are dependent on the number of 
components in the mixture (Fig. 2). The mechanisms 
proposed to account for reductions in disease in mix¬ 
tures include less efficient spread between plants and 
the induction of plant resistance by incompatible 
strains of the pathogen attempting to infect different 
hosts in the mixture [82]. 

Use of Chemicals to Control Crop Disease 


Fungicides can be divided into several groups 
according to their mode of action. Protectant fungi¬ 
cides protect the plant against fungal propagules (e.g., 
spores) landing on the surface of leaves, stems, or fruit, 
but tend to be ineffective against established infections, 
since they do not enter the plant to any extent. To be 
effective therefore, protectant fungicides need to be 
applied before the fungal pathogen enters the plant. 
In contrast, systemic fungicides enter the plant, and 
can become generally distributed within its tissues, 
thereby offering protection against fungal pathogens. 
Eradicant fungicides can enter plant tissues and can kill 
established infections. A number of protectant and 
systemic fungicides possess eradicant properties, 
while some systemic fungicides also have protectant 
activity. 


Chemicals continue to play an important role in crop 
disease control. These chemicals act either by inhibiting 
germination, growth, and multiplication of the patho¬ 
gen, or by killing the pathogen. Although chemicals can 
be used to control bacterial pathogens (bactericides) 
and nematodes (nematicides), this section will concen¬ 
trate on chemicals with activity against fungal patho¬ 
gens (fungicides). 


Protectant Fungicides Protectant fungicides can be 
applied to the crop as high-volume or low-volume 
sprays, or occasionally as dusts. These fungicides 
tend to be broad spectrum, with activity against 
a range of different fungal pathogens. As a result, it 
is unlikely that pathogens would develop resistance 
to these chemicals. However, because they must 
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Change in R. secalis infection of mixtures of winter barley cultivars compared with the mean of their components with 
different numbers of component cultivars (From [82]) 
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be applied before the pathogen attempts to penetrate 
the plant, there is the need for reliable forecasting of 
infection risk. Protectant fungicides include the oldest 
and still widely used inorganic sulfur and copper 
compounds. Sulfur, applied as a dust, wettable 
powder, paste, or liquid, is used to control powdery 
mildews, some rusts, leaf blights, and fruit rots. 
Bordeaux mixture, made by mixing copper 
sulfate solution with calcium oxide or calcium hydrox¬ 
ide, consists mainly of colloidal hydrated cupric 
hydroxide stabilized by calcium sulfate. It has been 
replaced by copper oxychloride, which can be formu¬ 
lated by the manufacturer and simply diluted by the 
grower. 

Organic sulfur compounds are an important and 
versatile group of fungicides and include thiram, 
ferbam, maneb, zineb, and mancozeb. These fungicides 
are all derivatives of dithiocarbamic acid, and because 
they are metabolized to isothiocyanate, they inactivate 
the sulfhydryl groups (-SH) in amino acids and 
enzymes. Thiram is used mostly for seed and bulb 
treatment for vegetables, flowers, and grasses, while 
ferbam is used to control foliage diseases, ornamentals, 
and fruit on trees. Maneb and mancozeb belong to 
a group of dithiocarbamic acid derivatives known as 
the ethylenebisdithiocarbamates. Maneb is a broad- 
spectrum fungicide for the control of foliage and fruit 
diseases of vegetables such as tomato, potato, and vine 
crops, as well as flowers, turf, and some fruit crops. 
Mancozeb is formed by the addition of zinc ion to 
maneb and a secondary effect of using mancozeb is 
the provision of manganese and zinc to plants deficient 
in these elements. 

Various aromatic compounds have been developed 
as fungicides and include dichloran, used as a foliar or 
fruit fungicide, or as a postharvest spray for vegetables 
and flowers, and chlorothalonil, which is a broad- 
spectrum fungicide used on vegetables, field crops, 
ornamentals, and turf. 

A number of very effective protectant fungicides 
belong to the rather heterogeneous group of heterocy¬ 
clic compounds and include captan, iprodione, and 
vinclozolin. Captan is used for the control of leaf 
spots, blights, and fruit rots on various crops, as well 
as a seed treatment. It works by inhibiting thiol- 
containing enzymes in the fungal cell, and may also 
react with sulfhydryl groups. 


Systemic Fungicides For a systemic fungicide to be 
effective, it must enter the plant and, to be translocated, 
it must be reasonably water-soluble. It must also be 
selective, possessing toxicity against the pathogen but 
not the host plant. Most systemic fungicides are xylem- 
mobile and as a result, tend to move from the base to 
the top of the plant, accumulating in leaves and shoot 
apices. As a result, such fungicides possess no 
activity against soilborne pathogens affecting roots. 
Phosphonate fungicides such as fosetyl-Al are also 
phloem-mobile and so can move down the plant to 
the roots, providing protection against, for example, 
root rots caused by Phytophthora species. 

A wide range of systemic fungicides are now used to 
control plant pathogens. Unlike protectant fungicides, 
most systemic fungicides are site specific, and target 
only one, or just a few, specific steps in fungal metab¬ 
olism. Although this was seen initially as a strength of 
systemic fungicides, it soon proved to be a weakness, 
since fungal pathogens were able to develop resistance 
to the chemicals, in some cases with alarming rapidity. 

Acylalanine fungicides are effective against 
Oomycete pathogens such as Phytophthora and 
Pythium and include the widely used fungicide, 
metalaxyl. This is used as a soil or seed treatment and 
also possesses some curative activity. It moves readily 
from roots to shoots, although its lateral movement is 
slight. Some fungal pathogens have developed resis¬ 
tance to metalaxyl, and its use tends to be 
recommended in conjunction with a broad-spectrum 
fungicide. 

Benzimidazole fungicides include some well- 
established and important chemicals, including beno- 
myl, carbendazim, and thiabendazole. They are 
converted to methyl benzimidazole carbamate (MBC, 
carbendazim), which interferes with cell division. 
Benzimidazoles are broad spectrum in activity. Thus, 
benomyl is effective against a wide range of leaf spots, 
blights, rots, scabs, seedborne, and soilborne diseases. 
It can be applied to plants as a seed treatment, foliar 
spray, trunk injection, root dip, or fruit dip. 

Oxanthiins, such as carboxin and oxycarboxin, pos¬ 
sess the distinction of being the first chemicals with 
demonstrated systemic activity. They are active against 
some smut and rust pathogens, and also against Rhi- 
zoctonia. By inhibiting succinate dehydrogenase in fun¬ 
gal mitochondria, they affect respiration. 
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Organophosphate fungicides, such as foestyl-Al 
and phosphorous acid, are effective against Oomycete 
pathogens on a range of crops. Interestingly, fosetyl-Al 
has been reported to trigger plant defense mechanisms, 
e.g., phytoalexin biosynthesis and accumulation. 

Other systemic fungicides include the pyrimidines, 
such as ethirimol and bupirimate, with activity 
against powdery mildews, and the triazoles, 
such as triadimefon, propiconazole, cyprodinil, and 
tebuconazole. The triazoles exhibit long-acting protec¬ 
tive and curative activities against a broad spectrum of 
pathogens. 

The most recently developed group of fungicides is 
the strobilurins. They are based on chemicals extracted 
from the wood rotting fungus Strobilurus tenacelus , and 
a range of highly effective strobilurins have been devel¬ 
oped over the years. They work by blocking electron 
transfer at the site of quinol oxidation (Qo site, 
hence their description as Qol fungicides), thereby 
preventing ATP formation in respiration. These fungi¬ 
cides move trans-laminarly within leaves and some also 
move systemically through the vascular system. Impor¬ 
tant strobilurins include azoxystrobin, pyraclostrobin, 
and kresoxim methyl. They have broad-spectrum activ¬ 
ity, and some also possess growth-promoting activity 
on treated plants, apparently by delaying senescence 
and altering plant-water relations. 

Development of Resistance to Fungicides Until the 
advent of systemic fungicides, fungicide resistance was 
rare. The development of resistance in Pyrenophora to 
mercury used as a seed dressing, in the apple scab 
fungus Venturia inaequalis to dodine, and in 
Penicillium species to diphenyl compounds are 
exceptions. In contrast, the use of systemic fungicides 
soon led to the appearance of fungicide resistance. 
Thus, the pyrimidine fungicide dimethirimol was 
introduced in 1968, and by 1971, strains of the 
powdery mildew fungus Sphaerotheca fuligenea were 
detected on glasshouse cucumbers in the Netherlands. 
Similarly, following the introduction of Qol fungicides 
in the UK in 1996, resistance to powdery mildew in 
wheat was first recorded in 1998. 

It seems that where a single site systemic fungicide 
is used intensively, there is the risk of fungal pathogens 
developing resistance to it. With single site fungicides, 
only a single mutation in the fungus might be required 


for resistance to develop. Some of the mechanisms by 
which a fungus might develop resistance following such 
a mutation include: (1) detoxification of the chemical, 
(2) decreased permeability of fungal cell membranes to 
the fungicide, (3) reduced affinity of the fungicide to 
the reactive site within the fungal cell, and (4) bypassing 
a blocked reaction via an alteration in fungal 
metabolism. 

Given that fungicide resistance is now an accepted 
fact of life in crop protection, how can it be managed 
effectively? The first point to remember is that fungi¬ 
cides to which resistance has developed can still be 
useful in disease control if deployed sensibly. Strategies 
to minimize the risk of fungicide resistance developing 
include: 

1. Reducing fungicide use 

• Applying fungicides only when necessary 

• Using fungicides as part of an integrated disease 
control program, including, for example, 
appropriate cultural control, and if available, 
resistant varieties 

2. Diversifying fungicide treatments 

• Avoiding the repeated use of fungicides with the 
same mode of action 

• Instead, use mixtures of fungicides with differ¬ 
ent modes of action 

• During the growing season, alternating fungi¬ 
cides with different modes of action 

• Including a multisite fungicide in any fungicide 
mixture or spray program 

Pesticides and Nontarget Organisms Pesticide use has 
greatly increased the quantity and improved the quality 
of food for the increasing world population. However, 
as pesticide use increased, so did concern about their 
adverse effects on nontarget organisms, including 
humans. Increasing public concern about the 
accumulation of pesticides in the environment and 
the impact on nontarget organisms has led to the 
introduction of rigorous regulatory processes [83, 
84]. Nevertheless, there is continued concern over the 
impacts of pesticides on wildlife, including invertebrate 
populations, wild plants, and farmland birds [85]. 
These concerns have led to reviews of active 
substances used in plant protection, with the resulting 
withdrawal of an increasing number of crop protection 
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products from the market [86]. This has created 
problems, and in some situations, effective control 
measures are no longer available to meet all the 
challenges posed by pathogens, pests, and weeds [86]. 

Biological Control of Plant Diseases Using 
Antagonistic Microorganisms 

Biological control can be defined as the control of a plant 
pathogen using another living organism or organisms. 
This definition includes direct and indirect effects, as 
a result of either the introduction of antagonists, or 
manipulation of existing microbial populations to 
reduce disease. This section will deal with biological 
control using antagonistic microorganisms. 

Antagonistic microorganisms control plant disease 
because they weaken or destroy the pathogen. They 
achieve this by various mechanisms, including: 
(1) directly parasitizing the pathogen, (2) producing 
antibiotics or toxins that exert an effect on the patho¬ 
gen, (3) competing for space and nutrients with the 
pathogen, and (4) producing hydrolytic enzymes that 
destroy components of pathogen cells. 

To be effective in disease control, a biological con¬ 
trol agent (BCA) must be able to colonize a particular 
habitat or occupy a niche in sufficient numbers to 
disrupt the growth and survival of the target pathogen. 
For this reason, the most effective BCAs are likely to be 
found in the environment in which they are to be used. 
Thus, if the pathogen to be controlled infects plant 
roots, the best place to look for a potential BCA 
would be the rhizosphere. Finding appropriate BCAs 
is one thing, but using them in practice is quite another. 
For a start, as a living organism, the BCA must be 
formulated in such a way as to allow it to remain viable 
and survive following application [87]. Great advances 
have been made in the production and formulation of 
BCAs over the years, but despite the fact that 1,000 of 
microorganisms have been shown to interfere with 
growth and survival of plant pathogens under con¬ 
trolled and field conditions, relatively few have been 
registered and used in commercial practice. Currently, 
there are more than 50 bacterial products and 50 fungal 
products available commercially. The majority of both 
bacterial and fungal products are sold for control of 
seedborne or soilborne pathogens (Table 2), with fewer 
for foliar pathogens and timber decay fungi and even 


fewer for postharvest pathogens. Most contain individ¬ 
ual microorganisms although there are some products 
that contain microorganism mixtures, and some indi¬ 
vidual microorganism strains are marketed in several 
different products expanding the potential market of 
a single active ingredient. Bacterial products are dom¬ 
inated by Bacillus species reflecting their ease of growth 
and production of long-lived spores mentioned earlier. 
Fungal products are dominated by Trichoderma 
spp. which are also easy to produce, generally have a 
low toxicity, and can sporulate well. 

America’s first biological fungicide seed treatment, 
Kodiak (marketed by Bayer), contains spores of Bacillus 
subtilis GB03 for control of Alternaria , Aspergillus , 
Fusarium , and Rhizoctonia spp. that attack root systems 
of a number of plants, including seed and pod vegeta¬ 
bles, cotton, peanut, soybean, wheat, barley, and maize. 
The spore concentrate can be applied through standard 
slurry or mist seed treatment equipment, with fungi¬ 
cides if required, and the bacterium colonizes the root 
system providing control over the whole growing 
season. 

Numerous products contain strains of Trichoderma 
harzianum , but one isolate, KRL-AG2 (T-22), sold by 
BioWorks Inc., USA, has been used and developed for 
several markets in US horticulture and agriculture in 
a number of different forms. When applied to soil, 
planting mixes, or turf, this BCA colonizes plant roots 
and provides protection against root pathogens such as 
Cylindrocarpon , Fusarium , Pythium , Rhizoctonia , and 
Thielaviopsis. RootShield granules are largely targeted 
at glasshouse and nursery use, and can be incorporated, 
top-dressed, broadcast, or applied in-furrow or to 
planting holes for use on flowers, bedding plants, orna¬ 
mentals, vegetables, pome and stone fruit, trees, and 
tree nuts. For agriculture, T-22 Planter Box is applied as 
a coating on seeds and seed pieces, an in-furrow spray, 
and as a transplant starter and T-22 HC as a broadcast 
or in-furrow treatment, both for use on field and row 
crops, hay and forage crops, bulbs and vegetables. In 
Europe, T-22 (Koppert, the Netherlands) is available as 
TRIANUM-G and TRIANUM-P, wettable granule and 
wettable powder formulations. Much of the scientific 
background concerning the use of this isolate has been 
extensively reviewed [88]. 

In terms of the future, advances that can improve 
quantity, quality, and shelf life of inocula would be 
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Crop Diseases, Management and Control of. Table 2 Examples of bacteria and fungi registered or commercially 
marketed as biological control agents for control of soilborne and seedborne plant pathogens (from [87]) 


Antagonist 

Target pathogen(s)/ 
activity 

Disease/host 

Product name and source 

BACTERIA 

Agrobocterium 

rodiobacter 

Agrobacterium tumefaciens 
causing root galls 

Ornamentals and other 
plants sensitive to A. 
tumefaciens root galls 

Norback 84-C, Galltrol-A, Nogall, 
Diegall, Dygall (Becker Underwood 
Pty Ltd, Australia; Bio-Care 
Technology Pty Ltd. Australia; New 
BiProducts, Inc. USA; AgBiChem 

Inc., USA; Agbioresearch Ltd. 

New Zealand) 

Bacillus cereus BP01 

Plant growth promotion 

Cotton 

Mepplus (MicroFlo Co. LLC, USA) 

B. pumilus GB34 

Fusarium spp v R. solani 

Soybean 

YieldShield concentrate; GB34 
Biological Fungicide (Gustafson 

LLC, USA) 

Bacillus licheniformis 

SB3086 

Numerous pathogens, 
especially Sclerotinia 
homeocarpa 

Ornamentals, turf 

EcoGuard, Green Releaf 
(Novozymes Biologicals Inc., USA) 

B. subtilis 

Pythium damping-off 

Tomato 

Cillus, Green-all G (Greenbiotech 

Co, Korea) 

R. solani, Fusarium spp., 
Alternaria spp., and 
Aspergillus spp. pathogens 

Root rots and seedling 
diseases generally 

Kodiak, Epic, Concentrate, Kodiak 

HB, Quantum 4,000, System 3 
(Gustafson LLC, USA) 

Fusarium spp., Verticillium 
spp., R. solani, and Pythium 
spp. pathogens 

Various vegetable and 
field crops 

PHYTOVIT WG (Prophyta 
Biologischer Pflanzenschutz GmbH, 
Germany) 

B. subtilis GB03 

Fusarium spp., 

Phytophthora spp., Pythium 
spp., R. solani 

Ornamentals, turf, dry 
and snap bean, cotton, 
peanut, soybean, wheat, 
and barley 

Companion, Kodiak (Growth 
Products Ltd, USA; Gustafson LLC, 
USA; Bayer CropScience LP, USA) 

B. subtilis MB 1600 

Alternaria spp., Aspergillus 
spp., Fusarium spp., 

Pythium spp., R. solani 

Alfalfa, dry/snap beans, 
peanut, soybean, field 
crops, turf, cotton 

HiStick N/T, Pro-mix, Subtilex; 
Subtilex HB (Becker Underwood 

Inc., USA; Premier Horticulture Inc, 
Mexico) 

B. subtilis subsp. 
amyloliquefaciens FZB24 

Fusarium spp., R. solani 

Various vegetables and 
ornamentals 

Taegro, Tae-Technical (Earth 
Biosciences Inc. USA) 

B. subtilis , Bacillus 
circulars, Bacillus 
amyloliquefaciens, 
Paenibacillus polymyxa 
(Mixture) 

Damping-off (bacterial) 
diseases 

All crops 

Hydroguard (American Agritech, 
USA) 

Burkholderia cepacia a 

Pythium spp., Fusarium 
spp., R. solani, nematodes 

Alfalfa, beans, clover, 
cotton, peas, wheat, 
vegetables, and others 

Deny, Blue Circle (Stine Microbial 
Products, USA) 

Pythium spp., Fusarium 
spp., R. solani, 

Maize, vegetables, 
cotton 

Intercept (Soil Technologies Corp., 
USA) 
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Antagonist 

Target pathogen(s)/ 
activity 

Disease/host 

Product name and source 

Erwinia carotovora, 
nonpathogenic 

Bacterial soft rots 

Vegetables, cruciferous 
plants, rice 

Biokeeper (Central Glass Co. Ltd., 
Japan) 

P. polymyxa AC-1 

Damping-off 

Cucumber 

NH, Topseed (Greenbiotech Co., 
Korea) 

Pseudomonas sp. 

Growth promotion, various 
seedborne and soilborne 
diseases 

Vegetables, potato, 
cereals, etc. 

Proradix (Sourcon-Padena, 

Germany) 

Pseudomonas 
aureofaciens TX-1 

Various turf grass 
pathogens 

Turf grass diseases 

Spotless (Turf Science laboratories 
Inc, USA) 

Pseudomonas 
chlororaphis MA 342 

Drechslera spp., Septoria 
spp., Fusarium spp. 

Cereal seedborne 
diseases 

Cedomon (BioAgri AB, Sweden) 

P. chlororaphis 63-28 

Pythium spp. R. solaniF. 
oxysporum 

Stem and root rots, and 
wilt disease in various 
crop plants 

AtEze (Eco Soil Systems Inc., USA) 

Pseudomonas fluorescens 

Frost damage, E. amylovora 

Fruit, potato, tomato, 
berries 

BlightBan A506 (NuFarm Inc., USA) 

Pseudomonas 

solanacearum, 

nonpathogenic 

P. solanacearum 

P. solanacearum rots in 
vegetables 

PSSOL (Natural Plant Protection, 
France) 

P. syringae 

Botrytis spp., Penicillium 
spp., Mucor spp. 

Fruit, potato 

Bio-save (EcoScience Corp., USA) 

Streptomyces 
colombiensis WYE20 

R. solani 

Turf 

Mycocide (KIBC Co., Korea) 

Streptomyces goshikiensis 
WYE324 

R. solani 

Rice, turf 

Safegrow (KIBC Co, Korea) 

Streptomyces griseoviridis 
K61 

Various soilborne 
pathogens 

Vegetable and 
ornamental soilborne 
diseases 

Mycostop (Verdera Oy, Finland) 

Streptomyces lydicus 
WYCD108 

Various soilborne 
pathogens 

Root rots in many crops, 
turf, and ornamentals 

Acinovate (Natural Industries 

Inc. USA) 

FUNGI 

Aspergillus flavus AF36 

A. flavus (aflatoxin +) 

Cotton 

AF36 (Arizona Cotton Research and 
Protection Council, USA) 

A flavus NRRL 21882 

A. flavus (aflatoxin +) 

Peanut 

Afla-guard (Circle One Global Inc, 
USA) 

Coniothyrium minitans 
CON/M/91-08 

Sclerotinia minor, 

S. sclerotiorum 

Protected vegetable and 
field crops 

Contans WG (Prophyta Biologischer 
Pflanzenschutz GmbH, Germany; 
Sylvan Bio Products Inc, USA) and 
Intercept WG (Encore Technologies, 
MN, USA) 

C. minitans KONI 

5. minor, 5. sclerotiorum 

Glasshouse crops and 
amenity areas 

KONI (Bioved Ltd., 

Szigetszentmiklos, Hungary) 
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Antagonist 

Target pathogen(s)/ 
activity 

Disease/host 

Product name and source 

F. oxysporum Fo47 

F. oxysporum 

Asparagus, basil, 
carnation, cyclamen, 
gerbera, tomato 

Fusaclean (Natural Plant Protection, 
France) 

Gliocladium catenulatum 
J1446 

Pythium spp., R. solani, and 
numerous other pathogens 

Damping-off of 
vegetables, herbs, 
ornamentals, and 
numerous other plants 

Prestop Mix, Prestop WP, Primastop 
(Verdera Oy, Finland) 

Gliocladium ( Trichoderma ) 
virens G-21 

Pythium ultimum, R. solani 

Damping-off of bedding 
plants, greenhouse 
crops, and ornamentals 

SoilGard 12 G formerly GlioGard 
(Certis Inc, Columbia, MD, USA) 

Pythium oligandrum 

Numerous diseases 

Numerous crops 

Polyversum (Biopreparaty Ltd, 

Czech Republic) 

Trichoderma spp. 

Soilborne fungal 
pathogens 

Turf, glasshouse crops, 
and field crops 

Trich-A-Soil (Becker Underwood Pty 
Ltd, Australia) 


Sclerotium cepivorum, 
Pyrenochaeta 

Onion 

Tenet (Agrimm Tecnologies, 

New Zealand) 


Soilborne plant pathogens 

Ornamentals, fruit, turf, 
olive, vine 

Trichomic (AMC Chemical, Spain) 


Armi IIaria 

Tree seedlings 

Arborguard (Biodiscovery Ltd, 

New Zealand) 

T. harzianum T-22 
(KRL-AG2) 

Fusarium spp., Pythium 
spp., Cylindrocarpon spp., 
Thielaviopsis spp., R. solani, 

5. homeocarpa 

Range of crops, 
ornamentals, and turf 

T-22 HC, T-22 Planter Box, T-22 
Granules PlantShield HC, 

RootShield drench and granules, 
TurfShield, TRIANUM-P, TRIANUM-G 
(Bio-Works Inc, Fairport, NY, USA; 
Koppert, the Netherlands) 

T. harzianum 

Various fungi 

Legumes and leaf 
vegetables 

Supresivit (Borregaard and Reitzel, 
Denmark or Fytovita, Czech 
Republic) 


Pythium spp., R. solani 
seedling diseases 

Numerous crops 

Eco-T (Plant health Products, South 
Africa) 

T. harzianum GBF-0208 

Numerous pathogens 

Vegetables, bulbs, turf 

Green-all T WP (Green Biotech Co. 
Ltd., Korea) 

T. harzianum + 
Trichoderma viride 

Fusarium spp., 

Phytophthora spp., Pythium 
spp., and R. solani 

Field crops, vegetables, 
ornamental and turf 

Trichodry, Trichoflow, Trichogrow, 
Trichopel R Trichopel 
(Agrimm Technologies, New 
Zealand) 

T. harzianum T 35 + 

T. harzianum TH315 

Fusarium spp., Pythium 
spp., R. solani, 5. rolfsii 

Seedlings diseases on 
a range of crops and 
potato 

Root Pro and Root-Protato 
(Mycontrol, Hamovil, Israel) 
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Antagonist 

Target pathogen(s)/ 
activity 

Disease/host 

Product name and source 

T. harzianum + 

T. polysporum 

Various root-infecting fungi 

Glasshouse crops 

BINAB-T W P(Bio-lnnovation Eftr AB, 
Bredholmen, Sweden; or Svenska 
Predator AB, Sweden; or Bayer, 
Sweden) 

T. viride 

Fusarium spp., Pythium 

Damping-off, foot rots 

Ecoderma (Margo Biocontrols 


spp R. solani 

and collar rots of several 
plants 

Private Ltd., Bangalore, India) 


a B. cepacia is no longer available in North America because of concerns over the potential for human pathogenicity of some strains of this 
species 


welcome, particularly for bacteria such as Pseudomonas 
that do not form spores. In recent work, both Pseudo¬ 
monas and Trichoderma isolates have been simulta¬ 
neously applied to seed via drum priming and found 
to survive and proliferate on roots similarly to when 
applied individually [89] . This procedure maybe a way 
to apply and maintain multiple BCAs in a commer¬ 
cially relevant process. Further approaches include 
integration with other control strategies such as cul¬ 
tural methods and chemical treatments [87]. A better 
understanding of the natural ecology of any potential 
BCA might facilitate a more rational approach to pro¬ 
duction and use. 

Induced Resistance to Control Crop Diseases 

Following infection by a microbial pathogen, suscepti¬ 
ble plants can develop an enhanced resistance to fur¬ 
ther infection [90] . This is known as induced resistance 
and can be split broadly into two types: systemic 
acquired resistance (SAR) and induced systemic resis¬ 
tance (ISR). 

In SAR plants, develop a broad-spectrum systemic 
resistance to pathogen infection following a localized 
infection by a necrotizing pathogen or treatment with 
various agents, e.g., acibenzolar-S-methyl (ASM) or 
Probenazole (Oryzemate®). SAR is associated with 
increased levels of salicylic acid (SA) and with the 
coordinate expression of a specific set of genes 
encoding PR proteins ([91]; Fig. 3). Moreover, appli¬ 
cation of SA or one of its functional analogues, such as 
ASM, induces SAR and activates the same set of PR 
genes [92]. Indeed, expression of a set of PR genes, and 
PR-1 in particular, is used as a marker for SAR 


induction, although it is important to note that the 
induction of resistance is not always accompanied by 
PR-1 expression [93, 94]. 

In contrast to SAR, ISR develops as a result of 
colonization of plant roots by certain strains of plant 
growth-promoting rhizobacteria (PGPR) and is medi¬ 
ated by jasmonic acid- (JA) and ethylene (ET)- 
sensitive pathways ([91]; Fig. 3). Phenotypically, ISR 
is similar to SAR in that it acts unspecifically against 
taxonomically different pathogens [91, 95]. 

The resistance responses described above can be 
associated with direct activation of defenses. However, 
such responses can also be associated with an ability to 
“recall” previous infection, root colonization, or chem¬ 
ical treatment. This phenomenon is known as priming 
and results in plants responding more rapidly and 
effectively when exposed to subsequent pathogen 
attack ([96]; Fig. 3). Usually, no changes in gene 
expression or in the levels of resistance traits are detect¬ 
able in response to the priming agent alone, which 
might be a chemical elicitor such as ASM or a challeng¬ 
ing pathogen. Interestingly, priming of resistance is 
usually caused by agents that fully induce resistance 
when applied at higher doses [97, 98] and suggests 
that direct resistance induction and priming might 
differ from one another quantitatively rather than 
qualitatively. 

A wide range of microbes and chemicals are known 
to induce resistance and it seems likely that other forms 
of induced resistance exist. Thus, it is well known that 
the nonprotein amino acid, [3-aminobutyric acid 
(BABA), can induce resistance in a variety of crop 
plants [99]. BABA-induced resistance (BABA-IR) 
has been used as a model for the study of priming 
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A model for the signal transduction network controlling induced systemic resistance (ISR) mediated by plant growth- 
promoting rhizobacteria and pathogen-induced systemic acquired resistance (SAR) in Arobidopsis thaliana. LPS, 
lipopolysaccharide; PRs, pathogenesis-related proteins; AVR, avirulence gene product; R, resistance gene product; HR, 
hypersensitive response; SA, salicylic acid; JA, jasmonic acid; ET, ethylene; NPR1, a regulatory protein involved in signaling 
in SAR and ISR in A. thaliana; SNI, transcriptional repressor of SAR genes; TGA transcription factors, family of transcription 
factors interacting with SA-induced NPR1 (From [91]) 


and in Arobidopsis, it is based on various mechanisms. 
Thus, BABA-IR against P. syringae and B. cinerea func¬ 
tions via priming for SA-inducible defenses, while 
against a different set of pathogens ( Hyaloperonospora 


parasitica , Plectosphaerella cucumerina and Alternaria 
brassicicola ), it is based on priming for resistance 
through the formation of callose-rich papillae 
[ 100 - 102 ]. 
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Since the introduction of the first chemical resistance 
activator Probenazole (registered in Japan as 
Oryzemate®, Meiji Seika Kaisha Ltd) in 1975, a number 
of other chemical and microbial activators have been 
developed, including ASM, registered as Bion® and 
Actigard® (Syngenta), Milsana® ( Reynoutria sacalinensis 
extract, KHH BioScience Inc. USA), Elexa® (chitosan, 
SafeScience, USA), and Messenger® (harpin protein, 
Eden Bioscience, USA). However, although high levels 
of disease control can be achieved with plant activators 
in controlled environments, their performance under 
field conditions has been less impressive. In fact, the 
moderate levels of disease control and high levels of 
variability exhibited by plant activators in the field have 
been instrumental in the very slow uptake of induced 
resistance in crop production systems. In the following 
paragraphs, the performance of selected plant activators 
under field conditions will be discussed. 

Probenazole (3-allyloxy-1,2-benziothiazole-1,1 - 
oxide) was first introduced for the control of rice 
blast disease ( Pyricularia oryzae ) and bacterial blight 
(.Xanthomonas oryzae). It is widely used in Asia where it 
is applied as a granular treatment either to paddy fields 
or as a seedling box treatment. Following application, 
the compound is absorbed by the roots, then systemi- 
cally transferred to the rest of the plant and can control 
rice blast disease for between 40 and 70 days post 
application [103]. However, despite continuous use 
since its introduction, there have been no reports of 
pathogen insensitivity to probenzole and indeed, it still 
accounted for 53% of the chemicals used for seedling 
box treatments on rice in 2005 [103]. It is believed this 
is because the compound is only weakly toxic to fungi 
and activates disease defense systems in rice [104, 105]. 

A large body of data has accumulated on the effi¬ 
cacy of ASM against a range of diseases under field 
conditions [106, 107]. Most studies report disease con¬ 
trol, although the level of control ranges from 4% to 
99%. High levels of disease control were achieved on 
tobacco, where infections by P. syringae pv. tabaci , 
Cercospora nicotianae , and Alternaria alternata were 
reduced by 99%, 91%, and 89%, respectively [108, 
109]. In wheat, the crop that ASM was originally 
aimed at, disease control was not so impressive, ranging 
from 35% for Puccinia recondita and Septoria spp., to 
77% for B. graminis f.sp. hordei [110, 111]. ASM even 
increased disease levels in peanut, where infection by 


Cercosporidium personatum was greater than untreated 
controls by 52% [112]. Working on oilseed rape, 
Liu et al. [113] found that pretreatment with ASM in 
October/November decreased the number of leaf 
lesions caused by the Phoma stem canker pathogen 
L. maculans in the autumn/winter, as well as the 
severity of stem canker in the subsequent spring/sum- 
mer. In this work, reductions in numbers of leaves with 
lesions were between 25% and 55% [113]. More 
recently, ASM was shown to reduce infection of barley 
by the leaf scald pathogen R. secalis by 45% [114]. 

Chitosan is a common polymer in shells of crusta¬ 
ceans, exoskeletons of insects, and cell walls of fungi 
[ 1 15] . A commercial formulation, Elexa®, contains 4% 
chitosan as its active ingredient and has been shown to 
protect a range of crops against pathogens. For exam¬ 
ple, when used as a seed treatment, it reduced downy 
mildew severity on pearl millet by 58%, and when used 
as a foliar spray, it reduced infection by 75% [116]. 
When used on grapevines, eight applications of Elexa® 
applied over the season reduced the incidence of downy 
mildew by 50% and powdery mildew by 75% com¬ 
pared to untreated controls [117]. 

A number commercially available products are 
based on microbial proteins. One such is Messenger®, 
which is based on the protein harpin obtained from 
Erwinia amylovora [118]. Used as a crop protectant, 
Messenger® has had mixed success. For example, 
although it possessed good efficacy against blue mold 
in apples [119], its efficacy against gray mold in straw¬ 
berry [120] and target spot of tomato [121] was poor. 
In some interesting recent work, Chen et al. [122] 
generated specific fragments of HpaG Xo0 c> a harpin 
from X. oryzae pv. oryzicola , and found that one of 
these fragments, HpaG 10 _ 42 , stimulated growth of rice 
plants and provided enhanced resistance to X. oryzae 
pv. oryzae and M grisea. HpaG 10 _ 42 was also shown to 
control bacterial blight, rice blast, and sheath blight, 
and to increase grain yields, under field conditions 
[123]. Here the level of disease control depended on 
the cultivar, with greater control obtained with indica 
compared to japonica cultivars. 

ISR was first shown to be effective under field con¬ 
ditions in the mid-1990s, when application of PGPR as 
a seed treatment followed by soil drench application led 
to a reduction in severity of bacterial wilt [124], and 
control of bacterial angular leaf spot and anthracnose 
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[125] . Subsequent research by Raupach and Kloepper 

[126] showed that treatment of cucumber seed with 
PGPR led to increased plant growth and control of 
angular leaf spot and anthracnose. Field experiments 
in Thailand in 2001 and 2002 studied the effects 
of PGPR, used alone or as mixtures, on control of 
southern blight of tomato caused by Sclerotium rolfsii , 
anthracnose of long cayenne pepper caused by 
Colletotrichum gloeospoiroides , and mosaic disease of 
cucumber caused by cucumber mosaic virus (CMV) 

[127] . Mixtures of PGPR (all Bacillus spp.) were found 
to suppress disease more consistently than the PGPR 
strain Bacillus pumilus IN937b, used alone. 

Induced resistance offers the prospect of durable, 
broad-spectrum disease control using the plants own 
resistance. However, it is plagued by inconsistency and 
relatively poor disease control compared with fungi¬ 
cides. These problems relate to the fact that induced 
resistance is a host response and as such is greatly 
influenced by genotype and environment. Unfortu¬ 
nately, the understanding of the impact of these influ¬ 
ences on induced resistance is poor, as is the 
understanding of how best to use induced resistance 
in crop protection practice. Understanding in these 
areas needs to improve, and answers to several impor¬ 
tant, practical questions are required, for example: 
(1) Should resistance inducing agents be applied early 
or late in the season? (2) Is induced resistance effective 
against pathogens with long periods of asymptomatic 
growth in plant tissue, e.g., R. secalis on barley? (3) Can 
resistance inducers be used as a means of reducing fun¬ 
gicide applications to crops, e.g., can resistance inducers 
be applied early to reduce pathogen infection and colo¬ 
nization, thereby allowing less fungicide to be used? 
What is required is research related to specific crops 
aimed at trying to determine how best to fit induced 
resistance into disease control programs. Farmers and 
crop protectionists have grown accustomed to high 
levels, or even complete, disease control. Ultimately, for 
induced resistance to gain more widespread acceptance 
in crop protection, there will need to be a lowering of 
expectation in terms of levels of disease control. 

Other Approaches to Controlling Crop Diseases 

Biofumigation Biofumigation involves the suppres¬ 
sion of plant pests and diseases by biocidal hydrolysis 


products, most notably the isothiocyanates released by 
glucosinolate (GSL)-containing plants in soil. It can 
involve GSL-containing plants as rotation crops, or 
intercrops by incorporating fresh plant material as 
green manure, or by utilizing processed plant products 
high in GSLs, e.g., seed meals. Strategies for the field 
implementation of biofumigation are described in 
detail by Kirkegaard [ 128] . 

Soil Solarization Soil solarization, or soil heating, 
harnesses solar energy in order to increase soil temper¬ 
ature. This is commonly achieved by mulching (cover¬ 
ing, tarping) the soil with transparent polyethylene or 
other transparent plastic sheets. The following are a few 
of the important principles of soil solarization: (1) It 
should be carried out during periods of high tempera¬ 
ture and intense solar radiation, with low levels, or no 
precipitation. (2) Soil needs to be moist in order to 
increase the thermal sensitivity of resting structures 
and improve heat conduction. (3) The mulching 
period needs to be sufficiently long (~4 weeks or 
greater) in order to achieve disease control at all desired 
depths. The efficacy of soil solarization in controlling 
crop diseases and the mechanisms involved are 
discussed in detail by Gamliel and Katan [129]. 

Integrated Control of Plant Diseases 

As pointed out by Agrios [130], plant disease control is 
most effective when all of the relevant information 
regarding the crop, potential pathogens, previous dis¬ 
ease history, availability of host resistance, environmen¬ 
tal conditions, etc., are taken into account in devising the 
disease control program. An integrated program of dis¬ 
ease control aims to (1) eliminate or reduce the initial 
pathogen inoculum, (2) reduce the effectiveness of the 
initial pathogen inoculum, (3) maximize host plant 
resistance, (4) delay the onset of disease in the plant/ 
crop, and (5) slow down the development and progres¬ 
sion of secondary cycles [130]. 

The precise approach adopted will depend on the 
crop and its disease spectrum, but would probably 
include several of the following: (1) appropriate 
hygiene/sanitation, e.g., using disease-free seed/plant- 
ing material, (2) using appropriate cultural measures, 
(3) using resistant plant varieties, (4) using appropriate 
fungicides, and (5) using biologically based methods, if 
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available and appropriate, e.g., biological control, 
induced resistance. 

Control of late blight (P. infestans ) on potato pro¬ 
vides a good example of an integrated approach to 
disease control (see www.endure-network.eu). For late 
blight, the first step in an integrated strategy is reducing 
primary sources of inoculum, such as avoiding infected 
seed tubers, using certified potato seed, preventing or 
reducing oospore production by controlling volunteer 
potatoes, for example. The next step is variety choice, 
since use of a variety expressing some resistance to late 
blight offers the potential to reduce fungicide inputs as 
part of an integrated strategy. Nevertheless, fungicides 
play a crucial role in integrated control of late blight. 
Although control measures are aimed mainly at 
preventing infection, if late blight appears in a crop, 
the control strategy switches to stopping or reducing 
the epidemic. Effective control of late blight depends 
on access to information such as phase of crop growth, 
fungicide products available, dose rates, timings, 
weather conditions, as well as access to tools, such as 
an appropriate decision support system (DSS). The 
DSS can integrate all relevant information to generate 
spray recommendations, thereby increasing the efficacy 
of the control strategy without increasing risk. 

Future Directions 

High crop yields are maintained in most developed 
countries through the use of improved varieties, together 
with fertilizers and pesticides. In these countries, farmers 
and growers are accustomed to achieving high levels of 
disease control with fungicides, although, as discussed 
above, the development of fungicide resistance can erode 
fungicide efficacy. Levels of disease control obtained with 
many biologically based control methods are lower than 
those achieved using fungicides, and moreover, many 
biologically based methods tend to provide inconsistent 
disease control. Thus, although induced resistance can 
provide high levels of disease control on some crops, with 
many crops, disease control is less impressive. Expression 
of induced resistance in crop plants can also be variable, 
depending, for example, on genotype and environment. 
Problems also exist with variability and inconsistency 
of disease control provided by some BCAs. Farmer per¬ 
ceptions of inadequate and inconsistent disease control 
will not persuade them to adopt biologically based 


approaches. Minimizing the effects of these problems 
clearly requires further research. 

In spite of the huge effort by researchers to develop 
novel biologically based solutions for disease control 
(such as BCAs, plant-derived substances, induced resis¬ 
tance agents), few products have reached the market¬ 
place. The relatively high cost of registration, together 
with the limited market size for some products, has been 
identified as a major barrier. However, this problem has 
been recognized by regulatory authorities, and in the 
UK, for example, the Chemicals Regulation Directorate 
(Pesticides) has launched a scheme that encourages 
applicants to register their products. Under this scheme, 
the requirements for registration can be tailored to the 
product type and importantly, the application fee can be 
reduced [86]. 

The continued ability of pathogens to overcome host 
resistance genes and to develop resistance to fungicides 
seriously erodes the ability to provide effective, lasting 
disease control on important crops. These problems 
combined with the withdrawal of active substances 
from the market and increasing public concern with the 
effects of pesticides on the environment create a huge 
challenge for plant pathology in the future. Plant disease 
control has an important role to play in efforts to feed the 
world s increasing population. However, providing effec¬ 
tive and lasting disease control, without harming the 
environment, will require not just a sensible approach to 
the use of host resistance and fungicides, but also a range 
of innovative approaches. In some situations, innovative 
control methods might be used to complement existing 
approaches. In other cases, for example for those diseases 
for which no adequate control measures exist, innovative 
control options might provide the only solution. 
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Glossary 

Acetosyringone A phenolic compound that activates 
the Agrobacterium tumefaciens vir genes and thus 
initiates DNA transfer. 

Agrobacterium-mediated transformation The intro¬ 
duction of DNA into plants by the bacterium 
Agrobacterium tumefaciens. 

Bactofection Gene transfer to animal cells mediated 
by bacteria. 

BIBAC A binary vector for plant transformation based 
on the high-capacity bacterial artificial chromo¬ 
some (BAC). 


Binary vector A plant transformation vector in two 
parts, one carrying the T-DNA and one carrying the 
vir genes. 

Cassette A modular DNA sequence designed for por¬ 
tability between different vectors. 

Chimeric (transformed plants) Comprising cells with 
different genotypes, usually a first-generation 
transformant (T 0 ) where some cells are transgenic 
and others are not. 

Concatemer A DNA molecule containing multiple 
contiguous copies of the same sequence. 

Conjugation DNA transfer between cells through 
a specialized conduit known as a pilus. 

Counterselectable marker A selectable marker gene 
whose absence is required for cells to survive. 

Destination vector The vector designated to receive 
a DNA cassette during LR recombination in the 
Gateway system. 

Direct DNA transfer DNA transfer mediated by phys¬ 
ical or chemical means rather than by bacteria or 
viruses. 

Entry clone The vector holding the DNA cassette 
that needs to be transferred to the Destination 
vector during LR recombination in the Gateway 
system. 

Episomal A genetic entity that replicates indepen¬ 
dently of the host chromosome, such as a plasmid 
or nonintegrating virus. 

Explant A piece of plant tissue transferred to culture 
and propagated independently. 

Filler DNA Extra DNA added at junctions during ille¬ 
gitimate recombination, either by synthesis across a 
template or random addition of nucleotides. 

Gateway A proprietary cloning system based on LR 
recombination, a form of site-specific recombina¬ 
tion using the attB and attV sites of Escherichia coli 
and bacteriophage X. 

Gene targeting Disruption of a preselected gene by 
homologous recombination with a DNA cassette. 

Helper plasmid A plasmid that is not used as a cloning 
vehicle, but which supplies necessary functions in 
trans. 

Homing endonuclease A specialized type of restric¬ 
tion enzyme encoded by introns and inteins with 
a long asymmetric recognition sequence. 

Homologous recombination Recombination requir¬ 
ing long regions of homology but no sequence 
specificity. 
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Illegitimate recombination Recombination requiring 
neither long regions of homology nor specific 
sequences; it can occur by direct nonhomologous 
end-joining (all sequences conserved, sometimes 
with added filler DNA) or at regions of 
microhomology. 

In planta transformation Transformation methods 
that can be used with intact plants rather than tissue 
explants, and which therefore require no regeneration. 

Integration The insertion of one DNA sequence in the 
midst of another. 

Minichromosome A vector comprising the minimal 
elements that allow it to function as a chromosome. 

Minimal cassette The minimal sequences required to 
achieve transformation and transgene expression, 
typically a promoter, gene, and terminator. 

Multigene transfer The simultaneous transfer of more 
than one gene into a plant. 

Particle bombardment A physical transformation 
method based on the acceleration of DNA-coated 
metal particles into intact plant cells. 

Promoter The DNA sequence in front of a gene which 
controls its spatiotemporal expression profiles and 
whether its expression is sensitive to external 
stimuli. 

Reporter gene A gene whose function is to yield an 
easily detectable product that can be used to mea¬ 
sure or delimit the activity of a regulatory element 
such as a promoter. 

Screenable marker Another name for a reporter gene. 

Selectable marker A gene whose function is to confer 
on transformed cells some property that allows 
them to be propagated in conditions under which 
nontransformed cells either die or cannot grow 
efficiently. 

Southern blot (DNA blot) A method for the detection 
of DNA sequences using specific labeled comple¬ 
mentary probes. 

Stable transformation Transformation followed by 
integration of DNA into the host genome such 
that the transgene becomes a permanent new geno¬ 
mic locus. 

Superbinary vector A binary vector in which the vir 
gene functions are enhanced. 

Supervirulent An Agrobacterium tumefaciens strain 
with a broader host range or more efficient DNA 
transfer ability than normal strains. 


T-DNA The section of DNA within a binary vector 
which is transferred to the host plant. 
Transactivation domain The part of a transcription 
factor that activates transcription. 

Transient expression The expression of introduced 
DNA for a short time before the DNA is diluted 
and degraded. 

Transfection (a) In bacteria, the transfer of phage 
DNA into bacterial cells by chemical or physical 
means; (b) in animals and plants, the transfer of 
any DNA into cultured cells by chemical or physical 
means. 

Transformant A plant that has been transformed with 
exogenous DNA. 

Transformation (a) In bacteria, the transfer of plas¬ 
mid or genomic DNA into bacterial cells by chem¬ 
ical or physical means; (b) in animal cells, the 
spontaneous or induced change from a normal to 
an oncogenic phenotype; (c) in plants, any means 
of DNA transfer that does not involve the use of 
a virus. 

Transgenic Containing integrated exogenous DNA in 
the nuclear genome. 

Transplastomic Containing integrated exogenous 
DNA in the plastid genome. 

Terminator The sequence following the gene that is 
required to terminate transcription. 
vir Gene One of several genes on binary vectors that 
are required in trans to mediate T-DNA transfer. 
Zinc finger A zinc-coordinating protein motif usually 
associated with DNA binding, often found in 
sequence-specific DNA binding proteins such as 
transcription factors. 

Definition of the Subject and Its Importance 

The term transformation (or more properly “genetic 
transformation”) was first coined to describe the natu¬ 
ral process by which bacteria take up free DNA from 
their surroundings. The term was required to distin¬ 
guish this process from two other ubiquitous natural 
gene transfer mechanisms in bacteria: conjugation 
(direct transfer of DNA from cell to cell through 
a connecting tube called a pilus) and transduction 
(transfer of DNA between cells carried by the capsid 
of a virus). All three of these natural processes have 
been exploited in the laboratory as ways to introduce 
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exogenous DNA into bacterial cells, and in this context 
the definition of transformation was later refined to 
take into account the different consequences of gene 
transfer depending on the origins of the transforming 
DNA. The meaning of transformation was thus 
restricted to the uptake of naked plasmids or genomic 
DNA fragments, whereas an alternative term - trans¬ 
fection - was coined to describe the uptake of naked 
phage DNA, the distinction required because only the 
latter can initiate a phage infection. 

Unsurprisingly, the same terms have been adopted 
to describe analogous processes in other organisms, but 
the precise definitions vary due to preexisting conven¬ 
tions. Cultured animal cells are said to undergo trans¬ 
formation when they change from the normal 
phenotype (cessation of growth when confluence is 
achieved) to an oncogenic phenotype (continued 
growth, formation of foci) so “transformation” is usu¬ 
ally avoided in the context of gene transfer to prevent 
ambiguity. Therefore, transfection is used to describe 
the introduction of any naked DNA into animal cells 
regardless of its origin. The transfer of DNA from 
bacteria to animal cells (including the use of conjuga¬ 
tion-like mechanisms) is unusual and has only recently 
been adopted as a gene transfer procedure in the labo¬ 
ratory, so a new term has been coined for this process - 
bactofection . The introduction of DNA into animal 
cells using a virus as a carrier is known as transduction, 
just as it is in bacteria. 

The terminology for plants is a hybrid of 
the conventions used for bacteria and animal cells. 
Transformation refers to any gene transfer process 
where a virus is not used as a carrier (otherwise 
transduction is the correct term, as in bacteria and 
animal cells). This means that plant transformation 
includes both the uptake of naked DNA ( direct DNA 
transfer ) and the transfer of DNA by the conjugation¬ 
like method adopted by Agrobacterium tumefaciens 
and Agrobacterium rhizogenes ( . Agrobacterium- 
mediated transformation ). However, due to opera¬ 
tional analogies with cultured animal cells, the intro¬ 
duction of DNA into cultured plant protoplasts 
is also termed transfection, so in this specific context 
the terms transformation and transfection are 
synonymous. 

The fate of the introduced DNA depends on many 
factors. If it does not contain an origin of replication 


that is active in plant cells it is usually maintained in 
the nucleus for a short period of time (hours to days), 
eventually being diluted and degraded. If the intro¬ 
duced DNA contains an active expression cassette, 
then a product may be expressed during this time, 
a process known as transient expression . In a very 
small number of cells, the DNA integrates into the 
genome and becomes a permanent new locus. This is 
known as stable transformation . It is a rare event 
(approximately one in every 10,000 cells that takes up 
exogenous DNA integrates it into the genome) and 
selectable marker genes are usually employed to ensure 
that the small number of transformed cells can grow at 
the expense of their nontransformed peers, allowing 
the recovery of transgenic tissues and whole plants. 
Cells, tissues, and whole plants that are stably 
transformed with a particular fragment of exogenous 
DNA are termed transgenic if the DNA integrates into 
the nuclear genome or transplastomic if it integrates 
into the plastid genome. The integrated DNA is known 
as the transgene (depending on context this term can 
refer to a single gene or to the integrated segment of 
exogenous DNA which may contain one or more 
genes). 

If the introduced DNA does have an origin of rep¬ 
lication that functions in plants, then it may be 
maintained episomally (as an extrachromosomal rep- 
licon). This occurs when the introduced genetic mate¬ 
rial is part of a replication-competent recombinant 
plant virus, or when it includes the components of 
a plant centromere, allowing it to function as 
a minichromosome. Viral vectors are generally short¬ 
lived, so they are also used for transient expression. 
However, the ability of viruses to spread systemically 
means that the transgene may be expressed at very high 
levels throughout the plant, and the infection may last 
days or weeks. Minichromosome vectors are a relatively 
new development in plants but as with equivalent 
“artificial chromosome” vectors in yeast and animal 
cells, the idea is that they should facilitate stable trans¬ 
formation without the need for integration into the 
genome. 

Plant transformation is a fundamental component 
of both basic and applied plant biology. For basic 
research, transformation allows scientists to study 
how genes function and allows the expression of both 
endogenous genes and transgenes to be controlled. 
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This has increased our understanding of how plants 
grow, develop, and defend themselves against pests, 
diseases, and harsh environments, how photosynthesis 
is controlled, and the basis of primary and secondary 
metabolism. For applied research, transformation can 
be used to improve the agronomic performance of 
crops, making them hardier, more nutritious, more 
productive, or converting them from conventional 
crops into green factories producing chemical precur¬ 
sors, novel oils, industrial enzymes, and pharmaceuti¬ 
cals. Plants provide human beings with all types of 
useful products: food and animal feed, fibers and struc¬ 
tural materials, and small molecules that can be used as 
dyes, scents, and medicines. Plants have been cultivated 
for these products since the dawn of history, and for the 
same length of time people have sought to improve 
plants by breeding them and selecting the better¬ 
performing and most useful varieties. The limitations 
of this approach, i.e., the fact that breeders are 
restricted to the existing gene pool in each group of 
sexually compatible species, and that breeding takes 
a long time to achieve its goals, can be overcome by 
plant transformation, thus accelerating the develop¬ 
ment of plants with novel, beneficial traits. 

Introduction 

Plant transformation became a routine process in the 
1980s but several attempts to transfer DNA into plant 
tissues were reported in the previous 2 decades, 
although stable transformation was never confirmed. 
The first deliberate transformation of plant tissue with 
laboratory-created recombinant DNA was achieved in 
1983 when several researchers reported the introduc¬ 
tion of recombinant plasmids, including backbone 
sequences from cloning vectors and selectable marker 
genes [1-3]. This marked the first creation of trans¬ 
genic plant cells in the currently accepted meaning of 
the word. In the intervening 30 years, heterologous 
genes have been introduced into well over 100 different 
plant species either through the use of the soil bacte¬ 
rium A. tumefaciens or alternative strategies involving 
direct DNA transfer to plant cells and tissues. In addi¬ 
tion, many plant viruses have been developed as epi- 
somal vectors, allowing high-level transient gene 
expression, although because of their inability to 


achieve stable transformation they are not considered 
further in this article. 

Principles and Methods of Plant Transformation 

A fundamental difference between animals and plants 
is that differentiated plant tissue shows a high degree of 
developmental plasticity. Depending on the species, 
isolated stem segments, leaf disks, and seed-derived 
callus tissue may be able to regenerate an entire new 
plant under appropriate culture conditions. For most 
plant species, some form of tissue culture step is there¬ 
fore used for the successful production of transgenic 
plants after cells or small tissue explants have been 
subjected to the actual transformation procedure. 
However, it should be noted that whole-plant (in 
planta ) transformation strategies are also available in 
some species, in which the need for tissue culture is 
minimized or eliminated. 

The ease with which plant material is manipulated 
and interconverted in culture provides many opportu¬ 
nities for the development of techniques for gene trans¬ 
fer and the recovery of transgenic plants (Fig. 1). DNA 
can be introduced into most types of plant material - 
protoplasts, cell suspensions, callus, vegetative tissue 
explants, gametes, seeds, zygotes, embryos, organs, 
and whole plants; so, the ability to recover fertile plants 
from such material is often the limiting step in plant 
genetic engineering rather than the DNA transfer pro¬ 
cess itself. It is also possible to maintain transformed 
plant cell lines or tissues (e.g., root cultures) rather than 
regenerating whole plants as these can often be 
sustained indefinitely in culture. 

Gene transfer to plants can be achieved through 
three types of mechanism - viral transduction, bacte¬ 
rial gene delivery, and chemical/physical direct DNA 
transfer. Bacterial gene delivery using the soil pathogen 
A. tumefaciens is the most widely used method for 
dicotyledonous plants, but increasingly also for mono¬ 
cots. Physical methods are the next most popular, espe¬ 
cially particle bombardment for the transformation of 
more recalcitrant monocots such as cereals. Chemical 
transfection methods are little used these days, and are 
only suitable for protoplasts. Each of the above 
methods can be used to achieve either transient expres¬ 
sion or stable transformation of the nuclear genome, 
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Crop Plants Transformation Methods. Figure 1 

Strategies for the transformation of higher plants [4]. The boxes show the various different targets for transformation 
and how they are obtained from whole plants (block arrows). The methods used to transform each of these targets are 
shown in black text. The red arrows show the routes used to obtain whole transgenic plants 


while direct gene transfer can also be used to achieve 
chloroplast transformation. As stated earlier, viral 
transduction does not lead to stable transformation 
and virus vectors are not discussed in this article. 

Principles of Agrobacterium -Mediated 
Transformation 

Gene transfer from bacteria to plants occurs naturally 
and is responsible for crown gall disease. This is 
a plant tumor that can be induced in a wide variety of 
gymnosperms and dicotyledonous angiosperms by 


inoculating wound sites with the Gram-negative soil 
bacterium A. tumefaciens. The tumor produces plant 
hormones responsible for the proliferation of undiffer¬ 
entiated plant tissue, and specialized amino acid deriv¬ 
atives known as opines that the bacteria use as a carbon 
source. The entire system has therefore evolved as a way 
for the bacteria to exploit plants for accommodation 
and food. 

Even so, the continued presence of the bacteria is 
not required to maintain the tumor, indicating that 
some “tumor-inducing principle” is transferred from 
the bacterium to the plant at the wound site. Research 
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in the late 1970s identified the principle as a small 
segment of DNA that is transferred to the plant genome 
(hence T-DNA, for transferred DNA). The source of 
the T-DNA is a large plasmid, the tumor-inducing (Ti) 
plasmid, resident in the bacterium. 

Agrobacterium- mediated transformation was devel¬ 
oped as a platform technology following the dissection 
and functional analysis of two key components of nat¬ 
ural Ti-plasmids: the T-DNA, which contains DNA 
sequences required in cis for DNA transfer, and the 
vir (virulence) region, which contains genes whose 
products are required in trans for DNA transfer. The 
resulting transformation system is known as a binary 
vector system because the T-DNA and vir genes do not 
need to be present on the same vector. This means that 
the T-DNA can be housed on a small shuttle vector 
suitable for cloning in bacteria, while the vir genes are 
provided on a second “helper” plasmid. 

Natural T-DNA carries genes encoding enzymes for 
plant hormone synthesis and enzymes for opine synthe¬ 
sis, and these are the factors that drive the formation of 
the crown gall tumor. The hormone genes are often called 
oncogenes because they lead to tumor formation. How¬ 
ever, neither the oncogenes nor the opine genes are 
necessary for DNA transfer. If these genes are removed, 
leaving an “empty” T-DNA cassette, the “disarmed” 
T-DNA can still be transferred to the plant genome, but 
the transformed cells at the wound site no longer prolif¬ 
erate and form a tumor. All that is required for transfer is 
the vir region, the T-DNA border sequences which are 
targets for excision from the Ti-plasmid, and a small 
number of loci on the A. tumefaciens genome. The 
T-DNA border sequences are 25-bp imperfect direct 
repeats which define the boundaries of the T-DNA. 
An enhancer, sometimes called the overdrive sequence, 
is located external to the right-hand repeat and is also 
required for high-efficiency transfer [5, 6]. Disarming 
the T-DNA does not improve the efficiency of trans¬ 
formation but it does improve the efficiency of regen¬ 
eration, since the oncogenes produce phytohormones 
that interfere with normal plant development. At the 
same time, however, this removes any visual confirma¬ 
tion that transformation has taken place, and selectable 
or screenable marker genes must be included in the 
T-DNA to allow the identification of transformants. 

The vir genes are organized into several operons. 
Two of these genes, virA and vzrG, are constitutively 


expressed at a low level and control the plant-induced 
activation of the other vir genes. The VirA protein is 
a kinase that spans the inner bacterial membrane, and 
acts as the receptor for certain phenolic molecules that 
are released by wounded plant cells. Many such com¬ 
pounds have been characterized, but acetosyringone is 
the most widely used in the laboratory to induce vir 
gene expression [7] . These phenolic compounds do not 
actually attract bacteria to wounded plant cells (the 
bacteria are attracted by simple molecules such as 
sugars and amino acids) but the vir genes are induced 
after bacterial attachment. Activated VirA transpho- 
sphorylates the VirG protein, which is a transcriptional 
activator of the other vir genes. Further genes on the 
bacterial chromosome also encode transcription fac¬ 
tors that regulate vir gene expression (reviewed in [8, 
9]). The induction of vir gene expression results in the 
synthesis of proteins that form a conjugative pilus 
through which the T-DNA is transferred to the plant 
cell. The components of the pilus are encoded by genes 
in the vzVB operon (reviewed in [10]). DNA transfer 
itself is initiated by an endonuclease formed by the 
products of the virDl and virD2 genes. This introduces 
either single-strand nicks or a double-strand break at 
the 25-bp borders of the T-DNA, a process enhanced by 
the VirC12 and VirC2 proteins, which recognize and 
bind to the overdrive enhancer element. The VirD2 
protein remains covalently attached to the processed 
T-DNA. Recent studies have suggested that the type of 
T-DNA intermediate produced (single- or double- 
stranded) depends on the type of Ti-plasmid, with 
double-stranded T-DNA favored by nopaline plasmids 
(where the T-DNA is a single element) and single 
“T-strands” favored by octopine and succinopine plas¬ 
mids, where the T-DNA is split into noncontiguous 
sections. T-strands are coated with VirE2, a single- 
stranded DNA-binding protein. The whole complex, 
sometimes dubbed the firecracker complex because of 
its proposed shape, is then transferred through the 
pilus and into the plant cell. The VirD2 protein may 
protect the T-DNA against nucleases, target the DNA to 
the plant cell nucleus, and help integrate it into the 
plant genome. The protein has two distinct nuclear 
localization signals, with the C-terminal signal thought 
to play the major role in targeting the T-DNA [11]. 
Once in the nucleus, the T-DNA is thought to integrate 
through a process of illegitimate recombination, 
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perhaps exploiting naturally occurring chromosome 
breaks [12-14]. 

Contemporary binary vectors have all the conve¬ 
niences of bacterial cloning vectors such as multiple 
unique restriction sites in the T-DNA region to facili¬ 
tate subcloning, the lacZ gene for blue-white screening 
and a X cos site for preparing cosmid libraries. 
A popular binary vector in current use is pGreen, 
which is <5 kbp in size, has 18 unique restriction 
sites in the T-DNA, a lacZ gene for blue-white screen¬ 
ing of recombinants, and a selectable marker that can 
be used both in bacteria and in transformed plants 
[15]. The progressive reduction in size has been made 
possible by removing essential genes required for rep¬ 
lication in A. tumefaciens and transferring those genes 
to the bacterium’s genome, or onto a helper plasmid. 
The pGreen plasmid, for example, contains the Sa 
origin of replication, which is much smaller than the 
more traditional Ri and RK2 regions. Furthermore, an 
essential replicase gene is housed on a second plasmid 
called pSoup resident within the bacterium. All conju¬ 
gation functions have also been removed [15]. 

More recent innovations have also been incorpo¬ 
rated into binary vectors, such as the inclusion of 
Gateway technology, a proprietary technology devel¬ 
oped by Invitrogen (now part of Life Sciences, 
Carlsbad, California, USA) which is based on the attR 
and attP site-specific recombination sites and associ¬ 
ated enzymes employed by bacteriophage X to integrate 
into the Escherichia coli chromosome during lysogeny 
(the process by which the bacteriophage genome inte¬ 
grates into the bacterial chromosome and becomes 
dormant). Under normal circumstances, the attP site 
in the phage genome and the E. coli attR site 
undergoes site-specific recombination catalyzed by 
a phage enzyme, resulting in integration and the for¬ 
mation of two hybrid sites flanking the prophage, attL, 
and attR. To develop this as a cloning system, sequences 
are prepared with flanking attR sites (typically gener¬ 
ated using extended PCR primers) and these undergo 
recombination with attP sites in a Gateway vector to 
generate an Entry Clone in which the sequence is 
flanked by attL sites. The Entry Clone can then be 
mixed with a plant-specific Destination Vector, which 
contains attR sites flanking a marker gene, such that 
the cloned gene is transferred into the destination vec¬ 
tor to replace the marker (LR recombination). A large 


number of Gateway-compatible binary vectors have 
been developed [16] allowing universal cloning 
independent of the presence of restriction endonucle¬ 
ase sites and the assembly of multiple genes on one 
plasmid, a modification known as MultiRound 
Gateway [17]. 

Methods of Agrobacterium -Mediated 
Transformation 

Many dicotyledonous plants can be transformed using 
variations of the simple protocol developed by Horsch 
et al. [18] in which small disks punched from leaves 
are surface-sterilized and inoculated in a medium 
containing A. tumefaciens harboring the recombinant 
binary vector. The disks are cultured for 2 days, during 
which T-DNA transfer takes place, and are then trans¬ 
ferred to a medium containing the selection agent and 
carbenicillin to kill the bacteria. After 2-4 weeks, devel¬ 
oping shoots are excised from the callus and 
transplanted to root-inducing medium, and thereafter 
into soil. 

This leaf disk method is unsuitable for most mono- 
cotyledonous plants because they lie outside the 
Agrobacterium host range, and do not respond to 
wounding in the same way as dicots. In order to trans¬ 
form staple crops such as cereals, modified culture 
conditions were developed involving explants 
containing a high proportion of actively dividing 
cells, such as embryos or apical meristems. In dicots, 
cell division is often induced by wounding, whereas 
wound sites in monocots tend to become lignified. 
This probably explains why traditional procedures 
such as the leaf disk method are inefficient in mono¬ 
cots. Hiei et al. [19] showed that the cocultivation 
of Agrobacterium and rice embryos in the presence of 
100 mM acetosyringone was a critical factor for suc¬ 
cessful transformation. Efficiency could be improved 
further by using so-called “supervirulent” strains like 
AGL-1, which incorporate modifications to boost gene 
transfer activity, such as overexpressed vzVG (switching 
on the expression of the other vir genes) and/or vzrEl, 
which is a major limiting factor in T-DNA transfer 
(reviewed in [20]). Komari et al. [21] used a different 
strategy, in which a portion of the virulence region 
from the Ti-plasmid of supervirulent strain A281 was 
transferred to the T-DNA-carrying plasmid to generate 
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a so-called superbinary vector. The advantage of the 
latter technique is that the superbinary vector can be 
used in any Agrobacterium strain. 

Principles of Direct DNA Transfer 

Direct DNA transfer, as the name suggests, involves the 
introduction of DNA directly into the cell without 
using a biological carrier. This has two important 
advantages over Agrobacterium -mediated transforma¬ 
tion. First, there is no dependency on the biological 
properties of the carrier, so any species and variety is 
suitable for transformation. Second, there is no need to 
use specialized vectors for transformation - indeed 
transformation is possible without any vectors at all. 
The principles of direct DNA transfer are therefore 
much more straightforward than those for 
Agrobacterium -mediated transformation. All that is 
required is a mechanism for getting DNA into the 
plant cell, and ensuring it reaches the nucleus. This 
can be achieved using physical or chemical means. 

Chemical Methods for Direct DNA Transfer Histor¬ 
ically, the first direct transfer methods were chemical, 
and were closely related to the (at the time) newly 
devised methods for gene transfer to animal cells. 
Animal cells lack a cellulose wall and are protected by 
a simple plasma membrane. Therefore, the methods 
devised for animal cells would not work directly on 
plant cells, but were suitable for protoplasts. Gene 
transfer across the protoplast membrane is promoted 
by a number of chemicals, of which polyethylene glycol 
has become the most widely used due to the availability 
of simple transformation protocols [22]. Alternatively, 
DNA uptake may be induced by electroporation [23]. 
Putative transformants are transferred to selective 
medium, where surviving protoplasts regenerate their 
cell walls and commence cell division, producing 
a callus. Subsequent manipulation of the culture con¬ 
ditions then makes it possible to induce shoot and root 
development, culminating in the recovery of fertile 
transgenic plants. The major limitation of protoplast 
transformation is not the gene transfer process itself, 
but the ability of the host species to regenerate from 
protoplasts. Protoplast transformation was also the 
first method developed for gene transfer to the chloro- 
plast genome of higher plants (see below). 


Physical Methods for Direct DNA Transfer There is 
a great diversity of physical approaches for gene trans¬ 
fer to plants, including electroporation of walled plant 
cells, perforation of the cell with silicon carbide whis¬ 
kers, microinjection, and poration with a finely focused 
laser beam. In most of these cases, only transient 
expression of the introduced DNA has been achieved, 
although transgenic corn plants have been recovered 
following whisker-mediated transformation [24]. Par¬ 
ticle bombardment (microprojectile bombardment, 
biolistics) is a more robust and reliable method for 
stable transformation which has been successful for 
the transformation of cereals, soybean, cotton, 
phaseolus, and other recalcitrant crops. Initially, 
a modified shotgun was used to accelerate small 
(1-4 pm) metal particles into plant cells at a velocity 
sufficient to penetrate the cell wall (^250 m/s). In the 
initial test system, intact onion epidermis was 
bombarded with tungsten particles coated in tobacco 
mosaic virus (TMV) RNA. Three days after bombard¬ 
ment, approximately 40% of the onion cells that 
contained particles also showed evidence of TMV rep¬ 
lication [25]. A plasmid containing the cat (chloram¬ 
phenicol acetyltransferase) reporter gene driven by the 
CaMV35S promoter was then tested to determine 
whether DNA could be delivered by the same method. 
Analysis of the epidermal tissue 3 days after bombard¬ 
ment revealed high levels of transient CAT activity [26] . 
The stable transformation of explants from several 
plant species was achieved soon after these initial 
experiments. Early reports included the transformation 
of soybean [27], and corn [28, 29]. The ability to stably 
transform plant cells by this method offered the excit¬ 
ing possibility of generating transgenic plants 
representing species that were, at the time, intractable 
to other transformation procedures. Early successes 
included soybean, cotton, papaya, and tobacco 
(reviewed in [30]). Particle bombardment has also 
been pivotal in the development of chloroplast trans¬ 
formation technology (see below). 

There is no intrinsic limitation to the potential of 
particle bombardment since DNA delivery is governed 
entirely by physical parameters [31]. Many different 
types of plant material have been used as transforma¬ 
tion targets, including callus, cell suspension cultures, 
and organized tissues such as immature embryos, mer- 
istems, and leaves. The number of species in which 
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transgenic plants can be produced using variants of 
particle bombardment has therefore increased dramat¬ 
ically over the last 20 years including rice [32], wheat 
[33], and oat [34], as well as many other crops 
(reviewed in [30]). The original gunpowder-driven 
device has been improved and modified resulting in 
greater control over particle velocity and hence greater 
reproducibility of transformation conditions. An appa¬ 
ratus based on electric discharge has been used for the 
development of variety-independent gene transfer 
methods for the more recalcitrant cereals and legumes 
[35], while several instruments have been developed 
where particle acceleration is controlled by pressurized 
gas (reviewed in [36]). Physical parameters such as 
particle size and acceleration (which affect the depth 
of penetration and the amount of tissue damage) as 
well as the amount and conformation of the DNA used 
to coat the particles must be optimized for each species 
and type of explant [30, 37] . However, the nature of the 
transformation target is probably the most important 
single variable in the success of gene transfer. 

In Planta Transformation Methods 

Most transformation methods for plants require some 
form of tissue culture step. This is because the funda¬ 
mental principle of most transformation methods is 
that plants are regenerated from a small number of 
transformed cells that can survive under selection. 
Experiments using the model dicot Arabidopsis 
thaliana have led the way in the development of so- 
called in planta transformation techniques, where the 
need for tissue culture is minimized or eliminated 
altogether. Such methods involve the introduction of 
DNA, either using A. tumefaciens or direct transfer, into 
intact plants [38, 39]. 

The procedure is carried out at an appropriate 
time in the plant’s life cycle so that the DNA becomes 
incorporated into cells that will contribute to the germ 
line, directly into the germ cells themselves (often at 
around the time of fertilization), or into the very early 
plant embryo. Generally, in planta transformation 
methods have a very low efficiency, so the small 
size of Arabidopsis and its ability to produce over 
10,000 seeds per plant is advantageous. This limitation 
has so far prevented in planta techniques from being 
widely adopted for crop species, although radish, pak 


choi, and Medicago truncatula are exceptions [40-42]. 
The first in planta transformation system involved 
imbibing Arabidopsis seeds overnight in an A. 
tumefaciens culture, followed by germination [43]. 
A large number of transgenic plants containing T- 
DNA insertions were recovered but in general this 
technique has a low reproducibility. A more reliable 
method has been described by Bechtold et al. [38] in 
which the bacteria are vacuum infiltrated into 
Arabidopsis flowers. An even simpler technique called 
floral dip has become widely used [39]. This involves 
simply dipping Arabidopsis flowers into a bacterial 
suspension at the time of fertilization. In both these 
methods, the transformed plants are chimeric, but give 
rise to a small number of transgenic progeny. It has 
been established that T-DNA is transferred into the 
ovule during the transformation procedure [44]. 

An alternative to the direct transformation of germ 
line tissue is the introduction of DNA into meristems in 
planta followed by the growth of transgenic shoots. In 
Arabidopsis, this has been achieved simply by severing 
apical shoots at their bases and inoculating the cut 
tissue with A. tumefaciens suspension [45]. Using this 
procedure, transgenic plants were recovered from the 
transformed shoots at a frequency of about 5%. In rice, 
explanted meristem tissue has been transformed using 
A. tumefaciens and particle bombardment, resulting in 
the proliferation of shoots that can be regenerated into 
transgenic plants. Such procedures require only 
a limited amount of tissue culture. 

Vectors for Plant Transformation 

Components of Plant Transformation Vectors 

The vectors used for plant transformation are usually 
designed with four purposes in mind - the ability to 
replicate in both E. coli and A. tumefaciens (i.e., shuttle 
vector capability), suitability for subcloning (multiple 
restriction enzyme sites and/or Gateway compatibil¬ 
ity), the ability to confer a selectable phenotype on 
transformed cells (selectable marker genes), and the 
ability to drive transgene expression (an expression 
cassette, consisting minimally of a promoter, site for 
transgene insertion and a terminator/polyadenylation 
site). 

For Agrobacterium- mediated transformation, the 
binary vector system comprises one transformation 
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vector and one helper plasmid containing the vir genes 
(see above). The transformation vector contains the 
T-DNA, with the selectable marker gene and expression 
cassette housed within (reviewed in [46]). The replica¬ 
tion functions are not required for DNA transfer and 
are found on the plasmid backbone, but they are 
required for maintenance in A. tumefaciens and cannot 
be dispensed with entirely, nor moved to the helper 
plasmid because the replication functions are required 
in cis. The E. coli replication functions (generally the 
ColEl origin) are not required in A. tumefaciens but are 
needed for basic cloning operations prior to transfor¬ 
mation (reviewed in [47]) and these are also found on 
the plasmid backbone. 

In contrast to Agrobacterium- mediated transforma¬ 
tion, direct transfer (e.g., particle bombardment) is an 
entirely physical process with no dependence on bio¬ 
logical functions. Therefore, replication functions on 
vectors used for direct DNA transfer are solely present 
to facilitate cloning in E. coli. They are entirely dispens¬ 
able for the transformation process and can indeed be 
a nuisance if integrated into the plant genome since 
they encourage recombination and may in some cases 
promote transgene silencing. Only the expression cas¬ 
sette and selectable marker are required in planta , and 


therefore the plasmid backbones can be removed prior 
to transformation, leaving the small, linear cassettes as 
the substrate (reviewed in [31]). Although this is func¬ 
tionally equivalent to the linear T-DNA which is excised 
from the binary vector during Agrobacterium- mediated 
transformation, it should be noted that the T-DNA 
excision process is often imprecise, resulting in varying 
amounts of backbone sequences being cotransferred to 
the plant genome. 

The Development of Binary Vectors 

One of the first binary vectors for Agrobacterium- 
mediated transformation was pBIN19 [48] although 
this has fallen out of favor because of its low copy 
number in E. coli , which makes it difficult to obtain 
large amounts of DNA for cloning (Table 1). Another 
disadvantage of pBIN19 is that the selectable marker is 
next to the right border. Because T-DNA transfer is 
directional, with the right border being transferred 
first, it is better to have the marker next to the left 
border to ensure that resistant plants have received 
a complete (or nearly complete) copy of the T-DNA. 
These two disadvantages were addressed in more recent 
vectors such as pPZP and pBINPLUS, which contained 
a high-copy-number origin of replication for E. coli and 
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Vector 

Category 

Details 

References 

pBI N19 

Binary vector 

Low copy number in Escherichia coli, plant resistance marker is 
next to the right border 

[48] 

pPZP 

Binary vector 

ColEl origin of replication, plant marker is adjacent to the left 
border of T-DNA 

[49] 

pCAMBIA 

Binary vector 

Modification of pPZP 

[49] 

pBINPLUS 

Binary vector 

Selectable marker gene at the left T-DNA border, a higher copy 
number in E. coli, and two rare restriction sites for easier cloning 

[50] 

pIG 121 Hm 

Binary vector 

A derivative of pBIN 19 

[19] 

pTOK233 

Superbinary vector 

Contains virB, virC, and virG genes from pTiBo542 

[19,51] 

pTiBo542 

Ti-plasmid 

Has virB, virC, and virG 

[52, 53] 

pSBIl 

Intermediate vector of 
superbinary system 

ColEl origin of replication, multiple cloning sites within T-DNA, 
replicates only in E. coli, has a fragment homologous to the 
acceptor vector pSBI 

[21] 

pSBI 

Acceptor vector of pSB11 
of superbinary system 

Replicates in E. coli and Agrobacterium 

[21] 
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a selectable marker gene at the left T-DNA border 
[49, 50]. Two rare restriction sites were also provided 
for cloning convenience. 

As discussed above, the successful Agrobacterium- 
mediated transformation of monocots rested on the 
development of superbinary vectors with extra copies 
of some of the vir genes to enhance transformation 
efficiency. In the first instance, the vzVB, vzrC, and 
virG genes were transferred from Ti-plasmid 
pTiBo542 [52, 53] carried by supervirulent strains of 
the bacterium, suchasA281 or EHA101. Hieiet al. [19] 
constructed a new superbinary vector called pTOK233 
by adding the vzVB, virG , and virC genes of pTiBo542 to 
achieve rice transformation. The T-DNA in this case 
carried the nptll selectable marker under the control of 
the nos promoter, the hpt selectable marker driven by 
the CaMV 35S promoter and an intron-gzzsA fusion 
gene also driven by the CaMV 35S promoter. 
A comparison between the binary vector pIG121Hm 
in the supervirulent strain EHA101 and the 
superbinary vector pTOK233 in the regular strain 
LBA4404 showed that pTOK233 was slightly more 
efficient [19]. Similar combinations have been used to 
achieve the transformation of many monocot species 
(reviewed in [54, 55]). In corn and sorghum, efficient 
transformation systems were established only with 
superbinary vectors in LBA4404, whereas a standard 
binary vector in a supervirulent strain was inefficient 
even with improved co-culture conditions [56]. 

One disadvantage of the superbinary system is the 
large size of all the vector components, reducing the 
convenience of cloning by standard methods. There¬ 
fore, the final construction step of a superbinary vector 
involves the cointegration of an intermediate vector 
such as pSBll and an acceptor vector such as pSBl 
via homologous recombination between the shared 
DNA segments [21]. The intermediate vector is 
a small plasmid containing the T-DNA and ColEl 
origin for replication in E. coli. The acceptor vector is 
an IncP plasmid, which can be replicated in E. coli and 
A. tumefaciens , and carries the 14.8-kb Kpnl fragment. 
If a gene of interest is to be introduced into plants in 
tandem with a marker gene, the two genes are first 
inserted into an intermediate vector, which is intro¬ 
duced into a strain of A. tumefaciens that carries an 
acceptor vector so that the cassette is integrated into the 
acceptor to generate the final superbinary vector. 


This approach also facilitates the transfer of large seg¬ 
ments of DNA with minimal rearrangement and favors 
a low number of integrated copies [46]. 

Other improvements have been made to reduce the 
frequency of vector backbone transfer. For example, 
Hanson et al. [57] placed the lethal barnase gene out¬ 
side the left T-DNA border so that any transgenic plants 
containing sequences beyond the left border were 
counterselected. Another way to reduce backbone 
cotransfer is to insert additional left border sequences, 
increasing the likelihood of recognition by the 
corresponding Vir proteins and thus suppressing trans¬ 
fer, as has been demonstrated in rice [58] and 
Arabidopsis [59]. Improvements have also been made 
to facilitate rapid and efficient subcloning. The zero 
background TA cloning system [60] was developed for 
Agrobacterium- mediated transformation and uses 
restriction enzyme Xcml to generate 3'-T overhangs in 
the linearized vector within the counterselectable 
marker gene ccdB, which ensures that self-ligated vec¬ 
tors are not propagated after transformation, but that 
PCR products with 3'-A overhangs can be inserted 
without further modification. 

Multiple Gene Transfer (MGT) 

In the early years of plant biotechnology, most trans¬ 
genic plants contained two transgenes - one selectable 
marker under the control of a constitutive promoter to 
facilitate the selective propagation of transformed cells, 
and a “primary transgene” or “gene of interest” which 
could be under the control of any sort of promoter and 
was intended to alter the plant’s phenotype in a specific 
manner. MGT is now being embraced as an approach 
to generate plants with more ambitious phenotypes 
[61]. MGT allows goals that were once impossible to 
be achieved, e.g., the import of complex metabolic 
pathways, the expression of entire protein complexes, 
and the development of transgenic crops simulta¬ 
neously engineered to produce a spectrum of added- 
value compounds [61, 62]. 

Essentially there are two MGT methods known as 
the linked and unlinked cotransformation strategies. In 
the linked strategy, all the different genes are linked on 
the same piece of transforming DNA. This is the nor¬ 
mal approach chosen with Agrobacterium-mediated 
transformation where several genes are carried 
within the same T-DNA borders. However, it is quite 
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possible to transform plants efficiency with (a) an 
Agrobacterium strain carrying a binary vector that has 
two separate T-DNAs, and (b) different Agrobacterium 
strains carrying different T-DNAs, although this 
becomes increasingly complex as the number of genes 
increases. There are several reports of a mixed strategy 
where two Agrobacterium strains carrying different 
T-DNAs each with multiple genes has been used to 
achieve MGT (reviewed in [63]). 

MGT is more easily achieved by direct gene transfer 
because there is no need to combine multiple genes 
on the same length of DNA. When the genes are 
supplied separately as a mixed preparation of plas¬ 
mids, there appears to be no bias in the process of 
integration. Tandem cotransformation can be under¬ 
taken, although it becomes more cumbersome as larger 
numbers of genes are required, but cotransformation 
with discrete, unlinked genes is just as efficient, and the 
genes tend to cointegrate at the same locus. As many as 
12 transgenes have been integrated using this unlinked 
cotransformation approach [64]. 

The main issue with linked MGT is the inconve¬ 
nience of serial cloning. Very rare restriction sites can 
help to address this challenge, as shown by Goderis 
et al. [65] who developed a binary vector for MGT 
incorporating 13 hexanucleotide restriction sites, six 
octanucleotide restriction sites, and five sites for hom¬ 
ing endonucleases, which are extremely rare in natural 
sequences and allow unidirectional cloning. Six differ¬ 
ent expression cassettes in auxiliary vectors with differ¬ 
ent promoter and terminator sequences were cut with 
the five different homing endonucleases plus an addi¬ 
tional octanucleotide restriction endonuclease and 
transferred into the homing endonuclease sites of 
the binary vector. Modified auxiliary vectors also facil¬ 
itate N- or C-terminal fusions to five different 
autofluorescent tags (EGFP, EYFP, Citrine-YFP, ECFP, 
and DsRed2) expressed from the tandem CaMV 35S 
constitutive promoter [66]. Similar systems have been 
developed by Thomson et al. [67], with the pUGA 
vectors for direct transfer and the pUGA2 vectors for 
Agrobacterium- mediated transformation. A series 
of unidirectional shuttle vectors containing various 
combinations of homing endonuclease sites was 
constructed and used to create artificial gene clusters 
in the pUGA or pUGA2 vectors, allowing the simulta¬ 
neous transfer of up to six genes. Versatile systems now 


exist that offer a large number of promoters, termina¬ 
tors, and autofluorescent tags (reviewed in [68]). 

A further strategy for linked MGT is to use the Cre/ 
lox P recombination system together with homing 
endonucleases. In this method, two donor vectors are 
needed to introduce the expression cassettes into the 
acceptor vector using Cre recombinase [69]. After each 
round of recombination, the unnecessary backbone 
sequence from the donor vector is cleaved out using 
the homing endonucleases leaving just one lox P site for 
the next step. In this manner, ten transgenes were 
inserted in the acceptor vector and introduced into 
rice by Agrobacterium- mediated transformation. 

The most recent generation of vectors are modular 
and multifunctional, such as the pCLEAN- 
G/pCLEAN-S dual binary vectors for Agrobacterium- 
mediated transformation [70] and the pORE vector 
system that can be adapted for both Agrobacterium- 
mediated transformation and direct DNA transfer [71 ]. 
The pORE vector series consists of “open” vectors for 
general plant transformation, “reporter” vectors for 
promoter analysis, and “expression” vectors for trans¬ 
gene expression. The sets comprise various combina¬ 
tions of promoters (PHPL, PENTCUP2, and 
PTAPADH), selectable markers (nptl I and pat), and 
reporter genes ( gusA and smgfp ), and any element can 
be modified independently. 

Direct DNA Transfer Using Minimal Expression 
Cassettes 

As discussed above, vector backbone sequences are not 
required for direct DNA transfer and they may also 
promote recombination within the transgenic locus 
and/or act as triggers for de novo DNA methylation. 
To determine whether minimal linear cassettes (pro¬ 
moter, transgene, and terminator) could be used for 
direct DNA transfer, Fu et al. [72] separated the expres¬ 
sion cassettes for the marker genes gusA and hpt from 
the parent vector, and used these as substrates for 
coating the microprojectiles used in particle bombard¬ 
ment, with the intact plasmids as a control. They found 
not only that the linear cassettes were equally efficient 
for transformation, but also that the elimination of the 
plasmid backbone had a remarkable positive effect on 
the resulting transgenic rice plants (Fig. 2). The cassette 
transformants generally contained fewer transgene 
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The clean DNA transformation system compared to particle bombardment with whole plasmid DNA [4]. The 
transformation strategy is shown, and two representative DNA blots are compared to demonstrate the simpler integration 
patterns resulting from transformation with linear minimal cassettes 


copies than those transformed with intact plasmids, 
and they showed stronger and more stable expression. 
Further analysis of the minimal cassette population 
showed that the transgenes were expressed in most 
of the transgenic plants and that, for all the 
transgenes, overall expression levels and coexpression 
frequencies were higher than previously reported for 
whole plasmid transformants. These results were con¬ 
firmed using the yfp (yellow fluorescent protein) and 
hpt markers, with concatemers forming only rarely in 
such plants [73]. 

Promoters Used for Plant Transformation 

The promoters used in plant biotechnology are tradi¬ 
tionally divided into three categories - constitutive 


(active continuously in most or all tissues), spatiotem- 
poral (tissue-specific or stage-specific activity), and 
inducible (regulated by the application of an external 
chemical or physical signal) [74]. Most basic transfor¬ 
mation vectors incorporate strong constitutive pro¬ 
moters because it is assumed that the objective is to 
express the transgene(s) at the highest possible level. 
Initially, the A. tumefaciens nos (nopaline synthase), ocs 
(octopine synthase), and mas (mannopine synthase) 
promoters were popular because they were already 
present in the natural T-DNA sequences from which 
the early binary vectors were developed, and had 
evolved to be active in many plant species (at least 
those within the A. tumefaciens host range). However, 
the CaMV 35S promoter was found to be stronger and 
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unaffected by wounding, and its modular nature made 
it easy to modify [75, 76] . The activity of the CaMV 35S 
promoter can be increased by duplicating the enhancer 
up to four times [77] and the enhancer can also 
increase the activity of heterologous promoters to 
which it is attached [78]. 

Although widely used, the CaMV 35S promoter has 
certain limitations such as its poor performance in 
monocots, its suppression by feeding nematodes, and 
the intellectual property issues affecting its commercial 
deployment (reviewed in [79]). For this reason, alter¬ 
native virus promoters with similar or improved prop¬ 
erties have been sought. Thus far, however, the only 
virus promoters that have been developed into 
established expression vectors are those from Cestrum 
yellow leaf curling virus (CmYLCV) [80], which can be 
licensed from Syngenta Biotechnology, Inc. for limited 
research purposes, and from Subterranean clover stunt 
virus (SCSV), which has been used to construct the 
pPLEX series of expression vectors for use in both 
dicots and monocots [81, 82]. 

Constitutive expression in monocots is usually 
achieved with housekeeping promoters, particularly 
those from the rice actinl and corn ubiquitinl genes 
(reviewed in [83, 84] ). In both cases, the presence of the 
first intron of the gene is required for high-level expres¬ 
sion [83, 84], and the addition of this intron to the 
CaMV 35S promoter also enhances its activity in 
monocots, e.g., 40-fold in corn [85]. In contrast, the 
first intron of the recently characterized rice actin2 gene 
contains a negative regulatory element whose removal 
is required for high-level promoter activity [86]. 

Many different plant promoters have been 
described that restrict expression to particular cells, 
tissues, organs, or developmental stages, and seed- 
specific promoters are probably the most diverse. 
Seeds are a frequent target for genetic engineering in 
plants because they can accumulate recombinant pro¬ 
teins to levels that would be lethal in vegetative tissues 
but can do so without compromising plant growth and 
development; they are also a harvestable product and 
thus the target for nutritional improvement. Many 
promoters have been identified that target gene expres¬ 
sion specifically to the seed, or to a particular region of 
the seed such as the endosperm, embryo, or aleurone 
[74, 79, 87]. Anther-specific promoters are also very 
useful because they can be used to control male fertility, 


an important trait in crop breeding, while fruit- and 
tuber/root-specific promoters are valuable for the 
nutritional improvement of fruit and root vegetable 
crops, pest/disease resistance, and the use of staple 
crops as factories for the production of novel proteins 
and metabolites. 

Inducible promoters are also highly valued in plants 
because they allow transgenes to be controlled by inter¬ 
nal and external physical or chemical cues. Many dif¬ 
ferent inducible promoters have been identified in 
plants and these generally fall into three categories - 
(1) those responsive to endogenous signals (plant 
hormones); (2) those responsive to external physical 
stimuli (abiotic and biotic stresses); and (3) those 
responsive to external chemical stimuli. Such pro¬ 
moters provide immense scope for the precise regula¬ 
tion of transgene expression through external control, 
ranging from the precise control of transgene activa¬ 
tion/inactivation in experimental settings to the ability 
to activate transgenes on an agricultural scale by the 
application of chemical sprays. Examples of commonly 
used promoters include those responsive to phytohor¬ 
mones (particularly auxin, abscisic acid, gibberellin, 
and ethylene), heat-shock promoters responsive to 
raised temperatures, light-inducible promoters, pro¬ 
moters induced by wounding or by exposure to elici- 
tors produced by pathogens, and promoters that 
respond to specific metabolites [74, 79]. Sugar respon¬ 
sive promoters fall into the latter category and the ds- 
acting elements that confer sensitivity to sugar are 
particularly useful for controlling gene expression in 
cultured plant cells. For example, elements from 
sporamin and amylase promoters have been studied 
in detail and the minimal a-amylase 3 promoter 
makes the normally constitutive rice actinl promoter 
sensitive to the presence of sugar [88]. Inducible pro¬ 
moters that respond to xenobiotic signals are also valu¬ 
able because transgenes can be activated without 
affecting endogenous genes. Martinez et al. [89] devel¬ 
oped a hybrid system consisting of the tobacco bud- 
worm ecdysone receptor ligand-binding domain fused 
to the mammalian glucocorticoid receptor DNA- 
binding domain and the VP 16 transactivation domain. 
The receptor responds to tebufenozide (an insecticide 
better known by its trade name CONFIRM). Similarly, 
Padidam et al. [90] have developed a system that is 
based on the spruce budworm ecdysone receptor 
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ligand-binding domain, and responds to another com¬ 
mon insecticide, methoxyfenozide (INTREPID). 

Selectable and Screenable Markers 

As stated above, selectable marker genes provide 
a phenotype that allows transformed cells to be prop¬ 
agated under conditions where nontransformed cells 
cannot survive, such as in the presence of an otherwise 
toxic or growth-disrupting reagent (positive selection) 
or in the absence of an otherwise essential nutrient 
(negative selection). This is necessary to isolate the 
small number of transformed cells from the over¬ 
whelming majority of their nontransformed peers 
which, without selection, would quickly outcompete 
them. Although a wide range of selectable markers 
has been tested in plants, only a few are used routinely 
(reviewed in [47, 91]). The broadest markers are suit¬ 
able in most plants and are expressed using the most 
active constitutive promoters to ensure that all cell 
types are protected under selection (the CaMV35S 
promoter in dicots and the actin or ubiquitin 
promoters in monocots, as discussed earlier). 
Most selectable markers are described as conditional 


because an external reagent must be applied to facilitate 
selection, whereas others are nonconditional, i.e., they 
work without any external selection reagent. The typi¬ 
cal selectable markers used in plant transformation are 
positive and conditional, and work by conferring resis¬ 
tance to a toxic substance such as an antibiotic or 
herbicide that has a very specific intracellular target 
(Table 2). 

Marker genes that confer antibiotic resistance orig¬ 
inate from bacteria but have been modified to function 
well in plants. The first marker to be used in plants was 
neomycin phosphotransferase (nptll, aphll ), which 
confers resistance to the aminoglycoside antibiotics 
neomycin, kanamycin, and geneticin (G148) [94]. 
This is probably still the most widely used marker in 
the laboratory but some plants are naturally resistant to 
kanamycin, and the antibiotic can also interfere with 
normal development in some species. An alternative is 
hygromycin phosphotransferase ( hph , hpt, aphTV ), 
providing resistance to the antibiotic hygromycin 
through the ATP-dependent phosphorylation of a 7- 
hydroxyl group [95]. Other antibiotic-resistance 
markers used less frequently include those conferring 
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Crop Plants Transformation Methods. Table 2 Selectable markers used for plant transformation (Data compiled 
and updated from [91-93]) 


Gene (Product) 

Source 

Phenotype and other comments 

aad (aminoglycoside 
adenyltransferase) 

Shigella flexneri 

Provides resistance to trimethoprim, streptomycin, spectinomycin, 
and sulphonamides. Used mainly for chloroplast transformation 

bar (phosphinothricin 
acetyltransferase) 

Streptomyces 

hygroscopicus 

Resistance to phosphinothricin (PPT), which is a component of the 
herbicides bialophos, Basta, and glufosinate 

ble (glycopeptide-binding 
protein) 

Streptalloteichus 

hindustantus 

Resistance to the glycopeptide antibiotics bleomycin and 
pheomycin (and the derivative Zeocin) 

dhfr (dihydrofolate reductase) 

Mouse 

Resistance to methotrexate 

sull (dihydropteroate 
synthase) 

Escherichia coli 

Resistance to sulfonamides (Asulam) 

epsps (enolpyruvylshikimate 
phosphate synthase) 

Petunia hybrida 

Resistance to the herbicide glyphosate 

hpt (hygromycin 
phosphotransferase) 

Klebsiella spp. 

Resistance to the aminoglycoside antibiotic hygromycin B 

manA (mannose-6-phosphate 
isomerase, MIP) 

E. coli 

Ability to grow on mannose as sole carbon source 

neo/nptll/aphll (neomycin 
phosphotransferase) 

E. coli 

Resistance to the aminoglycoside antibiotics neomycin, kanamycin, 
and geneticin (G148) 
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resistance to bleomycin [96], gentamycin [97], and 
methotrexate [98]. 

Marker genes that confer herbicide resistance may 
originate from bacteria or plants, and those conferring 
resistance to the broad-spectrum herbicides 
phosphinothricin (PPT)/glufosinate and glyphosate 
are used the most widely. PPT/glufosinate is a compet¬ 
itive inhibitor of glutamine synthetase (GS), the only 
enzyme that can catalyze the assimilation of ammonia 
into glutamic acid in plants. Inhibition of GS therefore 
results in the accumulation of toxic levels of ammonia. 
The enzyme phosphinothricin N-acetyltransferase 
(PAT) encoded by either the bar or pat genes 
(these are genes from different microbial species) 
can be used to provide PPT/glufosinate resistance in 
transformed plant cells [99]. Glyphosate, the 
active ingredient of Roundup, inhibits the enzyme 5- 
enolpyruvylshikimate-3-phosphate synthase (EPSPS), 
which is required for the synthesis of aromatic amino 
acids. Glyphosate resistance can be conferred 
by markers encoding a modified EPSPS that is not 
affected by the herbicide, or those encoding 
a bacterial enzyme that breaks down the herbicide 
(glyphosate oxidoreductase, GOX) [100]. Markers pro¬ 
viding resistance against sulfonamindes such as 
Asualam [101] and chlorsulfuron [102] are also used 
occasionally. 

Negative selectable markers are useful because they 
allow transformed cells to be selected based on the 
absence of something that is necessary for 
nontransformed cells to grow or regenerate efficiently. 
The E. coli manA/pmi gene confers the ability to use 
mannose as a sole carbon source so that only 
transformed cells survive on media containing man¬ 
nose but lacking sucrose [103]. The gene is a negative 
conditional marker because mannose itself is not toxic 
to nontransformed plants, but rather the absence of 
sucrose (nontransformed plants grow perfectly well in 
the presence of both sugars). Another example is the 
A. tumefaciens ipt gene, which promotes the synthesis 
of cytokinins [104]. This is a nonconditional marker 
because it confers the ability to produce shoots in 
growth medium lacking exogenous cytokinins, i.e., 
nothing has to be added to the medium to facilitate 
selection. Simply, transformed tissues are placed on 
medium lacking cytokinins and only those tissues 
that produce shoots are transgenic. 


The other major class of marker genes are known as 
screenable, scorable, or visible markers (or reporter 
genes) because rather than providing cells with a selec¬ 
tive advantage they confer a phenotype that can easily 
be detected without interfering with other cellular pro¬ 
cesses, allowing transformed cells to be identified and 
studied (Table 3). Although screenable markers have 
been used for the prosaic purpose of identifying 
transformed cells and manually separating them from 
nontransformed cells, they tend to be used for more 
sophisticated purposes, such as reporter assays and 
tracing experiments. Some screenable marker genes 
only provide their visual signal when provided with 
a particular substrate, i.e., they are conditional (e.g., 
gusA , luc). Others have an intrinsic ability to yield 
a visible signal, i.e., they are nonconditional (e.g., gfp> 
DsRed ), and these are the most useful since they can be 
used in living organisms. 

The E. coli gusA ( uidA) gene encodes the enzyme 
[3-glucuronidase (GUS), the most widely used condi¬ 
tional screenable marker in plants. As well as its 
endogenous substrates, GUS can process a range of 
chromogenic and fluorescent derivatives in a range of 
assays that allow the quantification or in situ localiza¬ 
tion of reporter gene activity. The most common 
substrate for GUS histochemical staining is 5-bromo- 
4-chloro-3-indolyl glucuronide (X-gluc), a clear sub¬ 
strate that yields a blue product. Other common 
substrates include p-nitrophenyl (3-D-glucuronide, 
which is used for spectrophotometric quantitative 
assays, and 4-methylumbelliferyl-beta-D-glucuronide 
(MUG), which produces a quantitative fluorescent 
signal. GUS is preferred in plants over the very similar 
reporter GAL (encoded by lacZ and widely used in 
microbes and animals) because of its stability in plants 
and its lack of toxicity and background activity. The 
main disadvantage of GUS is that it cannot be conve¬ 
niently used for in vivo imaging because plant cells must 
be fixed or destroyed to visualize the reaction. Its stabil¬ 
ity can also be problematic if the aim of an experiment is 
to study fluctuations, since the longevity of the protein 
can mask transient decreases in expression. 

Both the disadvantages of GUS are addressed by 
luciferase (LUC) an enzyme from the firefly ( Photinus 
pyralis ) which catalyzes the ATP-dependent oxidative 
decarboxylation of luciferin, producing light in the 
process. LUC allows nondestructive qualitative and 
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Crop Plants Transformation Methods. Table 3 Screenable markers (reporter genes) used in plants (Data compiled 
and updated from [83, 105]) 


Gene (product) 

Comments 

gusA ((3-glucuronidase) 

Source: Escherichia coli gusA/uidA gene 

Activity: catalyzes the hydrolysis of (3-glucuronides 

Assays: nonisotropic; in vitro assays are colorimetric or fluorometric; also histochemical 
assay format using X-gluc 

Advantages: simple, sensitive, quantitative, many assay formats available, inexpensive 

Disadvantages: assays are destructive; enzyme is stable so unsuitable for studies of 
downregulation 

cat (chloramphenicol 
acetyltransferase) 

Source: E. coliJr\9 

Activity: catalyzes the transfer of acetyl groups from acetyl coenzyme A to 
chloramphenicol 

Assays: in vitro assays only, isotropic 

Advantages: simple to perform 

Disadvantages: low sensitivity, expensive, low resolution in vivo, reliance on isotopic 
assay format 

luc (luciferase) 

Source: The firefly Photinus pyralis 

Activity: light produced in the presence of luciferase, its substrate luciferin, oxygen, Mg 2+ , 
and ATP 

Assays: nonisotopic bioluminescent assays in vitro and in vivo 

Advantages: sensitive, rapid turnover, quantitative 

Disadvantages: Expensive detection equipment, limited reproducibility of some assay 
formats 

Anthocyanin regulators 

Source: corn ( Zea mays) 

Activity: induces pigmentation 

Assays: visual screening for pigmented cells in vivo 

Advantages: simple, inexpensive, nondestructive 

Disadvantages: low sensitivity, not quantitative, background expression, adverse effects 
on transgenic plants 

GFP (green fluorescent 
protein) 

Source: the jellyfish Aequorea victoria 

Activity: intrinsic fluorescence under blue/UV light 

Assays: nonisotopic, in vivo assays in live plants 

Advantages: intrinsic activity (no substrate requirements), sensitivity, use in live plants; 
many variants with modified absorption and emission spectra available, and different 
subcellular targeting signals; several variants can be used simultaneously 

Disadvantages: weak signal in some systems (this is being addressed through the use of 
modified GFPs with stronger emission and reduced photobleaching); no variants that 
emit in the orange-red part of the spectrum 
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Crop Plants Transformation Methods. Table 3 (Continued) 


Gene (product) 

Comments 

DsRed (red fluorescent 

Source: coral reef Discosoma spp. 

protein) 

Activity: intrinsic fluorescence under blue/UV light 


Assays: nonisotopic, in vivo assays in live plants 


Advantages: intrinsic activity (no substrate requirements), sensitivity, use in live plants; 
many variants with modified absorption and emission spectra available, and different 
subcellular targeting signals; several variants can be used simultaneously 


Disadvantages: weak signal in some systems, no variants that emit in the violet-blue- 
green part of the spectrum 


quantitative assays to be carried out both in vitro and in 
vivo, and because the reaction has a short half-life, it 
can be used to monitor fluctuating activity. A series of 
vectors that incorporate the luc gene have been devel¬ 
oped for plants, including the LucTrap series that allow 
targeted and random transcriptional and translational 
fusions of a transgene with a luc gene optimized for 
plant cells [106]. One drawback of luc is that it is still 
a conditional reporter, requiring the substrate luciferin 
and the presence of oxygen, ATP, and magnesium ions. 
In contrast, the green fluorescent protein (GFP) from 
the jellyfish Aequorea victoria is a nonconditional 
reporter allowing the direct, noninvasive visualization 
of fluorescence in vivo in real time merely by exposure 
to blue/UV light. The original gfp gene was 
nonfunctional in plants because of a cryptic splice 
site, but this has been corrected and the protein has 
been widely deployed as a vital marker [107]. One of 
the major advantages of GFP is that its spectral qualities 
can be modified by mutation, giving rise to a whole 
family of derivatives with enhanced brightness, less 
susceptibility to quenching, and a range of excitation/ 
emission wavelengths allowing different reporters to be 
used in vivo simultaneously. In combination with var¬ 
iants that allow targeting to different compartments 
within the plant cell, many sophisticated forms of anal¬ 
ysis become straightforward to implement, e.g., 
allowing the real-time monitoring of protein 
processing, trafficking and protein-protein interac¬ 
tions [108]. As well as GFP and its derivatives, other 
bioluminescent proteins have also been identified, 
including DsRed from Discosoma spp. This is similar 
to GFP but covers a different spectral range (red GFP is 


not available) but there is little background fluores¬ 
cence and it can be visible under white light [109]. 

The useful properties of selectable and screenable 
markers can also be combined into one protein. 
For example, Ochiai-Fukuda et al. [110] developed 
a fusion marker incorporating enhanced green fluores¬ 
cent protein and blasticidin deaminase, conferring 
resistance to the aminoacylnucleoside antibiotic 
blasticidin S. The gfbsd marker was introduced into 
rice callus and allowed the rapid and efficient selection 
and visual confirmation of transformed cells. 

Consequences of Nuclear Transformation 

Integration of Nuclear Transgenes 

As discussed above, plant transformation is a multistep 
process, the first step involving the transfer of DNA 
into the plant cell, which can be achieved either by 
direct transfer or by Agrobacterium- mediated transfor¬ 
mation. Once the DNA reaches the nucleus, the next 
step (transgene integration) is dependent predomi¬ 
nantly on that DNA and on factors provided by the 
plant cell, although in the case of Agrobacterium- 
mediated transformation it is possible that the Vir pro¬ 
teins complexed with the T-strand may facilitate the 
integration process (see below). A number of groups 
have investigated the structure of genomic/T-DNA and 
T-DNA/T-DNA junctions in plants and have con¬ 
cluded that integration occurs by illegitimate recombi¬ 
nation (see [111, 112]). A strand invasion mechanism 
has been proposed (reviewed in [12]), in which the 3' 
end of the T-strand initiates the integration process by 
hybridizing to a short region of homology in the plant 
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genome, the second strand being completed by primer 
extension of the plant DNA. Other models suggest 
conversion of the T-strand into a double-stranded 
intermediate, which integrates at the site of naturally 
occurring chromosome breaks via double-strand DNA 
break repair. This is supported by experiments that 
show transformation efficiency increases following 
UV irradiation, which generates nicks and breaks in 
genomic DNA. However, since T-DNA integration still 
occurs in DNA repair mutants, it is possible both 
mechanisms occur simultaneously albeit with different 
efficiencies. 

DNA repair models argue that proteins encoded by 
the host plant have a much more important role in 
T-DNA integration than Agrobacterium proteins, such 
as VirD2, which are imported into the plant with the 
T-DNA. However, since the VirD2 protein remains 
covalently attached to the 5' end of the T-strand during 
transfer it is also likely to influence integration [113]. 
In an in vitro assay, VirD2 can ligate together a cleaved 
T-DNA border sequence but cannot ligate T-DNA to 
other genomic targets unless plant cell extracts are also 
present [114], a phenomenon supported by the iden¬ 
tification of Arabidopsis mutants impaired for T-DNA 
integration [115]. 

Much can be learned about the T-DNA integration 
mechanism by the inspection of borders, especially the 
borders between adjacent T-DNA sequences in 
multicopy insertions. The formation of heterodimers 
during cotransformation argues in favor of T-DNA 
concatemerization prior to integration. Although 
inverted repeats around the right border are often 
precise, those around the left border and those separat¬ 
ing direct T-DNA repeats are often characterized by the 
insertion of variable-sized regions of filler DNA, which 
may be derived from the T-DNA sequence or from 
plant genomic DNA [116]. This suggests either the 
simultaneous integration of multiple T-DNAs at 
a single locus, or a two-phase mechanism, in which 
a primary T-DNA integration event stimulates further 
secondary integrations in the same area, similar to 
those proposed for particle bombardment (see 
below). Zhu et al. [117] carried out a comprehensive 
study of T-DNA border characteristics in a population 
of transgenic rice plants including 156 T-DNA/geno- 
mic DNA junctions, 69 T-DNA/T-DNA junctions, and 
11 T-DNA/vector backbone junctions, which included 


171 left borders and 134 right borders. Conserved 
cleavage was observed in 6% of left and 43% of right 
borders, microhomology was observed in 58% of 
T-DNA/genomic DNA, 43% of T-DNA/T-DNA, and 
82% of T-DNA/vector junctions, mostly at left borders, 
and about one third of the T-DNA/genomic DNA and 
T-DNA/T-DNA junctions showed evidence of filler 
DNA (up to 344 bp). This was derived mainly from 
the T-DNA region adjacent to the breakpoint and/or 
from the rice genomic DNA flanking the T-DNA inte¬ 
gration site, with T-DNA/T-DNA filler DNA showing 
the greatest complexity. Interestingly, when two 
T-DNAs were integrated in the inverted repeat config¬ 
uration, significant truncation was always observed in 
one of the two T-DNAs whereas with the direct repeat 
configuration, large truncations were rare. These data 
suggest that no single integration mechanism can 
account for all observations but the presence of filler 
DNA at many of the junctions argues that a template- 
driven DNA synthesis mechanism must be involved. 

The analysis of plasmid/plasmid and plasmid/geno¬ 
mic junctions in transgenic plants generated by particle 
bombardment reveals features characteristic of illegiti¬ 
mate recombination similar to those seen for T-DNA 
junctions, suggesting that the same overall integration 
mechanisms may be involved [118]. For example, 
such junctions are characterized by regions of 
microhomology, filler DNA, trimming of the DNA 
ends so sequences are lost, and AT-rich elements sur¬ 
rounding the junction site, with similarity to topo- 
isomerase I binding/cleavage sites (Fig. 3). In the 
analysis of multiple plasmid/plasmid junctions in 12 
transgenic rice lines, Kohli et al. [120] observed 10 
plants with microhomology at the junctions and 2 
plants where junctions appeared to be generated by 
blunt ligation, with no overlap. A similar ratio of con¬ 
served end-joining to microhomology-mediated 
recombination was observed by Gorbunova and Levy 
[121] and Salomon and Puchta [111]. Topoisomerase 
I sites were also observed adjacent to 10 out of 12 
junctions characterized in transgenic Arabidopsis 
plants generated by particle bombardment [122] and 
in four of the six junctions in the commercial SUNUP 
variety of papaya [123]. Illegitimate recombination 
therefore appears to be responsible both for the inte¬ 
gration of foreign DNA into the plant genome, and the 
linking of multiple plasmid copies, which is similar to 
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Crop Plants Transformation Methods. Figure 3 

Mechanism for transgene integration at regions of 
microhomology [119]. A mixture of DNA fragments with 
ragged ends (a) interacts with a double-stranded DNA 
break with partially complementary ragged ends 
(b). Repair synthesis across the gap (c) generates 
a recombination junction (d) which may be completely 
conserved if the homology is precise, or may involve either 
the loss of terminal sequences or the insertion of filler DNA 
if the homology is partial 


the mechanism proposed for T-DNA integration (see 
above). 

When nuclei from the cells of transgenic cereal 
plants generated by particle bombardment are analyzed 
by fluorescence in situ hybridization (FISH) using 
a transgene-specific probe, a curious phenomenon in 
often observed in which a single fluorescent spot in the 
interphase nucleus separates into multiple signals along 
a metaphase chromosome [124]. Any model for trans¬ 
gene integration following particle bombardment must 
take into account some form of three-tier organization, 
consisting of contiguous transgene arrays, interspersed 
with short regions of genomic DNA to generate 
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Crop Plants Transformation Methods. Figure 4 

Explanation for the formation of transgene arrays and 
transgene clusters interspersed with genomic DNA [119]. 
A mixture of DNA fragments interacts with a double- 
stranded DNA break where a repair complex has already 
assembled (a). The repair complex may stitch together 
DNA fragments to form concatemers prior to integration, 
or may integrate single copies. The first integration event 
stimulates further repair complex activity nearby, resulting 
in additional nicks and breaks in the genomic DNA that act 
as further integration sites (b). This results in a cluster of 
transgenes (single copies and concatemers) interspersed 
with short regions of genomic DNA (c) 

local clusters, and the appearance of widely dispersed 
signals at metaphase. Two-phase transgene integration 
mechanisms have been proposed to explain the first 
two levels of organization, and in such models 
concatemerization is proposed to occur prior to inte¬ 
gration, while interspersion occurs during the integra¬ 
tion process [118, 120, 125] (Fig. 4). In each model, 
penetration of the cell is proposed to elicit a wound 
response, which would include the induction of DNA 
repair enzymes, such as nucleases and ligases. The 
presence of these enzymes and an excess of foreign 
DNA would result in the linking together of several 
copies to form concatemers, which would be the sub¬ 
strates for integration. This might be stimulated by 
homology between individual copies of transforming 
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plasmids, and “backbone” homology might also result 
in the concatemerization of plasmids carrying different 
transgenes in cotransformation experiments. However, 
as stated above, cotransformation and cointegration 
also occur when two nonhomologous minimal cas¬ 
settes are used for transformation, so homology 
might not be as important as the presence of free 
DNA ends [72]. Kohli et al. [120] suggested that trans¬ 
gene clusters arise in a second phase where a primary 
integration event occurring by illegitimate recombina¬ 
tion at a chromosome break generates a “hot-spot” for 
further integration events in the same area. This might 
be due, for example, to the presence of local repair 
complexes that can slide along the DNA and introduce 
nicks which can be exploited by more foreign DNA. 
Pawlowski and Somers [125] suggested an alternative 
second phase where a number of discrete transgene 
concatemers integrate simultaneously at a site 
containing multiple replication forks. Although there 
is no direct evidence for either mechanism, it is inter¬ 
esting to note that DNA integration is stimulated in 
rapidly dividing cells, and is blocked in Arabidopsis 
mutants lacking essential components of the DNA 
recombination machinery. 

The higher-order organization of transgenic loci 
observed by FISH is thus far unique to particle bom¬ 
bardment and demands a model which takes into 
account the three-dimensional structure of the nucleus. 
It is possible that the transformation event affects 
a local region of the interphase nucleus, e.g., a metal 
particle may cause damage to a particular area of chro¬ 
matin arranged in loops attached to the nuclear matrix, 
or to a localized transcription factory. If the particle 
“skims” several loops or several transcription units, 
there will be regions of DNA damage close together in 
trans , but widely separated in the cis configuration were 
the DNA to be stretched out (Fig. 5). Each of these sites 
could act as a nucleation point where foreign DNA 
diffusing from the metal particle is used to patch up 
double-strand breaks, generating widely separated 
arrays and/or clusters [124, 126]. In support of this 
induced break and repair model, Svitashev et al. [127] 
have shown that in 6 of 25 transgenic oat plants gener¬ 
ated by particle bombardment, transgene integration 
sites were associated with rearranged chromosomes. 
This suggests that DNA breaks caused by incoming 
particles are repaired with foreign DNA and may also 


result in deletions, inversions, and translocations 
involving genomic DNA. 

Transgene Structure and Integrity 

Transgene rearrangements following particle bombard¬ 
ment have been widely reported in the literature and 
many publications repeat the “lore” that direct 
DNA transfer is more likely than T-DNA transfer to 
generate complex rearranged loci. The number of 
rearrangements that can be detected depends entirely 
on the resolution of the method being used. Thus, 
careful analysis of locus structure by Southern blot 
hybridization, PCR, and DNA sequencing has shown 
that rearrangements maybe more widespread than first 
envisaged in both transformation methods. 

In the case of direct DNA transfer, the analysis of 
transgenic oat loci by Somers et al. has shown that 
transgene rearrangements can be extensive and 
extremely complex, with multiple small insertions, 
inversions, and deletions within any transgene, plus 
the presence of filler DNA [127]. In corn, Mehlo et al. 
[128] noted that every single plant among the popula¬ 
tion they analyzed showed some form of 
rearrangement, and they speculated that undetected 
“minor” rearrangements could be responsible for 
many instances of transgene silencing otherwise attrib¬ 
uted to epigenetic processes. In particular, certain 
transgene rearrangements were not detectable by 
Southern blot hybridization because they were too 
subtle, but they could be picked up by long-range 
PCR and sequencing. Because Southern blot hybridi¬ 
zation is normally the sole method used to determine 
whether a given locus is intact or rearranged, this 
suggests caution should be used in relying on such 
results, since only “major” rearrangements can be 
detected in this manner. 

In the case of Agrobacterium -mediated transforma¬ 
tion, Afolabi et al. [129] and Zhu et al. [117] found that 
nonintact T-DNAs were present in >70% of transgenic 
rice lines, in most cases reflecting loss of the mid to 
right border portion of the T-DNA. Similarly, Rai et al. 
[ 130] found that about 50% of rice plants transformed 
with a T-DNA containing the phytoene synthase ( psy ) 
and phytoene desaturase ( crtl) genes showed evidence 
of rearrangements, and in the majority of cases the 
rearrangements occurred in the crtl expression cassette, 
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Crop Plants Transformation Methods. Figure 5 

Higher-order transgene locus organization in cereals transformed by particle bombardment [124]. Transformation occurs 
during interphase, when the chromatin is distributed into specific nuclear zones and territories. If a metal particle causes 
localized damage, DNA repair complexes will form at these sites and initiate transgene integration (a). During metaphase, 
when FISH analysis is generally carried out, loci that are brought together in interphase may be separated, resulting in 
multiple signals from the same transformation event (b). If the DNA were stretched out, this would reveal large (megabase) 
interspersed sequences, which have also been observed in fiber-FISH experiments 


which was adjacent to the right T-DNA border. 
Rearrangements involving the left border are often 
characterized by the insertion of variable-size regions 
of filler DNA, possibly derived from the T-DNA 
sequence or from plant genomic DNA [116, 131]. 

Few researchers have characterized transgene 
rearrangements in detail, but work by Kohli et al. 
[132] has shown that rearrangements may involve pal¬ 
indromic sequences in the transforming plasmid, 
which tend to form secondary structures such as hair¬ 
pins and cruciforms. These investigators characterized 
12 transgenic rice lines created by particle bombard¬ 
ment, which had been shown to contain rearranged 
transgenes. Interestingly, they found that an imperfect 
palindrome in the CaMV 35S promoter was involved in 
one third of all rearrangements, i.e., the sequence of 


this palindrome was adjacent to the rearrangement 
junction. Similar phenomena have been noted in T- 
DNA transformants containing the same promoter. 
This sequence has the ability to adopt a cruciform 
structure that may stimulate recombination events. 
Many other promoters contain palindromic sequences 
of variable length within 100 bp of the transcription 
start site. The secondary structures formed at these sites 
enable DNA-protein interactions for transcription 
under normal circumstances, but may also participate 
in aberrant recombination events. The fully sequenced 
papaya genome [123] also revealed a number of previ¬ 
ously unidentified transgene rearrangements, i.e., 
a 1,533-bp fragment comprising a truncated, 
nonfunctional tetA gene and flanking vector backbone 
sequence, and a 290-bp nonfunctional fragment of the 
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nptll gene, in addition to the intact, primary transgene 
conferring virus resistance. 

Transgene Silencing 

A common issue raised in association with nuclear 
transformation in plants is the phenomenon of trans¬ 
gene silencing, where the phenotype corresponding to 
the introduced transgene is not expressed. In the 
absence of a genetic explanation (e.g., an undetected 
mutation or rearrangement), silencing is an epigenetic 
phenomenon that can occur at either the transcrip¬ 
tional or posttranscriptional levels. Transcriptional 
silencing involves the absence of transgene mRNA, 
and often occurs due to the integration of the transgene 
at a genomic position that is already repressed (posi¬ 
tion-dependent silencing). However, transgenes 
in active regions of the genome may also be silenced 
if the promoter region is inactivated by 
hypermethylation, which can occur in response to 
unusual DNA structures and compositions that attract 
de novo methylation, or DNA sequences that allow the 
synthesis of short double-stranded RNA (dsRNA) mol¬ 
ecules. In contrast, posttranscriptional silencing actu¬ 
ally requires transcription to take place, but the mRNA 
is rapidly degraded. This is confirmed by nuclear run- 
on assays, which measure the amount of pre-mRNA in 
the nucleus. Like transcriptional silencing, posttran¬ 
scriptional silencing appears to have evolved as 
a defense against invasive nucleic acids and is also 
triggered by dsRNA, in this case matching the tran¬ 
scribed region. There is considerable cross talk between 
the transcriptional and posttranscriptional silencing 
pathways and if a transgene is homologous to an 
endogenous gene, the silencing effect can spread to 
that gene resulting in a phenomenon known as 
cosuppression. 

Transcriptional silencing may be encountered in 
plants where several copies of the same transgene or 
part thereof are present in the transgenic locus, or 
when the same promoter is used to control several 
transgenes. However, the context is very important. 
There have been plenty of reports describing transgenic 
plants carrying multiple transgenes under the control 
of the same promoter yet showing strong and stable 
expression. For example, although Zhu et al. [133] used 
five different endosperm-specific promoters in corn to 


achieve the high-level expression of carotenogenic 
genes, Naqvi et al. [134] achieved strong expression 
of four genes in the same system using the barley 
D-hordein promoter to control each transgene, with 
no adverse effects. Transcriptional gene silencing 
resulting from repetitive promoter regions is correlated 
with increased promoter methylation [135] and 
appears to be driven by the production of dsRNA 
matching the promoter sequence [136]. This has been 
demonstrated by deliberately expressing dsRNA 
corresponding to the nos promoter in transgenic plants 
carrying a second transgene driven by the nos promoter 
[137] and by creating transgenic plants with 
a transgene locus that triggers both transcriptional 
and posttranscriptional silencing simultaneously, by 
producing dsRNA corresponding to promoter and 
transcribed sequences of different target genes [138]. 
In the absence of deliberately created promoter dsRNA, 
the transcriptional silencing seen in some transgenic 
plants carrying multiple copies of the same promoter 
appears to arise from dsRNA produced either by unfor¬ 
tunate transgene positioning or rearrangements that 
create hairpin structures, or by transgenes with such 
high levels of expression that the polyadenylation 
machinery is saturated. Evidence from many transfor¬ 
mation experiments indicates that there is no simple 
correlation between transgene copy number and 
expression level, with the exception of certain carefully 
controlled experiments using boundary elements. In 
some cases, higher copy numbers have suppressed 
overall expression levels whereas in others higher copy 
numbers have enhanced expression. Where suppres¬ 
sion effects have occurred, it has been suggested that 
“runaway expression” resulting in the generation of 
aberrant RNAs lacking polyadenylate tails has triggered 
potent silencing through the posttranscriptional silenc¬ 
ing pathway [139]. 

The organization of a transgenic locus is difficult to 
control, and it is therefore a common occurrence in 
both Agrobacterium -mediated transformation and 
direct DNA transfer that the juxtaposition of 
transgenes or fragments thereof can result in the crea¬ 
tion of hairpin promoter structures at the DNA level 
that are transcribed into aberrant dsRNA species. Such 
arrangements can be obvious and easy to detect, but 
even where gross rearrangements are absent it is possi¬ 
ble that undetected “micro-rearrangements” are 
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present in the transgenic locus, as observed by Mehlo 
et al. [128] when investigating the structure of 
a transgenic locus in corn generated by direct DNA 
transfer. The siRNAs that trigger RNA- dependent 
DNA methylation are just 24 bp in length, so it is 
conceivable that inverted repeats of <50 bp could be 
sufficient for transgene silencing, and such structures 
would be undetectable using the coarse analysis 
methods typically employed to study transgenic plants, 
such as Southern blot hybridization. The likelihood of 
dsRNA production depends not only on the presence of 
damaged or rearranged transgenes, but also on the 
relative position of intact transgenes, which is itself 
a reflection of the mechanism of transgene integration. 
The organization of integrated T-DNA sequences dif¬ 
fers among Agrobacterium strains, but a common fea¬ 
ture of nopaline-type derivatives such as C58 is the 
preferential integration of T-DNA as dimers with an 
inverted repeat configuration, linked either at the left 
or right borders [140]. Where cotransformation is car¬ 
ried out with two T-DNAs containing different genes, 
the different T-DNAs often integrate as heterodimeric 
inverted repeats, preferentially around the right border 
[141] . If the same promoter is used for both genes, this 
would favor the formation of hairpin structures that 
could be transcribed from the opposite strand. The 
structure of loci generated by direct DNA transfer is 
more variable, but inverted repeat structures involving 
promoter sequences are not uncommon, allowing the 
same silencing mechanism to operate [126]. 

Plastid Transformation Methods 

The introduction of DNA directly into the chloroplast 
genome is considered beneficial for a number of rea¬ 
sons including the high level of gene expression that 
can be achieved, reflecting the presence of thousands of 
chloroplasts in photosynthetic cells and the absence of 
gene silencing. Chloroplast transformation also pro¬ 
vides a natural containment method for transgenic 
plants, since in most crops the transgene cannot be 
transmitted through pollen (reviewed in [142]). 

The first reports of chloroplast transformation were 
serendipitous, and the integration events were found 
to be unstable. For example, an early experiment in 
which tobacco protoplasts were cocultivated with 
Agrobacterium resulted in the recovery of one 


transgenic plant line in which the transgene was trans¬ 
mitted maternally. Southern blot analysis of chloro¬ 
plast DNA showed directly that the foreign DNA had 
become integrated into the chloroplast genome [143]. 
However, Agrobacterium is not an optimal system for 
chloroplast transformation because the T-DNA com¬ 
plex is targeted to the nucleus. Therefore, direct DNA 
transfer has been explored as an alternative strategy and 
efficient chloroplast transformation has been achieved 
both through particle bombardment and PEG- 
mediated transformation (reviewed in [144]). 

Stable chloroplast transformation was first achieved 
in the alga Chlamydomonas reinhardtii , which has 
a single large chloroplast occupying most of the volume 
of the cell [ 145] . Particle bombardment was used in this 
experiment and the principles established using this 
simple organism were extended to tobacco, allowing 
the recovery of stable transplastomic tobacco plants 
[146]. These principles included the use of vectors 
containing chloroplast homology regions, allowing 
targeted integration into the chloroplast genome, and 
use of the selectable marker gene aadA (encoding 
aminoglycoside adenyltransferase) which confers resis¬ 
tance to streptomycin and spectinomycin [147]. Basic 
vectors for plastid transformation include flanking 
sequences and chloroplast-specific expression cassettes. 
Species-specific chloroplast flanking sequences are gen¬ 
erated by PCR using primers designed from the avail¬ 
able chloroplast genomes. The chloroplast expression 
cassette is composed of a promoter, the selectable 
marker, and 573' regulatory sequences to enhance the 
efficiency of transcription and translation. The most 
frequently used integration site is the transcriptionally 
active intergenic region between the trnl and trnA 
genes, within the rrn operon. The first-generation plas¬ 
tid transformation vectors included the pPRV series 
and plasmids pRB94/95, in which both the marker 
gene and the primary transgene have their own 5' 
and 3' regulatory sequences (reviewed in [144]). 
More recent vectors include modified restriction sites, 
lox P sequences for posttransformation marker exci¬ 
sion, and homology regions targeting insertions to 
the rbcL-accD intergenic region [148]. Thus far, chlo¬ 
roplast transformation by particle bombardment has 
been achieved only in crops that allow direct organo¬ 
genesis, and this does not include any monocots 
(reviewed in [142]). 
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Future Directions 

The vast majority of the transgenic plants generated 
thus far carry a single primary transgene plus 
a selectable marker. The transgene integrates randomly 
into the genome, which means it is subject to 
unpredictable position effects that may result in silenc¬ 
ing; the locus structure is also very difficult to control. 
In the future, there will be a stronger emphasis on 
strategies to increase the scope of gene transfer and 
the predictability and preciseness of DNA integration, 
and consequently the likelihood of stable and predict¬ 
able transgene expression. 

Transfer of Large DNA Molecules Using Modified 
Conventional Vectors 

A precise upper limit for T-DNA transfer has not been 
established. It is greater than 50 kbp [2, 3], but using 
standard vectors it is difficult to transfer inserts larger 
than 30 kbp routinely due to instability in the bacterial 
host. However, the analysis of very large genes or the 
transfer of multiple genes linked in series can now be 
achieved thanks to the development of high-capacity 
binary vectors based on the artificial chromosome type 
vectors used in E. coli. 

The first to be described was BIBAC2 [149]. This 
contains an F-plasmid origin of replication and is 
modeled on the bacterial artificial chromosome. The 
basic vector transforms tobacco with high efficiency, 
but the efficiency of transformation drops substantially 
when large inserts are used. This vector has been used 
to introduce 150 kbp of human DNA flanked by 
T-DNA borders into the tobacco genome, although 
virulence helper plasmids supplying high levels of 
VirG and VirE in trans were critical for successful 
DNA transfer. An alternative vector carrying a PI ori¬ 
gin of replication and modeled on the PI artificial 
chromosome was constructed by Liu et al. [150]. This 
transformation-competent bacterial artificial chromo¬ 
some (TAC) vector was used to introduce up to 80 kbp 
of genomic DNA into Arabidopsis, and while there was 
some loss of efficiency with the larger inserts, it was still 
possible to produce many transgenic plants. Both vec¬ 
tors contain a kanamycin resistance marker for selec¬ 
tion in bacteria and hpt for hygromycin selection in 
transgenic plants. Both vectors also contain the Ri 
origin for maintenance in Agrobacterium , and within 


the T-DNA region, the sacB marker for negative selec¬ 
tion, interrupted by a multiple cloning site for trans¬ 
gene insertion. One of the most attractive uses of 
high-capacity binary vectors is for the positional clon¬ 
ing of genes identified by mutation. The ability to 
introduce large segments of DNA into the plant 
genome effectively bridges the gap between genetic 
mapping and sequencing, allowing the position of 
mutant genes to be narrowed down by complementa¬ 
tion. Genomic libraries have been established for 
several plant species in BIBAC2 and TAC vectors [151, 
152] and cloning in high-capacity vectors has been 
simplified by the inclusion of Cre//oxP and Gateway 
site-specific recombination technology [153, 154]. 

Large (80-150 kbp) DNA molecules have also been 
transferred to plants by direct DNA transfer [155], and 
although this is not a routine procedure a novel trans¬ 
formation method has been developed recently, based 
on bombardment with DNA-coated “bioactive beads” 
to deliver up to 150 kbp of DNA into rice protoplasts 
[156]. 

Plant Minichromosomes 

In bacteria, plasmid vectors are maintained as episomal 
replicons to make cloning and isolating recombinant 
DNA a simple procedure. When it comes to expressing 
heterologous genes in eukaryotic cells, episomal vectors 
are widely used to avoid position effects, hence the 
development of yeast episomal vectors, yeast artificial 
chromosomes, mammalian plasmid vectors carrying 
virus origins of replication (e.g., SV40-based vectors, 
herpes virus-based vectors), and plant expression vec¬ 
tors based on plant viruses (all of which replicate epi- 
somally). The yeast artificial chromosome (YAC) 
system is the most relevant in this context because it 
allows genes of any size to be introduced into the yeast 
genome as an independent replicating unit that is 
treated by the cell as an additional chromosome. 
YACs comprise a yeast centromere and telomeres, 
the origin of replication (autonomous replicating 
sequence), and selectable markers. 

More recently, analogous systems have been devel¬ 
oped to maintain genes as episomal minichromosomes 
in plants. These have many advantages for plant genetic 
engineering including the ability to express large 
transgenes or groups of transgenes, and the ability to 
rapidly introduce new linkage groups into diverse 
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germplasm. Carlson et al. [157] created plant 
minichromosomes by combining the DsRed and nptll 
marker genes with 7-190 kb of corn genomic DNA 
fragments containing satellites, retroelements, and 
other repeat sequences commonly found in centro¬ 
meres. The circular constructs were introduced into 
embryogenic corn tissue by particle bombardment 
and transformed cells were regenerated and propagated 
for several generations without selection. The 
minichromosomes were maintained as extrachromo- 
somal replicons through mitosis and meiosis, and 
showed roughly Mendelian segregation ratios (93% 
transmission as a disome with 100% expected, 39% 
transmission as a monosome crossed to wild type 
with 50% expected, and 59% transmission in self 
crosses with 75% expected). The DsRed reporter gene 
was expressed over four generations, and DNA blot 
analysis indicated the genes were intact. 

Gene Targeting (Homologous Recombination) 

Gene targeting is the directed modification of an 
endogenous DNA sequence by homologous recombi¬ 
nation, an efficient procedure in bacteria, yeast, certain 
animal cells, and in the plastid genomes of plants, but 
typically not in the nuclear genome. Only one plant 
species has been shown to undergo efficient nuclear 
homologous recombination and that is the moss 
Physcomitrella patens [158]. Among higher plants, 
low-level gene targeting has been achieved in certain 
dicots with frequencies ranging from 10 -3 to 10 -6 
[159]. However, targeting frequencies of up to 1% 
have been achieved using a T-DNA-mediated gene 
targeting strategy involving a long homology region 
in combination with a strong counterselectable marker 
in rice [160, 161]. 

There has also been interest in the use of zinc-finger 
endonucleases to make targeted double-strand breaks 
in the plant genome, so that homologous recombina¬ 
tion is favored at such sites [162]. The modular nature 
of zinc-finger transcription factors means that recom¬ 
binant DNA technology can be used to “mix and 
match” these DNA-binding domains to create recom¬ 
binant proteins with unique sequence specificities. 
Zinc-fingers are motifs approximately 30 amino acids 
in length which coordinate a Zn 2+ ion and bind to DNA 
sequences 3-bp long. Combining different zinc fingers 


in series allows proteins to be tailor made to bind 
longer DNA sequences. When a nonspecific DNA 
endonuclease is incorporated into such a protein, it 
becomes a targeted DNA cutting tool [163, 164]. The 
recent achievement of targeted transgene integration 
and endogenous gene disruption in corn [165] and 
tobacco [166] using zinc-finger endonucleases pro¬ 
vides a tantalizing glimpse of the future of plant bio¬ 
technology in which precise changes can be made to the 
genome of any plant genome that is amenable to DNA 
transfer. 

Site-Specific Recombination 

Although site-specific recombination has already been 
described as a cloning tool, particularly the Cre/ZoxP 
and Gateway systems for rapid vector assembly, it can 
also be used in transgenic plants to introduce DNA at 
a specific, favorable locus, or remove DNA sequences in 
vivo. The Cre/ZoxP system has been most widely used in 
plants for the controlled excision of selectable marker 
genes after transformation (e.g., [153]), but also for 
controlled transgene insertion (e.g., [167]). Controlled 
integration has been studied in transgenic plants 
already engineered to contain recipient lox P sites 
[168]. In this study, three different recipient wheat 
lines were generated by bombarding plants with the 
lox P sequence, and these were subsequently bombarded 
with a gusA construct also containing flanking lox P 
sequences, and a ere gene. Following transformation, 
about 80% of lines contained gusA at the recipient site, 
many with single-copy transgenes and others with 
concatemers. Both types of locus were stably inherited. 
There was much less variation in expression among the 
single-copy lines [168]. Chawla et al. [169] generated 
18 different transgenic rice lines containing a precise 
single copy of gusA at a designated site. In seven of these 
lines, additional copies of the transgene integrated at 
random sites by illegitimate recombination while 11 
showed “clean” integration by site-specific recombina¬ 
tion only. The single-copy lines were stable over at least 
four generations and showed consistent levels of 
expression, which doubled in homozygous plants. In 
contrast, the multicopy lines showed variable expres¬ 
sion and some fell victim to transgene silencing. Inter¬ 
estingly, where the site-specific and illegitimate 
integration loci segregated in later generations, 
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transgene expression was reactivated in the plants car¬ 
rying the site-specific integration site alone, whereas 
close linkage between the site-specific and random 
integration prevented segregation in other lines and 
the silencing persisted. 

An exciting recent development is the GENE 
DELETOR system, which is a hybrid of the Cre-/oxP 
and FLP-ERT systems. The GENE DELETOR is based 
on a fusion recognition site ( loxV-FRT ), which is ineffi¬ 
cient when both recombinases are expressed but highly 
efficient when either one of the recombinases is expressed 
alone, giving up to 100% efficiency in populations of up 
to 25,000 T1 transgenic tobacco plants [170]. 

Another use for Cre-ZoxP is the simplification of 
locus structure by resolving multicopy loci to a single 
transgene copy [171]. A strategy was developed in 
which the transformation vector contained a transgene 
flanked by lox P sites in an inverted orientation. Regard¬ 
less of the number of copies integrated between the 
outermost transgenes, recombination between the outer¬ 
most sites resolved the integrated molecules into a single 
copy. The principle was proven by resolving four 
multicopy loci successfully into single-copy transgenes. 
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C3 The C3 pathway of photosynthesis, found in most 
plant species, for example, rice, potato, and wheat. 

C4 The C4 pathway of photosynthesis, found in some 
tropical species, for example, maize, sugarcane, 
sorghum. 

PAR Photosynthetically active radiation. Solar radia¬ 
tion in the wavelength region 400-700 nm. 

RUE Radiation use efficiency, the ratio of biomass 
produced per unit radiation intercepted. 


The rate of growth (the rate of the accumulation of dry 
matter) of all plants is entirely dependent on the inter¬ 
ception of energy (electromagnetic radiation) from the 
sun in the wavelength range 400-700 nm. This energy is 
utilized by photosynthesis to synthesize carbohydrates 
and other biological molecules needed for essential plant 
processes. 

The amount of energy intercepted or captured by the 
whole plant and community system (the canopy) is 
determined by the organization of leaves into an efficient 
spatial structure with a large total surface area. The 
amount of radiation captured will determine the rate of 
photosynthesis possible and the rate of growth. However, 
the final growth rate is then determined by losses in the 
system that originate from a number of sources, includ¬ 
ing the type of photosynthetic mechanism, metabolic 
and hydraulic constraints, the relationship between 
photosynthetic source and non-photosynthetic sink 
organ, variability in environmental conditions, and 
limitations imposed by management techniques. 

The discovery that plant and crop growth is closely 
linked to the amount of intercepted radiation led to the 
establishment of methods for measuring radiation use 
efficiency (RUE). RUE is measured as the amount of 
dry matter produced per unit intercepted radiation 
over a given time period and it is often separated into 
key developmental stages within the life cycle of the 
crop. It was quickly established that values of RUE tend 
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to be stable for a given species, growth stage, and 
environment, but there are important differences 
across crop species and plant types. In the absence of 
other factors, RUE will set the theoretical limit to 
biomass production and ultimately crop yield. It is 
now accepted that RUE is a fundamental measurement 
which underpins potential crop productivity and yield 
and it has become embedded into modern methods of 
crop growth analysis. 

In many cases, plants absorb more photosyntheti- 
cally active radiation (PAR) radiation than they utilize 
for growth: given the current emphasis on global food 
security, there is currently much interest in raising the 
RUE of key crops in important agroecosystems. 

Introduction and History 

All green plants use sunlight as their sole energy source 
for assimilation of carbon dioxide into carbohydrates. 
At its most fundamental level, this process is the for¬ 
mation of energy-rich bonds in a form that is easily 
stored, transported, and utilized in most essential plant 
functions. This process is of course photosynthesis and 
it is the sole supplier of energy for almost all human 
nutrition and fuel requirements, via current and pre¬ 
historic photosynthesis. 

The continuing expansion of the human popula¬ 
tion and the steady increase in consumption per capita 
is placing pressure on land availability. A variety of 
factors including urbanization and the erosion of 
existing land certainly necessitate production of more 
food and fuel per hectare with fewer inputs (water, 
fertilizer) and all of this within the uncertainty of 
changes in climate and increased pressure to use land 
which may be currently unsuitable for cultivation. For 
example, 700 million people depend on rice for calo¬ 
rific intake. Each hectare can currently feed around 
27 people; by 2050 this will have to rise to 43 people 
per hectare [ 1 ]. This is compounded by the fact that 
Asian rice yield potential appears to be stagnating [2]. 
Wheat is grown on over 200 million hectares of land 
providing approximately one-fifth of the total calorific 
input of the world’s population, and while there have 
been steady increases in productivity since the green 
revolution, global demand for wheat is predicted to 
increase at a faster rate than the annual genetic gains 
that are currently being realized [3]. Since the highest 


yields of major crops are usually only achieved within 
high input systems, it is not surprising that 
many suggest a reappraisal of the basis of crop 
productivity. 

It seems clear that an increase in the rate of appli¬ 
cation of fertilizer or water is neither desirable nor even 
possible. Nitrogen fertilizer is heavily dependent on the 
continued availability (physically, financially, and 
politically) of fossil fuels. Water is increasingly scarce 
not just in equatorial regions but in temperate regions. 
One way to improve resource use efficiency is to 
increase the rate of overall biomass production and 
this is starting to be associated with yield progress of 
major crops such as rice and wheat [3, 4]. As shall be 
described, an increase in biomass production by crops, 
whilst helping to improve upon current rates of breed¬ 
ing progress in yield potential, would also (1) underpin 
future genetic improvements in adaptive processes, 
(2) improve the relative efficiency of resource use in 
terms of biomass production per unit water, light, and 
fertilizer, and therefore (3) increase the attractiveness of 
using waste products of arable crops for secondary uses 
such as fuel, and (4) if applied universally, reduce the 
competition for land between biofuel crops and food 
crops, and (5) increase the feasibility of using marginal 
land for fuel and food production. 

A History of Radiation Use Efficiency 

Given that it underlies so much of human activity and 
endeavor it is perhaps curious that the relationship 
between radiation and plant growth did not become 
explicitly defined and quantified until relatively recently. 
It has been suggested that a tendency to retain methods of 
classical crop growth analysis during the first half of the 
twentieth century delayed a mechanistic analysis of the 
processes of biomass accumulation [5]. In these early 
studies, growth (crop mass) was described as a function 
of time possibly because of the ease with which mass 
could be measured through the growing season. This 
approach was confounded by its inclusion in measure¬ 
ments of relative growth rate where crop biomass increase 
is closely related to existing biomass. The central role of 
light interception in growth was recognized by Watson 
[6] who suggested a measurement of net assimilation 
rate which worked in some situations but did not 
adequately account for the complex relationship 
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between leaf area and variation in assimilation rate 
(photosynthesis) of individual leaves. 

Another advancement in understanding came with 
the consideration that efficiency of light use declined at 
high levels (light saturation) [7, 8] and that light levels 
were lower at the base of the canopy. In fact, as pointed 
out by Hirose [9], the first mathematical model of can¬ 
opy photosynthesis was first produced by Monsi and 
Saeki in 1953 but this was not recognized for more than 
a decade. This resulted in the first real considerations of 
growth as a function of the amount of light intercepted by 
the plant canopy. At this time, it was observed that the 
amount of light intercepted by the canopy was closely 
related to dry matter accumulation [5]. Despite this 
there were many studies in the 1970s which used inci¬ 
dent light rather than intercepted radiation to make key 
studies in agronomy and ecology. 

The sole use of incident radiation measurements 
will lead to errors in the calculation of radiation use 
efficiency especially before canopy closure when not all 
radiation is absorbed and when crops possess differing 
rates of canopy development. Over the whole life cycle 
of the crop, the proportion of incident radiation that is 
intercepted and available for assimilation can be quite 
small. It was not until the mid 1970s that John 
Monteith established the relationship between accu¬ 
mulated intercepted radiation and accumulated bio¬ 
mass within canopies [10]. This was accompanied by 
experimental data demonstrating a level of conserva¬ 
tion of radiation use efficiency of crop species when 
grown under optimal growth conditions with high 
resource availability and a consideration of the role of 
leaf photosynthetic capacity in crop canopies. This has 
since proved to be a robust approach and has remained 
a central feature of crop growth analysis ever since. 
Many studies attempted to provide values for RUE 
across a wide range of species and growth conditions 
across the world [11, 12]. It was quickly claimed 

(1) that it was possible to attain consistent values for 
a given species when growing conditions were good, 

(2) that suboptimal (e.g., nitrogen or water deficient) 
or stressful conditions caused RUE to decline, and 

(3) that there were clear differences between crop spe¬ 
cies: plants possessing the C4 photosynthetic mecha¬ 
nism had the highest RUE followed by most C3 plant 
species and finally legumes with the lowest RUE values. 
Early data suggested that RUE was a conservative or 


even constant value for a given species. However, data 
published since has demonstrated significant variation 
and it has been pointed out that close attention in each 
case must be paid to the methods of analysis, growth 
conditions, developmental phase, and genotype. 

RUE is commonly used within the crop sciences 
although it is applicable to growth of any autotrophic 
organism and most ecosystems. Its calculation requires 
knowledge of the amount of radiation absorbed in 
a given time period and measurement of the resulting 
biomass and energy content. These are easier to 
measure in a uniform system like a monoculture. 
The historical development of RUE was no doubt 
dependent on the advancement in other areas of sci¬ 
ence such as photosynthetic regulation and the techni¬ 
cal development of instruments to accurately measure 
key attributes, that is, radiation, photon flux density, 
and the rate of photosynthetic carbon assimilation. 

The understanding that RUE is fundamental to 
crop productivity leads naturally to the question of 
whether it can be improved. The observation that C4 
plants possess a higher RUE leads to the notion that 
such a change would give a higher biomass productiv¬ 
ity which could be exploited to increase yield per hect¬ 
are. The green revolution provided a step change in 
agricultural productivity that was able to prevent the 
tragedy of mass starvation in some regions and also to 
support population increase. As outlined here, a step 
change of similar magnitude now would probably 
require an improvement in RUE. 

Determination of Radiation Use Efficiency 

RUE is the ratio between accumulated plant biomass 
and the accumulated radiation intercepted by the crop. 
A note on terminology: some authors use the terms 
“radiation conversion factor” [13] and “radiation con¬ 
version efficiency.” Some authors have pointed out that 
use of the word “conversion” is inappropriate because 
energy is not being directly converted to matter but 
rather converted from one form of energy to another 
(i.e., solar radiation to higher energy state of chloro¬ 
phyll molecules) [5]. The term radiation use efficiency 
(RUE) shall be used. 

To calculate RUE, it is necessary to know the 
amount of radiation arriving at the top of the canopy, 
the amount of radiation intercepted by the canopy, and 
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the biomass accumulated during a given period. 
In many studies, the energy content of the dry 
matter is also required. This section will summarize 
how RUE is measured in a practical sense and tackle 
some of the diversity in approaches that have been 
taken. 

Radiation arrives at the edge of the earth’s atmo¬ 
sphere at a mean rate of 1.4 kj m -2 s _1 , the so-called 
solar constant. By the time photons reach the earth’s 
surface a number of geometrical and atmospheric fac¬ 
tors have reduced this value significantly. Cloud cover 
notwithstanding the greatest influx occurs in tropical 
regions at low latitudes. The greatest measured flux at 
the earth’s surface itself occurs in low latitude regions 
where cloud cover is minimal (20-30°). High latitudes 
can have extremely low solar radiation levels. 

Light is absorbed and scattered by molecules, aero¬ 
sols, and particles in the atmosphere, reducing flux 
even further. Gas molecules such as C0 2 , H 2 0, and 
CH 4 absorb energy at specific infrared wavelengths 
whilst gas molecules 0 2 and 0 3 absorb at lower wave¬ 
lengths. Photosynthesis is restricted to a range of wave¬ 
lengths that approximate to those visible to the human 
eye: 400-700 nm. Higher wavelengths do not contain 
sufficient energy to drive photosynthesis. At ground 
level, this range of wavelengths makes up 49% of total 
solar energy. In other words, the photosynthetic pro¬ 
cess does not use over half of the energy available. 
However, longer wavelengths have an important 
heating effect which raises plant tissue temperature 
and accelerates metabolic and developmental processes 
such as leaf and canopy construction. 

Variation from day to day means that it is necessary to 
integrate measurements of radiation over long time 
periods. This is often done by positioning devices above 
the canopy and below the canopy. Most commonly these 
have been fairly inexpensive tube solarimeters connected 
to a device that continually stores data produced. Reflec¬ 
tion can be measured by inverting the device. Instanta¬ 
neous measurements (spot measurements) of fractional 
interception (/) are also used but these can be mislead¬ 
ing because they are most reliably taken at midday 
during the same, usually sunny, conditions and there¬ 
fore do not account for cloud cover or low solar eleva¬ 
tion and give lower values of / 

The proportion of PAR is higher in scattered light 
than direct beam radiation and so it is useful to 


distinguish between the two and a few commercially 
available devices are capable of doing so. This also 
means that the proportion of PAR changes according 
to solar elevation, although it is around 0.45 at eleva¬ 
tions greater than 30° [5]. Growth in predominantly 
scattered radiation commonly causes an increase in 
RUE. 

Wavelengths used for growth (400-700) are prefer¬ 
entially absorbed by chlorophyll within the canopy 
resulting in transmitted light that is depleted in red 
and blue and enriched in far-red. The formula for 
fractional interception is presented in its simplest 
form by Eq. 1. 


where / = fractional interception, Io = incident radia¬ 
tion, r = reflected radiation, I = intercepted radiation. 

An alternative method for measuring / is spectral 
reflectance. This compares reflectance from the canopy in 
the red band and the near IR band to produce 
a normalized difference vegetation index (NDVI). It has 
been possible to show the relationship between NDVI 
and / in many crops [14] and is a technique that is 
possible to use remotely from satellites and aircraft. 
Although it is clearly convenient and rapid, one must be 
aware that it has the drawback of being a spot measure¬ 
ment. Measurements made from satellites suffer from 
problems of images that deviate from 90° to the earth’s 
surface [15]. 

Photography is also used to estimate / using 
a camera fitted with a fish-eye lens positioned below or 
above the canopy. This has the advantage of 
encompassing a large area, producing permanent image 
and being taken rapidly in the field. This method has been 
shown in many cases to correlate with / measured using 
solarimeters; however, errors arise due to common bias 
in positioning the camera. All indirect measurements 
of/should consider the potential errors that may result. 

RUE is usually calculated by measuring the difference 
in biomass between many consecutive harvests (i.e., accu¬ 
mulated intercepted growth) and plotting this against the 
measured accumulated intercepted radiation. This is usu¬ 
ally a linear relationship and RUE is calculated as the 
slope of this relationship (Fig. 1). The points at which 
measurements are made are critical because RUE varies 
according to developmental state. 
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Crop Radiation Capture and Use Efficiency. Figure 1 

An example of the relationship between accumulated intercepted radiation and accumulation of dry matter by three 
species : the C3 crop rice {circle), the C4 crop maize {square), and the C4 weed Echinochloa glabrescens {triangle). Note the 
higher slope for maize in (a), that maize reached full interception before rice in (b), and that the weed is the first to form 
a full canopy but does not achieve the same amount of biomass as the crop species (Redrawn from Sheehy et al. [1]) 


Care must be taken when comparing values from 

different sources [5, 13]. Some of the common sources 

of variation are as follows: 

1. If root mass is not included then RUE values will be 
lower. However, it is extremely difficult to measure 
root mass directly and it is often ignored or assumed 
to be a fixed percentage of total plant mass. 
Nonuniform growth can therefore be a significant 
source of error. Energy required to grow and maintain 
roots is also a function of the biotic and abiotic 
environment of the soil. 

2. Solar radiation flux density is measured using 
solarimeters, pyranometers, or radiometers which 
usually cover the wavelengths 300-3,000 nm. The 
photon flux density is commonly measured using 
sensors that detect within the photosynthetically 
active range of 400-700 nm. Therefore the conver¬ 
sion between energy and quanta-based measure¬ 
ments must take into account the energy of each 
waveband. Data based on different regions of the 
electromagnetic spectrum can cause inaccuracies, 
for example, the transmission of shortwave radia¬ 
tion through canopies differs from that of PAR 
[ 16] . However, the solar spectrum is fairly constant, 
and the reliable figure for this is 4.6 pmol quanta. 
Inaccuracies can arise from badly positioned, 
poorly leveled or uncalibrated instruments. It is 


common practice to keep devices free of anything 
that would attenuate radiation in a way that would 
bias the results, for example, dead leaves are 
removed and glass is cleaned. 

3. It is important to distinguish absorbed from 
intercepted radiation. The latter does not take into 
account the reflection of PAR from the top surface 
of the canopy. This is generally assumed to be about 
5% of total PAR but variation is likely. Not all of the 
plant tissues that absorbed energy are necessarily 
alive or of equal photosynthetic potential. 

4. When comparing different species of plant with 
contrasting harvest organs it is important to consider 
the energy content of the organs involved. Lipid has 
a higher calorific content than protein, which in turn 
has a higher calorific content than carbohydrates 
[17]. The heats of combustion of carbohydrates, 
proteins, and lipids are 17.3, 22.7, and 37.7 kj g -1 , 
respectively. Since RUE is comparing the input of 
energy to the output of dry matter it can vary 
according to the energy content per unit dry matter 
and it is essential to adjust RUE accordingly. For 
example, the RUE of crops that produce particu¬ 
larly oily seeds should decrease during the seed¬ 
filling period. However, the nutrient content of 
plant tissue does not greatly directly influence the 
measurement of RUE. 
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5. The stage of growth can have a large influence on 
the RUE measured (see section Source-Sink Pro¬ 
cesses and Partitioning of Assimilates). It can be 
common practice to calculate the RUE over the 
entire growing season but this does not necessarily 
represent the maximal RUE value. For example, in 
many crops RUE appears to be steady during the 
vegetative period but to decline following the onset 
of the reproductive phase. 

Radiation Capture by Crop Canopies 

Canopy structure and therefore the efficiency of light 
capture was a common feature in the domestication 
and the later improvement of many crop species. For 
example, the reduction of branching in maize and sun¬ 
flower allowed dense planting [18] and there is evidence 
of further adaptation to higher planting density [19]. 
The reduction in height of cereals was a key factor in the 
green revolution, permitting an increase in harvest index 
and reduction in lodging. Crop canopies must intercept 
or capture as much radiation in the 400-700 nm band¬ 
width as possible. Most of the light interception occurs by 
leaves; however, stems, petioles, leaf sheaths, and repro¬ 
ductive structures can also absorb significant amounts of 
radiation. To achieve high interception, they must con¬ 
struct and present a canopy with large leaf area and in 
many crops the minimum leaf area index (L: the ratio of 
leaf area per unit ground area) is around three. The L 
can be expressed by the product of the number of plants 
per unit ground area, the number of leaves per plant, and 
the mean area of leaves per plant (plus green stem area). 
Light can penetrate the canopy and strike the ground 
below as direct beam radiation (so-called sunflecks), 
scattered radiation, or radiation that has passed through 
leaves and other plant organs, that is, transmitted. The 
amount of penetrated radiation is dependent on the 
three-dimensional arrangement of leaves in the canopy 
and for a given L will be a function of a large number of 
features such as leaf size, leaf density per cubic meter, 
the angle of individual leaves, heterogeneity of leaves in 
space (clumping), leaf thickness, and albedo. There is 
clear variation between species and even between 
crop varieties in these features. Additionally, it is pos¬ 
sible to alter these features through management tech¬ 
niques such as planting density and the application of 
NPK fertilizer. Therefore there needs to be a way of 


describing radiation distribution within canopies 
mathematically and linking this to agronomic charac¬ 
teristics such as leaf area index and nitrogen content. 

The relationship between L and fractional intercep¬ 
tion can be described by Monsi-Saeki equation (Eq. 2), 
a modification of Beer’s law. This assumes that the 
canopy is a homogeneous medium whose leaves are 
randomly distributed in space, that is, there is no effect 
of row structure or clumping and under these condi¬ 
tions Beer’s law will apply [9]. The Monsi-Saeki equa¬ 
tion can be applied if the canopy is considered to be 
divided into horizontal layers with each layer 
possessing a particular L and the irradiance within 
each layer is measured. The irradiance at each layer 
will depend on the three-dimensional characteristics 
of the leaves both within that layer and the layers 
above. It has been found that this provides an accurate 
estimate in many crops in which this has been mea¬ 
sured: In (I/Io) against L provides a linear relationship, 
the slope of which gives the value k which provides 
a simple but useful mathematical description of the 
architecture of the crop canopy in question. 

I = Io exp (—CL) (2) 

I and Io as in Eq. 1, L = leaf area index, and k = 
extinction coefficient for a given waveband 

Crops have been found to vary for the value of k and 
the major feature is the erectness of leaves. Canopies with 
erect and narrow leaves such as cereals have a lower value 
of K than those with flat, broad, and horizontal leaves. 
This also applies to differences within species, for 
example, rice varieties vary in erectness [20]. 

If the Monsi-Saeki equation holds and the k value 
for a given canopy type is known then it becomes 
possible to calculate / simply from a knowledge of L 
(Eq. 3). 

/ = 1 - exp(-fcL) (3) 

Equation 3, letters as Eqs. 1 and 2 

This analysis is useful when canopies are considered 
to be a three-dimensional “box” of vegetation with 
radiation penetrating only from above. It can fail 
where plants are sparsely populated and do not achieve 
full ground cover or are simply planted in rows with 
space in between, which is the case for some crops that 
are tended by hand. The calculation of intercepted 
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radiation in these cases can become quite complex and 
impractical at high levels of heterogeneity although 
estimations are often made on the basis of incident 
radiation and leaf area per plant. Another source of 
error is angle of solar elevation which can significantly 
alter the proportion of reflected light and the propor¬ 
tion of scattered light within the canopy. It also does 
not account for the commonly seen variation in leaf 
angle that occurs between the top and the bottom of 
the canopy. Even at a point close to full canopy cover 
there can be a “bimodal” type of variation in crop 
canopies. For example, rice plants tend to form 
“inverted cone” shapes and this causes a complex 
three-dimensional variation in irradiance distribution 
especially during canopy development. 

The optimal “design” of plant canopies must con¬ 
sider not just maximum interception but also the rela¬ 
tionship with photosynthetic rate. In principle, a plant 
could achieve close to 100% interception with a single, 
planar chlorophyll-rich leaf and a I of 1.0. However, 
this does not result in optimum productivity largely 
due to the fact that photosynthesis saturates below 
full sunlight and this is especially marked in C3 
plants. This is a central point: the response of 
leaf photosynthesis to irradiance is shown in Fig. 2. 


This is a useful measurement and easy to make 
with today’s equipment. It provides not only 
a measurement of the maximum rate of photosynthesis 
(Amax) but also the potential quantum yield of 
photosynthesis. 

A greater canopy carbon gain is achieved by reduc¬ 
ing the proportion of leaves in the canopy that exist in 
the light-saturated state. The leaves lower in the canopy 
are retained at light limitation while those at the top 
will be prone to light saturation. If the irradiance 
increases (e.g., moving from cloud-cover to full sun¬ 
light) then those lower in the canopy will be able to 
respond accordingly. This has lead to the widely 
observed phenomenon that when irradiance is plotted 
against carbon-gain (photosynthesis) one commonly 
observes a linear response for canopies but a saturation 
in leaves (normally described by a non-rectangular 
hyperbola) (Fig. 3) [11]. This is important: if canopies 
demonstrated light saturation then this would severely 
limit their ability to improve RUE. 

Canopy Properties and Photosynthesis 

The optimum design for biomass production in terms 
of carbon gain is predicted to be one that permits 
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Crop Radiation Capture and Use Efficiency. Figure 2 

The relationship between photosynthesis and incident radiation levels for (a) single leaves and (b) a full canopy: (a) typical 
features of the leaf response, the linear phase (maximum quantum efficiency), the light compensation point, convexity, 
and the light saturated rate (Amax, and (b) schematic figure: note the linear response of whole canopies in comparison to 
single leaves 
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Canopy depth E, erect canopy e.g. rice F, flat, broadleaf canopy e.g. common bean 





Crop Radiation Capture and Use Efficiency. Figure 3 

Canopy structure and function according to leaf "architecture." (a) The schematic figure on the left shows canopy depth 
where the canopy has been divided into layers, each with a leaf area index (L) of 1.0. Attenuation of light in the canopy 
follows the Monsi-Saeki equation. For a broadleaf canopy, the radiation reaches extinction at a higher point and a lower L 
compared to an upright canopy. The canopy extinction coefficient value "K" is derived from Eq. 2 and provides 
a mathematical description of the link between Fand L. (b) The calculated relationship between canopy photosynthesis 
and L, measured at irradiance saturating at the top of the canopy for species with two values of k. Note the higher 
maximum value of the species with higher k (Redrawn from [9]). (c) A highly schematic figure redrawn and adapted from 
Ong and Monteith [32] and shows the theoretical differences in canopy biomass accumulation over time in species with 
different values of k. Maximum values of carbon gain are achieved after canopy closure which occurs earlier in broadleaf 
species (high k). Species with a lower value of k have higher potential rates of total canopy photosynthesis 


a higher proportion of radiation to penetrate to lower 
layers, whilst also reducing light saturation at the top 
(Fig. 3). Indeed, this seems to be the trend in species 
such as rice and wheat although the advantage here is 
thought to be greatest at lower latitudes where light is 
overhead for a greater proportion of the year and 
therefore penetration into the canopy is greater. This 
also has the advantage that it prevents the unwanted 


senescence of leaves when light levels drop below or 
close to the light compensation point [21]. 

This raises the question of whether there is room for 
improvement in canopy architecture to improve RUE. 
Early work found that canopies with a low L benefitted 
from horizontal posture while those with high L 
benefitted from upright posture [5, 22]. This work 
suggested that RUE would not be sensitive to extreme 
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upright posture of leaves. However, it is clear that 
upright leaves are associated with the highest yielding 
lines in wheat [23, 24] and it has been suggested that 
some cereal canopies may benefit from increasing 
erectness further [3, 23, 25]. There are relatively few 
genes controlling erectness and this should be straight¬ 
forward to test, although work has shown that com¬ 
pensatory effects such as leaf size may be hard to 
account for. 

Modern high-yielding rice lines also have highly 
erect leaves and it is assumed that this also results in 
higher potential productivity. In rice, this also permits 
denser planting which accelerates canopy closure and 
improves total radiation intercepted over a short trop¬ 
ical growing season. 

A further feature is that leaves have different char¬ 
acteristics according to the light intensity to their can¬ 
opy position. Most notably they have “sun leaf” and 
“shade leaf” characteristics, the former having higher 
light compensation points, higher photosynthetic 
capacity, and higher nitrogen and enzyme content. 
This is considered to be an efficient use of resources 
with nitrogen being located only where it is required 
for high photosynthetic activity. Additionally, there are 
a number of acclimation mechanisms to improve light 
harvesting in shade and diffuse light such as the syn¬ 
thesis of light harvesting complexes enriched in chlo¬ 
rophyll b [20, 26]. The efficiency of use of shade and 
diffuse light within the canopy is a relatively 
unexplored area in crops. This raises the question of 
whether leaf ageing or leaf acclimation gives rise to the 
nitrogen content of the leaf. There is evidence that both 
processes are at play: in fact the distribution of nitrogen 
in plant canopies should be closely related to photo¬ 
synthesis and irradiance level [27, 28]. There is some 
indication that this has been achieved although the role 
of “storage” of assimilated nitrogen in leaves needs 
further analysis [20, 21]. 

Canopy development occurs by the successive 
emergence of leaves from primordia which are localized 
points of tissue formed around the apical dome of the 
vegetative shoot. The arrangements of these primordia 
will largely determine the positions of the leaves in the 
final canopy. Therefore there is great interest in 
the genetic manipulation of developmental processes 
in crops [29, 30]. The thermal time interval between 
the initiation of successive primordia is critical and 


determines the time taken for production of successive 
leaves. The genetic and molecular processes determin¬ 
ing leaf development and expansion is an exciting area 
of research [31]. 

For annual crops with a limited growing season, it is 
critical that canopy development is synchronized with 
periods of high radiation. In temperate regions, low 
temperatures in Spring can increase the time taken for 
canopy closure. Since maximum rates of production 
are not attained until full canopy cover is achieved, this 
can be deleterious and is referred to as “lost time” [32]. 
Faster rates of canopy closure can be achieved by dif¬ 
ferent strategies such as the application of nitrogen 
increasing planting density, irrigation, and the planting 
of crops in autumn so that the time taken for establish¬ 
ment is reduced. For example, in the UK, maximum 
radiation receipts occur in June, and for Spring-sown 
crops such as potato and sugar beet, productivity can 
depend on the establishment of a high L before this 
period. In rice, the remarkable erectness of leaves in 
some modern cultivars has allowed for extremely dense 
planting and a shift toward a reduction in tillering. This 
has the advantage that canopy closure can be achieved 
quickly and the so-called lost time is reduced. 

There is no doubt that canopy structure is a central 
feature of crops with high RUE although the precise 
three-dimensional characteristics are difficult to quan¬ 
tify by direct measurement and this has meant that the 
techniques to describe crop canopies mathematically 
have remained relatively simple and continued to 
assume that they exist as randomly distributed photo- 
synthetic elements in space. As shall be described in the 
next section, it will be essential to have a more sophis¬ 
ticated approach which links leaf arrangement and light 
distribution at higher resolutions and accounts for the 
dynamics of both light distribution and localized pho¬ 
tosynthetic responses. For this various photographic 
and laser-based methods are available for “digitization” 
of plant and crop canopies although the high density of 
many crop canopies makes such measurements difficult 
in a practical sense. Mathematical modeling of plant 
canopy structure is progressing rapidly along with 
increased computing power and more sophisticated 
programming [33]. It would seem likely that methods 
that can use empirical measurements to model canopy 
architecture at high resolution and accurately predict 
photosynthetic light responses are not far away. 
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"Conversion" of Captured Radiation: 
Photosynthetic Mechanisms 

The efficiency of radiation use by the whole canopy is the 
sum of photosynthesis occurring by each leaf and each 
leaf portion within the canopy. In turn, the photosyn¬ 
thetic rate of each leaf is determined by the sum of 
photosynthesis in the chlorophyll-containing organelles 
(chloroplasts). Each leaf and organelle will exist in 
a dynamic microenvironment and the photosynthetic 
productivity will depend on the resources available to it 
at a given moment in time (light, water, nutrients). Due 
to limitations placed on the photosynthetic process such 
as light saturation and C0 2 diffusion, the rate of photo¬ 
synthesis at the leaf level is a strong determinant of 
canopy carbon gain. This section will describe the pho¬ 
tosynthetic process, the different types of photosynthe¬ 
sis, and limitations to leaf level photosynthesis. 

The vast majority of chloroplasts are located in 
leaves and they are numerous, with each mesophyll 
cell containing between 50 and 200 chloroplasts [34]. 
Chloroplasts are responsible for absorption of the 
400-700 waveband in crop canopies and are the reason 
that plants appear green. They are sac-like subcellular 
organelles that contain membranous structures that 
possess the chlorophyll used for light harvesting. Most 
of the chlorophyll is contained within pigment protein 
complexes called “light harvesting complexes” and 
these are extremely efficient at absorbing visible light, 
and through a series of resonance transfer mechanisms, 
they pass the excitation energy to a reaction center 
where a special pair of chlorophyll molecules use this 
energy to generate a redox potential capable of oxidiz¬ 
ing water via the oxygen evolving complex. The 
resulting electrons are passed through an electron 
transport chain and used to generate a proton gradient 
that synthesizes ATP. Ultimately, the electrons produce 
a reductant, NADPH. The NAPDPH and ATP are uti¬ 
lized within the Calvin-Benson or photo synthetic car¬ 
bon reduction cycle to reduce C0 2 to triose phosphates 
that are used in hexose and starch synthesis or exported 
from the chloroplast for sucrose synthesis. 

How Efficient Is Crop Photosynthesis? 

It is useful to consider the amount of light energy 
arriving at a canopy top or leaf surface and calculate 
energy losses at each stage based on current knowledge 


of the photosynthetic process in order to estimate the 
potential system productivity [35, 36]. As mentioned, 
only 49% of solar energy is available for photosynthe¬ 
sis. Within the PAR range chlorophyll does not absorb 
strongly in the green band and the reasons for this 
adaptation have been well explored [37]. Green pho¬ 
tons make up around 10% in the PAR range. Red and 
blue photons drive photosynthesis with equal effi¬ 
ciency despite the fact that blue light contains more 
energy per photon than red light. Chlorophyll is excited 
to higher energy states by blue photons but the extra 
absorbed energy is not used to drive photochemistry. 
This extra energy which is calculated to make up 6.6% 
of incident solar energy is effectively lost [36]. 

C3 photosynthesis is the dominant form among 
plant life which uses the enzyme ribulose bis phosphate 
carboxylase (Rubisco) to fix C0 2 where the initial 
product of photosynthesis contains three carbon 
atoms. Rubisco is part of the Calvin-Benson cycle. 
Examples of C3 crop species are rice, wheat, potato, 
soybean, cotton, and chickpea. In C3 photosynthesis, 
three ATP molecules and two NADPH molecules are 
used to assimilate one molecule of C0 2 and regenerate 
the acceptor, RuBP. Zhu et al. [36] provide a calculation 
of the energy required to fix one carbon atom and 
compare this to the energy contained within that one- 
sixth of a mole of glucose. A calculation of the quantum 
requirement of each ATP and NADPH molecule reveals 
that 8 mol of photons are needed for the fixation of 
C0 2 molecule and this represents 1,388 kj of energy 
[36] whilst the energy content of this carbon atom 
within the glucose molecule is 477 kj. 

C3 plants operate the process of photorespiration. 
This is a consequence of the dual reaction of Rubisco: 
in addition to the fixation of C0 2 into carbohydrates, 
Rubisco also reacts with 0 2 to produce glycolate. This 
molecule must be metabolized through the 
photorespiratory pathway, a process that consumes 
ATP, reducing power and releases C0 2 . The product 
of this is phosphoglycerate which reenters the Calvin- 
Benson cycle. The losses caused by photorespiration are 
significant and are greatly dependent on temperature: 
the specificity of Rubisco for C0 2 and the solubility of 
C0 2 relative to 0 2 in water declines with increasing 
temperature. In tropical C3 plants the rate of 
photorespiratory flux can be significant especially 
when stomata limit the internal leaf C0 2 
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concentration. In tropical rice in field temperatures of 
35° C under otherwise optimal conditions, saturating 
leaves with C0 2 resulted in an increase in leaf photo¬ 
synthesis of over 40% [38]. 

A subset of plants called C4 plants have evolved 
a fascinating mechanism which minimizes the 
oxygenase reaction of Rubisco to often insignificant 
levels. It does this by restricting Rubisco to a bundle 
sheath cell, which is oxygen free and non-leaky. Initial 
fixation of C0 2 takes place in mesophyll cells using an 
alternative enzyme, phosphoenol pyruvate carboxylase 
which generates C4 acids. These acids are pumped into 
the bundle sheath cell where they are de-carboxylated, 
and the C0 2 released is concentrated around Rubisco 
in the bundle sheath chloroplasts. This effectively 
reduces the extent of the oxygenase reaction and the 
photorespiratory pathway to insignificant levels. It is 
thought to have evolved in a climate where high tem¬ 
peratures, low C0 2 levels, and low humidity could all 
have been strong selective pressures. The C4 mecha¬ 
nism is largely found in warm regions. Measurements 
show that in many plants the advantage of the C4 
mechanism over C3 starts to become particularly sig¬ 
nificant at growth temperatures above 25°C. C4 
requires additional ATP synthesis which increases the 
quantum requirement of the photosynthetic process to 
12 photons per C0 2 molecule, in comparison with 8 for 
C3 photosynthesis. However, this is more than over¬ 
come by the elimination of photorespiration and low 
activity of Rubisco resulting in a greatly enhanced 
capacity for carbon assimilation at most light intensi¬ 
ties in C4 plants. Other advantages become apparent, 
for example, the higher efficiency of Rubisco means 
that lower amounts of this protein are needed. Since 
Rubisco can make up to 35% of total leaf protein in 
some C3 plants [20], this increases the photosynthetic 
nitrogen use efficiency of C4 plants. The C0 2 concen¬ 
trating mechanism of C4 plants means that stomatal 
conductance does not need to be high and water loss 
can be reduced. 

A substantial “loss” of biomass is mitochondrial 
respiration which has often been divided into growth 
respiration and maintenance respiration in accordance 
with the associated physiological process. Respiration is 
unsurprisingly difficult to measure and to define in 
practice [39]. The relative rates can vary substantially 
according to growth stage. For example, maintenance 


respiration increases substantially as a proportion of 
total carbon flux following ear emergence in barley 
[35]. During vegetative growth, growth respiration is 
substantially higher than maintenance respiration. Res¬ 
piration as a whole can vary from 30% to 60% of total 
carbon exchange and therefore it has a significant 
impact on RUE. Studies have shown that it undergoes 
optimization according to the environment and devel¬ 
opmental state: for example, it is well known that the 
relationship between L and respiration is not linear 
because this would result in a negative net carbon 
balance in the lower, shaded regions of the canopy. 
Attempts have been made to optimize respiratory 
rates to improve crop productivity but this is 
a difficult task because of the problems with measure¬ 
ment (especially with roots) on different scales from 
tissue to whole canopy, and of the separation between 
maintenance and growth respiration. It is often con¬ 
sidered that there is greater scope for the improvement 
of maintenance respiration because this is more sensi¬ 
tive to prevailing conditions. Recent work suggests that 
the possibilities should be revisited [39, 40]. 

At any given moment in time photosynthesis can be 
limited by light level (e.g., under cloud cover or self¬ 
shading), stomatal conductance (this becomes severely 
limiting under conditions of low soil water), and 
Rubisco (under saturating irradiance and high stoma¬ 
tal conductance Rubisco limits photosynthetic rate in 
C3 plants). Interactions between growth condition and 
genotype will determine the final rate of photosynthe¬ 
sis. Growth under optimal conditions is considered: the 
imposition of resource limitation on RUE will be 
discussed in the final section. 

Taking into account the minimum essential losses 
that occur, the percentage of “biomass energy” pro¬ 
duced per unit solar radiation (total solar spectrum) 
for C3 and C4 plants is generally agreed to be 4.6% and 
6%, respectively (calculated on a kj/kj basis) [36]. The 
highest recorded values are 2.4% and 3.7% [36] with 
common measured values being much lower than this, 
for example, [41]. The large difference between mea¬ 
sured and attained photo synthetic efficiency of crop 
canopies is the cause of some debate and some mech¬ 
anisms focusing on metabolic constraints are discussed 
here. It is possible that canopy architecture remains 
a limitation (discussed above). It is however noted 
that much of the observed reduction in RUE and 
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photosynthetic efficiency in agricultural systems is 
caused by growth under suboptimal conditions [36]. 

The above has considered photosynthetic efficiency 
in the context of a leaf canopy. However there is current 
interest in exploiting photosynthesis in non-leaf organs 
such as the green ear in wheat [42, 43] and the spikelets 
in rice. Despite the relatively small surface area (com¬ 
pared to LAI) it seems clear that the high exposure to 
radiation and close proximity to the grain sink may 
mean that their photosynthetic contribution has been 
underestimated [42]. In the case of oil seed rape, the 
photosynthesis in seed pods is the dominant supply 
due to a diminished leaf area [44]. This is a source of 
photosynthate that deserves closer investigation. 

Metabolic and Regulatory Constraints to RUE 

Although photosynthesis underpins growth and yield it 
has often been considered that it is a feature of primary 
metabolism which has undergone optimization 
through natural selection and empirical breeding. 
This view has partly arisen from the conservative 
nature of the basic mechanism and composition of 
the photosynthetic apparatus. However, firstly, it is 
clear that canopy photosynthesis has undergone 
improvement via total leaf area and nitrogen per unit 
leaf area (fertilizer) and, secondly, there is considerable 
genetic variation in leaf photosynthetic capacity and in 
the response of leaf photosynthesis to environmental 
factors and abiotic stress. Nevertheless, the role of any 
improvement in leaf photosynthesis in crop yield pro¬ 
gress has been hard to quantify due to the difficulty of 
routinely measuring net leaf and canopy carbon gain 
and of eliminating compensating processes, although 
some success has been achieved [24, 45, 46]. and pho¬ 
tosynthesis has been successfully linked to yield pro¬ 
gress in some cases, for example, [24]. There is some 
recent evidence that leaf photosynthesis can exert an 
effect when biomass production is the dominant lim¬ 
iting factor [47-49]. Nevertheless leaf photosynthesis is 
considered to be the dominant factor determining RUE 
[5] and its improvement is increasingly viewed as an 
important target [49, 50]. 

Many of the suggested routes to the leaf C0 2 assim¬ 
ilation rate of crops have focused on Rubisco. Increasing 
the amount of Rubisco in the leaf is problematic: it is 
already at extremely high levels and to accumulate more 


would require an increase in nitrogen fertilizer applica¬ 
tion. There are indications that Rubisco may be accumu¬ 
lated to excess capacity in some leaves [51]. There may 
be opportunities to improve the properties of the 
Rubisco enzyme [52] and there is some natural varia¬ 
tion among plants and algae in the properties of 
Rubisco. For example, forms of Rubisco present in 
the genus Limonium have a higher specificity factor 
than that in all crop species [53]. However, there is 
a well-cited inverse relationship between specificity 
for C0 2 and maximum catalytic activity [54]. It has 
been suggested that different forms of Rubisco could 
be assigned different roles within the plant according 
to environmental condition, tailored, for example, to 
high light or low light conditions. Other enzymes in 
the Calvin-Benson cycle have been shown to have 
promise for improvement such as sedoheptulose-1,7- 
bisphosphatase [55]. 

Elimination or reduction in photorespiration has 
been of interest for a long time even though the seem¬ 
ingly wasteful process has been assigned metabolic and 
photoprotective roles. The increased growth rates, bio¬ 
mass, and yield of plants grown under high C0 2 where 
photorespiratory flux is reduced indicates that the pro¬ 
cess is largely wasteful. Methods for blocking the path¬ 
way have proved ineffective [56] probably due to the 
accumulation of intermediates. A recent and novel 
approach has avoided this problem by using bacterial 
enzymes to “shortcut” the pathway [57]. In Arabidopsis 
thaliana plants, this has had the effect of improving 
biomass production and shows great promise although 
the precise mechanism of improved growth has not 
been described yet. 

The greatest improvements in yield of crops native to 
warmer climates would come from the introduction of 
the C4 mechanism into C3 crop species. In the case of 
rice, for example, it has been calculated that this is the 
only way to bring about an increase in biomass produc¬ 
tion sufficient to meet a 50% improvement in yield by 
2050 [1, 58]. Early attempts to introduce elements of the 
C4 pathway into rice by transformation with C4 genes 
from maize or other C4 species [59] are considered 
ambiguous or partially successful at best and introduction 
of the full “Kranz” anatomy seems to be the most likely 
way to achieve the required goal. Some natural C4 mech¬ 
anisms exist in a single cell [60] but it is unclear whether 
this would provide sufficient rates of assimilation. 
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The evolution of the C4 syndrome independently 
on more than 60 occasions in angiosperms would sug¬ 
gest there is no intrinsic reason why the C4 pathway 
could not be introduced into a major crop such as rice 
or wheat. A combination of advanced molecular tech¬ 
niques, transformation of key genes and smart screen¬ 
ing of germplasm may achieve this [61] . Indeed there is 
now a funded international consortium of scientists 
formed to address this task in rice (www.irri.org). 

Recent modeling work suggests that photosynthesis 
may not be optimized in many plant species. For exam¬ 
ple, Zhu et al. [62] used an evolutionary algorithm to 
partition nitrogen between enzymes associated with 
different processes within a plant cell. The combination 
of enzyme amounts and activities that produced 
a higher photosynthetic rate was allowed to proceed 
to the next generation, and after 1,500 generations it 
was found that photosynthesis was substantially 
increased. It seems that an over investment in 
photorespiratory metabolism and an underinvestment 
in enzymes of the Calvin-Benson cycle may be critical. 
It is possible that this represents a lack of adaptation to 
contemporary higher C0 2 levels. 

Photosynthesis is a dynamic process and constantly 
responds to changing environmental conditions. It is still 
debated as to whether the responses observed in situ are 
optimized. For example, when light is absorbed in excess 
of that required for photosynthesis, a series of regulatory 
mechanisms are activated which dissipate the excess exci¬ 
tation energy within the thylakoid membrane [63, 64]. 
This is considered a photoprotective process which 
reduces the likelihood of photooxidative stress. This pro¬ 
cess often has no impact on the light-saturated rate of 
C0 2 assimilation but it does reduce the quantum yield at 
low irradiance levels. Therefore, following a transfer to 
low light (caused, e.g., by cloud cover or leaf and solar 
movement), the slow relaxation of photoprotection 
causes a potential reduction in the rate of C0 2 
assimilation. Given the large variation in irradiance in 
canopies in space and time, this has long been consid¬ 
ered a factor in canopy photosynthesis. Indeed the 
manipulation of photoprotection has been shown to 
influence fitness in A. thaliana [65]. Recently, 
photoprotection was modeled in a tall three- 
dimensional canopy using ray tracing algorithms 
[66]. The reduction in canopy carbon gain was 
predicted to be as large as 30% under low temperature 


conditions. This would seem to indicate that there is 
room for improvement in terms of optimization of 
photoprotection. 

Acclimation of photosynthetic capacity to environ¬ 
mental conditions such as irradiance can occur over 
longer timescales such as days and weeks. It has been 
hypothesized that acclimation conferred an advantage 
in terms of carbon gain [49] . Recent experiments using 
A. thaliana indicate that over long growth periods under 
naturally variable light levels this is indeed the case [67]. 

The question of whether plant and crop responses 
are appropriate for any switch in environmental con¬ 
ditions has been expanded to include those of growth 
and storage of carbon. For example, there is evidence 
that transfer of carbon to storage organs can occur at 
the expense of allocation to new organs [68]. Cross 
et al. [69] observed that accessions that allocated less 
carbon to storage at night had a higher growth rate. 
This concept has yet to be tested in crop species. Genes 
that are involved in the regulation are being identified, 
for example, DELLA proteins are negative growth reg¬ 
ulators of central importance which are thought to 
integrate the effects of various growth-promoting hor¬ 
mones such as gibberellins and have been shown to 
have an effect on rates of tissue growth [70]. 

Source-Sink Processes and Partitioning of 
Assimilates 

This section discusses the different assimilate sinks that 
have either direct or indirect effects on light intercep¬ 
tion (LI) and RUE in crops. As discussed, radiation 
capture is a highly dynamic process affected not only 
by sun angle and fluxes in radiation intensity, but also 
by gross morphological above ground structures that 
evolve over a plant’s life cycle and include leaf area 
development, stem dynamics, and the emergence 
of floral structures. Within crop species, there is con¬ 
siderable interest in genetic effects on morpho- 
physiological traits that affect light interception and 
distribution. Genetic effects include early vigor; stem 
density (m -2 ) and dynamics; leaf anatomy and geom¬ 
etry; the composition, distribution, and duration of 
light harvesting and photosynthetic proteins in the 
canopy; the architecture of floral structures; and the 
continual interaction of these with crop development. 
Crop management is an additional factor which will 
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affect radiation interception principally through row 
spacing, N fertilization, and irrigation, and by control¬ 
ling biotic stresses that may reduce LI. 

Direct Effects of Sinks on RI and RUE 

It is axiomatic that all photosynthetic structures begin 
as carbon sinks while at some point becoming net 
exporters (i.e., assimilate sources). The exact invest¬ 
ment strategy in these photosynthetic structures will 
determine the RUE of the “photosynthetic canopy” as 
a whole. In this context, leaf area index (L) is a useful 
parameter to consider. Typically L values above 3 are 
considered optimal for maximizing RUE in annual 
crops. However, theoretical considerations suggest 
that the relative distribution of leaf area and light 
harvesting and photosynthetic proteins among differ¬ 
ent layers of the canopy can also modulate RUE and are 
currently targets for genetic improvement [25]. 

In the context of canopy photosynthesis, two other 
important sinks with direct impact on RUE are the 
stems and the reproductive structures. Stems, as well 
as being covered with a green leaf sheath, provide the 
skeleton for leaf display and, therefore, to a large extent, 
determine the height and geometry of the leaf canopy. 
Stems are highly dynamic and may appear and disap¬ 
pear within the main part of the crop cycle. In small 
grain cereals, full light interception is typically acceler¬ 
ated through tiller development, a strategy which 
results in the subsequent shedding of nonproductive 
tillers after optimum L is achieved; the investment of 
assimilates in tillers apparently being offset by the 
increased L [71]. The strategy also has benefit under 
favorable years when a larger proportion of tillers 
achieve reproductive success. Reproduction requires 
a further investment of assimilates in floral structures, 
which, though often photosynthetic themselves, may 
also intercept incident light and, therefore, shade other 
photosynthetic tissue. Reducing panicle height for this 
reason has proved beneficial in rice [72]. Genetic mod¬ 
ification of tiller dynamics and spike photosynthesis 
are both promising areas in crop breeding; neither has 
been systematically addressed yet as outlined above, 
and both traits interact with LI and potentially with 
RUE. There is considerable genetic diversity for tiller¬ 
ing capacity in small grain cereals and a tiller inhibitor 
gene (Tin) has also been identified in wheat [73]. 


Considerable morphological diversity is also apparent 
in the reproductive structures of many crop species. 
However, those which are photosynthetic show 
a complex physiology, the measurement of LI and 
RUE is extremely challenging, and studies to establish 
genetic diversity are scant [43]. Nonetheless, shading 
studies in wheat have suggested genetic diversity for the 
contribution of spike photosynthesis to grain filling 
under drought, and given that reproductive structures 
intercept a significant proportion of light in many 
crops, it may be expedient to incorporate them into 
models of canopy photosynthesis. 

Finally, non-photosynthetic sinks such as roots and 
structural components of the plant (stem wall, rachis, 
etc.) may impact on LI and RUE by competing with 
photo synthetic tissue for assimilates, however, these 
effects are not well documented [74]. 

Indirect Effects of Sinks on RI and RUE 

There are other non-photosynthetic sinks which may 
also compete directly with photo synthetic tissue for 
assimilates but whose indirect effects may be much 
more significant in terms of overall RUE of the crop. 
These are (1) the accumulation and remobilization of 
carbohydrates (such as water-soluble carbohydrates in 
wheat and starch in rice) to and from stems and (2) the 
partitioning of assimilates to the reproductive spike. 

While it seems clear that stem storage carbohydrates 
accumulate to provide a buffer against post-anthesis 
stress conditions when current photo-assimilates 
may be insufficient for grain filling [75-77], it is 
unclear what trade-offs may be involved in terms of 
assimilate partitioning. Given that genetic variation for 
storage carbohydrates is large and may constitute up to 
50% of the stem dry weight shortly after anthesis, it 
could affect not only competition for assimilates from 
other sinks (e.g., roots, spike) but also RUE if, for 
example, the demand for stem storage carbohydrates 
is great enough to solicit feedback responses of the 
photo synthetic apparatus. While the latter has not 
been studied, feedback effects that influence RUE 
have been shown in response to partitioning of assim¬ 
ilates to the reproductive spikes. 

A large body of evidence has shown that the num¬ 
ber of reproductive sinks that are set in a crop is the 
main factor determining yield potential [78, 79]. 
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Indeed the post anthesis sink size in wheat has been 
associated with RUE [80]. Sink strength associated 
with grain number is the most likely explanation for 
the relationships demonstrated between yield and pho¬ 
tosynthetic rate, for example, in a historic series of 
wheat cultivars [24]. Furthermore, in other studies in 
spring wheat it was shown using both genetic and phys¬ 
iological treatments that RUE responded to increased 
spike fertility, resulting in increased yield and above¬ 
ground biomass [3]. It has been suggested that one 
way to enhance the sink capacity in wheat is to lengthen 
the stem elongation phase. The stem elongation phase 
encompasses the spike growth period and this would 
therefore result in a heavier spike during this period [3] . 
Another strategy would be to alter the sinks that com¬ 
pete with spike index, such as roots, stems, leaf sheaths, 
and infertile shoots. It must be ensured that any reduc¬ 
tion in leaf lamina does not have an effect on LI and 
RUE. Reducing the allocation of biomass to roots may 
improve RUE by permitting increased partitioning to 
spikes (discussed in Reynolds et al. [3]). Some caution 
is urged since future yields may be dependent on 
increasing the ability to access soil water and nutrients. 
However, there is a possibility that the efficiency of 
uptake of water and nutrients could be improved with 
no change in root mass, for example, partitioning root 
length density at greater soil depths [81]. Structural 
stem carbohydrates (not the reserve carbohydrates 
discussed above) could be reduced by classical methods 
such as reducing plant height although wheat plant 
heights of below 70-90 cm are associated with lower 
biomass [3, 82]. Such approaches must be offset 
against increased lodging susceptibility. Other possible 
targets such as infertile tillers and awns are discussed 
in Reynolds et al. [3]. 

In particular, the maintenance of fertility under 
unpredictable environments is highlighted: in wheat, 
kernel set can be very sensitive to a number of environ¬ 
mental conditions such as moisture stress and irradi- 
ance. This is part of a set of evidence suggesting that 
plant signaling is involved in reducing grain number. 
The signaling (local or long distance within the plant) 
is a well-established phenomenon: the transport of 
molecules regulates growth, partitioning, and metabo¬ 
lism and is a fundamental feature of plant biology, 
likened in some cases to neural networks that sense, 
quantify, and memorize the environment around 


them [49, 68, 83]. The most well-known example is 
that drying soils induce the synthesis of abscisic acid in 
roots, and transport of this hormone to shoots induces 
a reduction in stomatal aperture to increase water use 
efficiency of remaining soil water [84]. It is possible 
that plant responses to environmental events are pre¬ 
emptive certainly but may also be simply too “conser¬ 
vative” predicting unfavorable conditions and setting 
seed size accordingly, while a higher yield can be 
attained by maintaining larger seed number under 
well-managed conditions. Floral abortion in maize 
in response to drought appears to be controlled by 
the up- or downregulation of a few enzymes [85]. In 
wheat, day length can alter sugar supply to fertile florets 
leading to cell death [86], and ethylene is a high- 
temperature signal leading to kernel abortion. Cytoki- 
nin appears to regulate the number of spikelets in rice 
[87]. Therefore, better targeted regulation of grain 
abortion before the onset of seed filling is a potential 
target for improvement of RUE and yield potential 
(Fig. 4). 

Theoretical Considerations Related to the 
Improvement of RUE 

The concept of RUE has received some criticism as 
being one that contains circular reasoning: one cannot 
have growth without biomass accumulation and vice 
versa. Additionally it is the product of almost every 
process in plant canopy growth and development, 
which makes a mechanistic description quite complex. 
However, it is established as a unifying concept in crop 
physiology, for example, [5, 10, 88]. Moreover the 
understanding of the component parts of the system 
and their integration is continually improving, and 
increasingly these parts are not viewed as a “black 
box,” giving more options in the future with regard to 
RUE improvement. 

For a given crop genotype, RUE is largely sensitive 
to leaf photosynthesis (and therefore nitrogen content) 
and the proportion of radiation that is diffuse or direct 
[89]. The latter is a much under-studied area in plant 
and crop physiology. The stability of RUE has been 
used to question whether it is possible to improve 
upon crops with existing C3 or C4 types. 

Theoretical figures were outlined above for the 
maximum efficiency with which photosynthesis can 
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Phenology: 

Time to maximum light interception 
and changes in canopy architecture 
Development and sink strength of tissues 
Time to full canopy closure 
Development and strength of sink tissue 


Canopy architecture: 

Full exploitation of erect habit 
Optimization of leaf posture and N 
distribution/duration 


Partitioning: 

Post anthesis sink strength, strength 
of temporary vegetative sinks 
Optimal partitioning between root, 
shoot and other sink organs according 
to environmental need 



Non-leaf photosynthesis: 

Ear, panicle, stem, culm, leaf sheath 


Metabolic efficiency: 

Optimisation of maintenance 
respiration, optimal partitioning 
between component leaf processes 
e.g. Calvin cycle, photorespiration, 
CH 2 0 synthesis, partitioning in time: 
growth vs storage 


Leaf photosynthesis: 

Photorespiratory flux 
Rubisco engineering 
C4 engineering 
Dynamic responses of 
photosynthesis, photoprotection 


Efficiency of light-limited tissue: 

Efficiency of light harvesting 
process, allocation of resources. 
Efficiency of use of the ‘sunfleck’ 
resource 


Crop Radiation Capture and Use Efficiency. Figure 4 

Summary of the potential target areas which would result in an improvement of crop radiation use efficiency 


operate and it was concluded that crops operate below 
this. However, what is the realistic “spare capacity” for 
RUE improvement in a major crop plant? Reynolds 
et al. [23] estimate potential productivity for the entire 
growth cycle of irrigated wheat in a specific and well- 
characterized location, NW Mexico. Radiation fluxes 
were measured throughout the season. Due to the time 
it takes for canopy establishment, large losses can occur 
before canopy closure, and this was taken into account 
using a model that used measured intercepted radia¬ 
tion and the time before canopy closure. A value 
of 1,748 MJ m -2 was obtained for radiation absorbed 
by photosynthetically active tissue. Estimates of field 
quantum requirement can vary between 10 and 30 mol 
quanta mol -1 C0 2 , taking into account photorespira¬ 
tion and photochemical inefficiency. For a wheat 
crop, best estimates would seem to be in the range 
15-24 mol quanta mol -1 C0 2 [90] which results in 
a range of RUE between 1.5 and 2.6 g carbohydrate 
MJ -1 . The calculated value of biomass ranges from 
2,620 to 4,545 g m -2 whilst the measured value for 


wheat in this environment is up to 2,100 g m -2 . This 
suggests that improvements in field RUE are 
conceivable. 

Sinclair and Horie [88] claimed that the observed 
stability of RUE arose from consistently high light- 
saturated leaf photosynthetic capacity (Amax). They 
plotted the leaf photo synthetic capacity against RUE 
for C3, C4, and leguminous crops types. The saturation 
of this response indicated that further modest increase 
in leaf photosynthetic capacity will only have a limited 
impact on the RUE, hence the stability of the RUE 
response [5]. However under conditions of restricted 
nitrogen, water, or under stress much lower values of 
photo synthetic capacity caused a larger reduction in 
RUE. This has been used as an argument that seeking 
modest improvements in leaf photosynthetic capacity 
may not be worthwhile. As pointed out here it is pos¬ 
sible to trace improvements in biomass production that 
are linked to Amax. Biomass, yield, and Amax were 
associated in irrigated wheat cultivars in warm condi¬ 
tions [23] and in temperate conditions [24]. In the 
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latter case, higher Amax may reflect a feedback 
response caused by greater partitioning to reproductive 
structures - the differences in Amax were greatest dur¬ 
ing grain filling and not well associated with above¬ 
ground biomass. 

There has been a lot of attention paid to the use of 
Amax in the selection of varieties with a higher photo- 
synthetic potential, largely because it is convenient to 
measure and shows genetic variability. However, it 
should be clear from this entry that Amax will mostly 
be expressed at the top of the canopy (most leaves will 
not be at light saturation) and may not be a good proxy 
for canopy photosynthetic rate especially when inte¬ 
grated over long time periods. Care should be taken 
when extrapolating from spot measurements to whole 
canopy photosynthesis. It is not surprising then that 
attempts to improve yield by simply selecting for Amax 
have had largely unsuccessful results. 

Perhaps more importantly, many of the improve¬ 
ments that show great promise are not solely associated 
with capacity at light saturation but rather to the dynamic 
responses of photosynthesis over time and therefore to 
total canopy carbon gain. These processes are much 
harder to measure experimentally although advances in 
techniques such as continual chlorophyll fluorescence 
monitoring may provide breakthroughs. As the knowl¬ 
edge of photosynthetic regulation improves, these should 
be installed into agricultural photosynthesis. It would 
seem necessary to tailor photosynthetic responses to 
improvements in canopy architecture. 

Sources of Variation in Agricultural RUE 

In any comparative analysis of RUE, it is necessary to 
ensure that the methods of analysis are directly com¬ 
parable and provide an accurate estimate that do not 
contain any of the potential errors in measurement 
outlined above. Additionally, consideration must be 
taken when comparing different environments because 
varieties may not be well adapted to their locality and 
therefore able to “express” their maximum RUE. Other 
reviews, for example, Sinclair and Muchow [5] provide 
a comprehensive survey of the literature for many key 
crop species. A summary is provided here of the species 
and of the environmental factors affecting RUE in 
agriculture, and available information for a few key 
species has been provided. 


It is in fact not common to find studies that specif¬ 
ically link RUE with yield progress; however, it has been 
shown, for example, that yield progress in wheat asso¬ 
ciates with both harvest index and total biomass pro¬ 
duction [74]. Some evidence also suggests that older 
wheat cultivars had a lower RUE and this was associ¬ 
ated with a lower post-anthesis sink capacity [4, 80]. 

Species Variation in RUE 

As pointed out by many workers, there is no “constant” 
for RUE and there is considerable variation between 
species. It is important to pay attention to the develop¬ 
mental stage of each crop in question and maximum 
attained RUE is referred to here. The highest recorded 
values for season-long RUE across C4 species are 
3.4 g MJ -1 and those of C3 species 2.8 g MJ -1 [5, 35]. 
Mitchell et al. [13] reported that average values for RUE 
in rice, wheat, maize, and soybean in the vegetative 
stage were 2.2, 2.7, 3.3, and 1.9 g MJ -1 , respectively. 

Therefore the tendency for a distinction between C3 
and C4 crops seems to be consistent. However, care 
must be taken. For example, sorghum, a C4 species, 
possesses a relatively low RUE and this has been attrib¬ 
uted to low leaf N content. Conversely, sugarcane, also 
a C4 species, seems to have an exceptionally high RUE. 
Potato, a C3 species, has shown values for RUE higher 
than all other C3 crops and even some C4 species 
(sorghum). It must once again be emphasized that 
the energy content of dry matter should be taken into 
account. It has been suggested that the high values for 
sugarcane and potato reported are due to the excep¬ 
tionally low energy content of the products (sucrose 
and starch). It is often claimed that sugarcane has the 
highest radiation use efficiency among the plant king¬ 
dom, although there are relatively few studies. 

A few studies have indicated the relatively low RUE 
of rice among C3 crops [13, 91], although there are 
surprisingly few studies available for this species and 
recent suggestions that RUE may not be currently 
closely linked to high yield potential [92]. 

Leguminous species such as soya bean have lower 
RUEs and a high percentage of PAR utilization. This is 
due to the higher lipid and protein content (see earlier 
section). However, even when the vegetative stage alone 
is considered, soybean has shown lower RUE values 
than wheat, rice, and maize [13, 93]. This has also 
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been observed in other grain legumes [5]. RUE of 
legumes may be an important future target. For 
detailed information on RUE values for each species, 
the reader is referred to detailed reviews such as [5]. 

In recent years, there has been much interest in 
“energy crops ” These are crops that are grown for the 
sole purpose of fuel production or combustion for 
energy generation. This has been stimulated by concerns 
over emissions of greenhouse gasses and the growth of 
crops represents a C0 2 -neutral strategy. Drawbacks 
include the clear competition with food supply and 
the possible threat to natural vegetation. Nevertheless 
it has been shown that certain species have the potential 
to be extremely productive in this regard, especially C4 
grasses such as Miscanthus [94]. Miscanthus has the 
highest annual primary production of any crop species 
producing 50% more biomass than corn due to a high 
leaf area and longer duration, although the energy 
conversion rate is about the same [95]. Other examples 
of energy crops under study are willow, poplar, and oil 
crops such as Jatropha , although radiation use effi¬ 
ciency of this category of crop species is not well stud¬ 
ied. These crops may find an application on land 
unsuitable for food crops and low input environments. 

Developmental Stage 

The stage of crop development has a clear effect on the 
RUE attained. Firstly the photo synthetic potential may 
be dependent on growth stage. Secondly the appear¬ 
ance and disappearance of vegetative and reproductive 
sinks can influence photosynthesis via the presence of 
feedback signaling mechanisms and carbohydrate 
accumulation; this has been discussed in detail above. 
In many species, a lower RUE was observed during 
earlier crop establishment stages in comparison with 
later stages and this was attributed to a lowered photo- 
synthetic capacity [5]. The effect of senescence during 
the post flowering has the effect of reducing photosyn¬ 
thesis, and therefore RUE is usually observed to be 
higher during the vegetative stages than post flowering. 
Recent studies have indicated that post-an thesis RUE 
in wheat is strongly linked to sink size: Older cultivars 
with smaller sinks had significantly reduced RUE dur¬ 
ing this phase [4]. 

There is an indication in some studies that in rice 
this post-anthesis effect is not as pronounced [13]. In 


a recent study, Takai et al. [96] suggested that 
a maintenance of high growth rate and RUE in the 
late reproductive phase of rice is a key to higher grain 
yield potential in this species and may be linked to the 
short tropical life cycle and requirement for rapid pho¬ 
tosynthesis during grain filling. 

Environmental Factors 

There are many practical difficulties involved with 
linking leaf-level photosynthesis with canopy-level 
RUE, even though it is technically straightforward to 
measure leaf photosynthesis. However, increasing the 
photo synthetic rate per unit leaf area of leaves has 
direct relationship with increasing RUE. It is frequently 
observed that the response between light-saturated 
photosynthetic capacity and RUE is curvilinear rather 
than linear. However, as discussed above it is critical to 
consider photosynthetic efficiency at all light levels. As 
a generalization it is fair to say that any factor, biotic or 
abiotic, that reduces the photosynthetic potential of 
a canopy is also likely to reduce the RUE. This applies 
to many of the common factors that reduce growing 
conditions below optimum such as water availability, 
nutrients, and extreme temperature. 

It is not surprising that nitrogen has a significant 
impact upon RUE due to the close and well-established 
relationship with leaf photosynthesis. There is a strong 
nitrogen-dependent effect on leaf area and consequently 
light capture in plants, therefore, nitrogen content per 
unit leaf mass or per unit leaf area is considered. This has 
been the subject of a number of studies in several spe¬ 
cies: Usually a curvilinear response with leaf nitrogen is 
obtained, with RUE increasing up to a saturating value 
beyond which response is limited such as sunflower 
(average canopy leaf nitrogen [97]), soybean (specific 
leaf nitrogen [98] ), and maize and sorghum (leaf nitro¬ 
gen per unit leaf area [99]). However, this relationship 
is not always observed (see [5]). In examining species 
differences, the plant matter energy content and the 
nitrogen-photosynthesis relationship is important. 
For example, C4 plants have a higher potential photo- 
synthetic nitrogen use efficiency [88]. Naturally site- 
variation in soil nitrogen is common and potentially 
a large source of variation in RUE. 

Soil water and atmospheric humidity have the 
potential to reduce photosynthetic rate and therefore 
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RUE. In experiments that imposed soil water deficit, 
reductions in RUE have been observed in some cases 
[100] but not others [100]. It has been suggested 
that the variability in response is a result of variation 
in the extent of the soil water deficit: When this was 
measured in a quantitative manner, it could be 
established that RUE would decline when the level 
of extractable soil water declined below a certain limit 
(in this case 30%) [101]. 

An increase in vapor pressure deficit has been asso¬ 
ciated with lower RUE in some studies. It is noted that 
the impact in RUE was greater than that would have 
been predicted from leaf photosynthesis alone and it 
was suggested that environmental factors associated 
with vapor pressure deficit are involved. Other studies 
indicated only a small impact of vapor pressure deficit 
on RUE [5, 102]. 

Temperature has been associated with RUE when it 
has an influence on leaf C0 2 assimilation rate and this 
has also been shown to be related to the physiological 
effects of nighttime temperature [5]. 

Future Directions 

Due to the effects of climate change caused by anthro¬ 
pogenic emissions, an altered environment for crop 
growth is likely to be required [103]. Some of these 
changes can be predicted with a high probability, for 
example, C0 2 is rising and is likely to continue at 
a similar rate to the current one with a slowing of rate 
according to internationally agreed emission cuts. 
Since 1750, atmospheric C0 2 has risen by around 
100 ppm and continues to rise at the rate of around 
2 ppm per year. By the end of the twenty-first century, 
temperature is predicted to rise by between 0.5°C and 
4°C in response to levels of the greenhouse gasses C0 2 , 
methane (CH 4 ), and nitrous oxide (N 2 0). Agriculture 
itself is a major contributor to greenhouses gasses. 
Studies have already indicated that climate change 
is having an effect on productivity of crop systems 
[103-105]. 

These changes must be considered according to 
geographical region and in the context of deleterious 
effects such as a likely reduction in water availability, 
a rise in pollutants such as ozone, and the increasing 
frequency of extreme damaging events such as 
flooding, drought, and storms. They must also be 


weighed against possible beneficial effects such as 
higher photo synthetic rates induced by elevated C0 2 
and a longer growing season at higher latitudes leading 
to higher biomass and higher yield. Accordingly, 
a reduction in yield in the tropics and a rise in temper¬ 
ate regions may be experienced. The effect of most of 
these factors on RUE has been discussed above and 
a detailed discussion of the wider impact of climate 
change on agriculture is beyond the scope of this arti¬ 
cle. Clearly suboptimal growing conditions will poten¬ 
tially reduce RUE and yield. However, it is important to 
ensure that the crops are adapted appropriately to 
future climates. 

Experiments using completely enclosed or open 
topped chambers in the field in which C0 2 levels are 
elevated above ambient have shown significant 
increases in growth and yield. The reasons are clear 
and related to the suppression of photorespiration 
and consequently an increase in RUE. Additional ben¬ 
eficial effects related to a lowered stomatal conductance 
[25, 106]. These data have been used in models that 
predict an increase in future crop production in many 
northern regions. Experiments with more realistic field 
conditions (Free Air C0 2 Enrichment, FACE) have also 
demonstrated an increase crop yields but these were 
lower than those predicted by chamber experiments. 
This is lowered further when additional effects such as 
ozone and disease are accounted for and the net result 
may be negative [107, 108]. This suggests that the 
predicted beneficial effects of an increased atmospheric 
C0 2 on plant growth have been overestimated. How¬ 
ever, as pointed out by Long et al. [107] there is still 
a paucity of data for C0 2 enrichment effects in many 
agroecosystems so a level of uncertainty remains. 

There is also a need for further work into the adap¬ 
tation of crops to high C0 2 levels: Theoretical work by 
Zhu et al. [62] using evolutionary algorithms to predict 
optimal levels of C0 2 for photosynthesis have shown 
that the relative levels of enzymes involved in primary 
metabolism are not optimized for carbon assimilation. 
It is possible that there has been insufficient adaptation 
during crop improvement since the start of the indus¬ 
trial revolution, raising the intriguing possibility that 
plants should be bred or engineered for adaptation to 
higher C0 2 levels of the future in order to maximize 
yield and RUE. Presumably this also applies to other 
environmental changes. 
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Therefore it is clear that there is a demand for greater 
crop yield per unit area of land with fewer resources 
available. This step change in yield will arise from an 
integrated set of targets, most of which have been 
outlined in this article. Assuming that management of 
the crop is optimal and improvement of partitioning and 
sink capacity reach their limits, it will be increasingly 
necessary to underpin any substantial yield improve¬ 
ments with increased biomass production and this 
means that RUE will need to increase. 

RUE represents a complex system and as pointed 
out by Reynolds et al. [23] , it is a less frequent outcome 
of the genetic recombination events required for breed¬ 
ing, in comparison to manipulation of sink size. It will 
be necessary to focus on methods for improving RUE 
of crops and to introduce this into breeding techniques. 
In addition, there have been significant advances in the 
plant molecular and genomic sciences in the past 
2 decades and these should be exploited to raise RUE. 
The discovery of new genes and combinations of genes 
requires routine screening of large amounts of leaf 
material for photo synthetic potential. It is difficult to 
do this manually using traditional gas exchange tech¬ 
niques; however, there are a number of techniques 
developing rapidly that could be exploited to act as 
surrogates or proxies of C0 2 assimilation rate. For 
example, chlorophyll fluorescence is a rapid method 
that can be used to image leaf material for photosyn¬ 
thetic efficiency extremely rapidly. It is routine to image 
chlorophyll fluorescence; however, it is technically dif¬ 
ficult to do this on a three-dimensional structure such 
as a canopy. Another method is canopy temperature 
depression which measures the difference between leaf 
and air temperature caused by evaporation. It therefore 
provides an indication of stomatal conductance and 
has been correlated with plant performance [109]. 
Considerable recent interest has been placed on the 
imaging plant growth leading to the emergence of the 
field of “plant phenomics.” Advances in methods for 
rapid and high-resolution screening of plant material 
for photosynthetic performance can be anticipated. 
Additionally, the complexity of RUE means that 
a mathematical understanding of the integration and 
functioning of component processes will be critical. 
Given the difficulty of direct measurement, modeling 
of canopy photosynthesis is already used routinely and 
will be needed for the continued testing of new traits. 


The Role in Wheat and Rice Research: Current 
Research at CIMMYT and IRRI 

The challenge of improving photosynthesis and there¬ 
fore RUE in crops during both optimal and suboptimal 
conditions has been recognized by the research organi¬ 
zations that were central to the last great agricultural 
revolution (the green revolution), the International 
Rice Research Institute (IRRI) in the Philippines and 
the International Maize and Wheat Improvement Cen¬ 
tre (CIMMYT) in Mexico. IRRI has a number of 
research programs that are centered toward the sus¬ 
tainable improvement of rice yield in changing envi¬ 
ronments. Perhaps the most relevant here is the C4 rice 
consortium whose goal is a step change in rice photo¬ 
synthesis through the introduction of the C4 pathway 
of photosynthesis (http://beta.irri.org/projectsl5/en/ 
c4rice). The impact this would have on yield and 
resource use efficiency of the poorer rice growing 
regions of the world should not be underestimated. 

Given the urgency of the current food crisis coupled 
with the likely negative impact of climate change on 
productivity, especially in less developed countries 
[110], both IRRI and CIMMYT have strategic initia¬ 
tives to raise the yield potential of rice and wheat, 
respectively, by around 50%. The IRRI C4 rice initiative 
is already underway as mentioned. CIMMYT is cur¬ 
rently facilitating a Wheat Yield Consortium (WYC) 
among a group of leading scientists with the view to 
raising the genetic yield threshold of wheat. A central 
issue is that the fundamental bottleneck to raising 
productivity, namely, photosynthetic capacity, has 
hardly changed since wheat breeding started. Nonethe¬ 
less, basic research in photosynthesis suggests that 
substantial improvements in yield are theoretically pos¬ 
sible. While increasing photosynthetic potential will 
require considerable research focused at cellular and 
subcellular processes (such as the genetic modification 
of Rubisco and its regulation), intervention at this level 
must go hand in hand with modification of structural 
and reproductive aspects of growth, since these will 
determine the net agronomic benefit of increased 
RUE .For example, even at current levels of yield poten¬ 
tial, a significant portion of yield is lost to lodging 
damage each year so improved structural integrity 
will be prerequisite to realizing genetic gains in RUE 
and biomass. The other major factor determining yield 
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potential is adaptation of the reproductive processes 
which affect harvest index [3, 111] and whose physio¬ 
logical and genetic basis is relatively poorly understood 
[112]. The aim of the WYC is to develop or identify 
sources and genes for the combination of traits neces¬ 
sary to realize improved expression of wheat yield 
potential and combine them through strategic crossing 
using the most expedient combination of conventional, 
physiological, and molecular breeding approaches; the 
consortium approach is expected to realize impacts in 
farmers fields in a shorter time frame than if bottle¬ 
necks to yield potential are identified successively and 
investigated in relative isolation. 
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Glossary 

Dehydration avoidance Dehydration avoidance is the 
strategy of the plants that are able to maintain tissue 
water potential as long (and as high) as possible 
under drought conditions. 
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Dehydration tolerance Dehydration tolerance is the 
strategy of the plants that are able to cope with 
severe tissue dehydration. 

Harvest index Harvest index is the biomass of the 
harvested product expressed as a percentage of the 
total crop biomass. 

Photoassimilates Photoassimilates is the energy¬ 
storing carbohydrates produced by photosynthesis 
in the green tissues of the plants. 

Water-use efficiency (WUE) Water-use efficiency 
(WUE) is the carbon gain (or biomass formed) 
per unit of water transpired or the ratio between 
photosynthesis (A) and stomatal conductance (g s ), 
termed as intrinsic WUE. 

Definition of the Subject 

Sustainable intensification of global agriculture is 
a major purpose (and challenge) for twenty-first cen¬ 
tury scientific, social, and political communities, in 
order to guarantee food security, while preserving nat¬ 
ural resources. Fast growing population and climate 
change could lead to a global crisis if efforts from 
different disciplines and countries are not congregated. 
Among limiting factors is water scarcity, which may 
dramatically decrease crop production worldwide. 

Mitigation measures are therefore a major goal for 
sustaining crop production and they are based either on 
management practices that will enable water savings or 
on breeding efforts for more adequate crops. Improved 
physiological and molecular knowledge on plant’s 
response to water deficits is essential to get improve¬ 
ments in crop yield under adverse environments. 
Because of the complexity of these responses, it is 
imperative to integrate disciplines as functional geno¬ 
mics, transcriptomics, proteomics, and metabolomics 
with plant physiology to improve breeding strategies. 

According to the present knowledge, the key factors 
responsible for high yield under drought in annual 
plants include an appropriate phenology that will enable 
escaping drought and getting the timing of flowering 
right, as well as high water-use efficiency (WUE) and 
harvest index. On the other hand, the basic knowledge 
for fruit tree crop breeding under water scarcity is 
much more fragmented than for annuals because of 
the highest complexity of fruit trees. Therefore more 
efforts are needed in this area of research. 


Introduction 

Scarcity of water resources is an increasingly important 
issue since it will dictate global production of food and 
feed for the next generations, as dramatically described 
by The Economist (May 2010): “Water is the new oil: 
a resource long squandered; now growing expensive 
and soon to be overwhelmed by insatiable demand. 
Aquifers are falling, glaciers vanishing, reservoirs dry¬ 
ing up and rivers no longer flowing to the sea. Climate 
change threatens to make the problems worse. Every¬ 
one must use less water if famine, pestilence and mass 
migration are not to sweep the globe.” 

The International Panel for Climate Change [1] 
predicts that water scarcity, together with incidence of 
high temperature, will increase in the near future in 
many regions of the globe with dramatic effects on 
agriculture. Of the world’s water resources, c. 80% is 
currently consumed by irrigated agriculture. In this 
context, the investment in research for improving 
water use by the crops (“more crop per drop” as in 
the CGR program Challenge Program on Food and 
Water) [2], either through improved management 
technologies [3, 4] or more efficient genotypes [5, 6], 
is crucial. Increased water use by the crop can be 
obtained by agronomic practices that decrease water 
losses by soil evaporation, runoff, through-flow, deep 
drainage, and competing weeds, thereby making more 
water available for the crops, and in irrigated agricul¬ 
ture by using deficit irrigation practices. 

Crop resistance to drought has been the subject of 
extensive research in the last decades, ranging from the 
physiological traits underlying resistance to water def¬ 
icits [7-12] to breeding efforts [13, 14] including clas¬ 
sical and genetic engineering approaches [15-17]. The 
world’s most important crops for human and animal 
consumption in terms of total production - wheat, 
maize, rice, soybean, barley, and sorghum - have 
endured large yield increases during the twentieth cen¬ 
tury [18-22] and recently became the target of investi¬ 
gation aiming at improved performance under harsh 
environments, namely water scarcity [5, 6, 15, 23]. To 
articulate the knowledge obtained in different scientific 
disciplines (agronomy, breeding, physiology, and 
molecular biology) still remains a challenge, with the 
ultimate goal of providing farmers with better 
performing crops under water scarcity [7, 14]. 
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Designing robust field trials is also essential in order to 
test improved crops under the multi-stress conditions 
they generally face under natural conditions. Indeed, 
there have been several reports of apparently promising 
biotechnological manipulations that have failed to 
deliver drought-tolerant crops in yield terms, when 
the novel material is transferred from the growth 
room to the field [4], 

The key factors responsible for high yield under 
drought in annual plants are an appropriate phenology 
that will enable escaping drought and getting the 
timing of flowering right, high water-use efficiency 
(WUE) and a high harvest index [14, 24]. To get high 
WUE would involve the improvement in crop transpi¬ 
ration efficiency but the increased partitioning of bio¬ 
mass into the harvested product will also lead by itself 
to higher WUE. The relative importance of each of 
these processes will depend on how water is available 
during the crop cycle [13]. Indeed, it is noticeable that 
under most dryland situations where seasonal rainfall 
is unpredictable, maximizing soil moisture use is 
a crucial component of drought resistance (avoidance), 
which is generally expressed in lower WUE [25]. 

It is also important to recognize that many of 
the traits that explain plant adaptation to drought 
(e.g., phenology, the size and depth of the root sys¬ 
tem, xylem properties or the storage of reserves) are 
mostly constitutive [21, 26] and therefore can be 
found within the species population when it exhibits 
a large intraspecific variability. Moreover, there is 
the general recognition that the efficacy of high 
throughput screening of plant genetic material will 
speed-up crop improvement via breeding programs 
(see, e.g., [19]). 

In fruit crop production, improvement under con¬ 
ditions of water scarcity has been achieved mostly by 
optimized management technologies such as deficit 
irrigation [3, 27], with breeding efforts being much 
more modest. Indeed the basic knowledge for tree 
crop breeding under water scarcity is much more 
fragmented than for annuals because of the highest 
complexity of fruit trees. In tree crops economic return 
is highly dependent on fruit quality, generally not 
related with total plant biomass produced and there¬ 
fore water availability [3] . Moreover, yield-determining 
processes in fruit trees may not be sensitive to water 
deprivation at some developmental stages [28] and 


fruit quality may even beneficiate from a mild to mod¬ 
erate water deficits, as is well documented in grapevines 
[29]. High-density fruit orchards use composite 
plants - the scion cultivar grafted into a rootstock. 
The rootstock can alter the behavior of the scion, 
including its vegetative growth, flower numbers and 
flowering time, crop yield [30], and also drought 
tolerance [31]. The mechanisms underlying the regu¬ 
lation of scion growth and development by the root- 
stock are not fully understood [32]. 

Research in the so-called new climate proof crops 
such as Chenopodium quinoa Willd (quinoa) and 
Amaranthus spp is also developing. Quinoa is an 
Andean seed crop, very nutritious [33], showing high 
tolerance to drought, frost, salinity, and biotic factors 
[34-36]. Quinoa has been selected as one of the crops 
to secure food production in the twenty-first century 
[37]. The protein quantity and the quality of quinoa 
seed is superior to those of cereals, because of the high 
content of lysine, methionine, and threonine, in addi¬ 
tion to a range of vitamins (B2) and minerals (iron, 
calcium). Although amaranth is less tolerant to a range 
of stresses as compared to quinoa, it tolerates higher 
temperatures because it is a C4 plant [38]. The protein 
content of amaranth grain ranges from 12% to 17%, 
with a high level of lysine, similar to quinoa [39]. 

In this review the focus is on the key factors respon¬ 
sible for sustained plant growth and production 
under water scarcity, referring to both annual and 
perennial (fruit) crops. During evolution, plants devel¬ 
oped different strategies to successfully cope with water 
stress; they comprised either acclimation to a slowly 
developing water deficits or the response to a sudden 
drought. These issues will be discussed in the context 
of the agriculture needs and related to production 
(fruit) quality, a major target of modern agriculture 
nowadays. 

Revisiting Plant Strategies to Cope with Water 
Scarcity: The Crop Perspective 

Plants respond and adapt to stress at whole plant, root, 
reproductive structures, and leaves, by using mecha¬ 
nisms that are being unraveled at the cellular and 
molecular levels [7, 40]. 

Classically, plant resistance to drought has been 
divided into escape, avoidance, and tolerance strategies 
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[7, 41, 42]. However, in practice, most plants combine 
a range of these different strategies [43]. 

Plants that escape drought are able to adjust their 
phenology to the environment, being able to complete 
their life cycle before drought occurs. Escape strategies 
rely on successful flowering and therefore reproduction 
before the onset of severe stress. Plant ability to store 
reserves in stems and roots and to remobilize them for 
grain filling under water scarcity is extremely relevant 
for grain crops under water scarcity [44, 45], as it has 
been described in cereals such as wheat, maize, and 
rice [46-48], and legumes [49]. When stem reserve 
storage/utilization is insufficient to support fruit 
growth under stress, delayed-senescence genotypes 
may be desirable in crops where yield is source-limited 
[50] . High nitrogen availability can further increase the 
reuse of stored carbohydrates in cereals under moder¬ 
ate soil drying, leading to increased grain yield; under 
non-limiting water conditions, however, abundant 
nitrogen was shown to reduce grain yield in stay green 
genotypes [47]. 

Dehydration avoidance, which is common to both 
annuals and perennials, implies the maintenance of 
tissue water potential as long (and as high) as possible 
under drought conditions. This can be achieved either 
by (1) minimizing water loss or by (2) maximizing 
water uptake. Decreased water loss may derive from 
stomata closure and/or reduction of absorbed light, 
including leaf rolling [51], increased reflectance (with, 
e.g., trichomes), or steep leaf angles [52]. Shedding of 
older leaves is also relevant to reduce water loss, while 
allowing the reallocation of nutrients to younger leaves, 
when water deficit is relieved. Maximizing water uptake 
may be accomplished by increased investment in the 
roots at the expense of shoot growth (Fig. 1), which is 
generally inhibited very early on in response to decreas¬ 
ing water availability [8, 45]. Deep roots have been 
reported as important drought resistance traits for 
both annuals and perennials in semi-arid regions 
[26]. However, in arid environments, where rainfall is 
sporadic and of short duration, plants may take advan¬ 
tage of shallow roots that proliferate quickly near the 
soil surface allowing water uptake by the plant before it 
evaporates [53]. For example, Zoysia japonica was 
reported as having a relatively shallow root system, 
moderate WUE, while exhibiting a high capacity for 
osmotic adjustment [53]. High root plasticity is a key 


factor in this strategy because it will determine how fast 
roots grow in response to sporadic rain, following 
a period of drought stress. 

Under slowly developing water deficits some plants 
show osmotic adjustment (OA), a cellular stress adap¬ 
tive response that may improve tissue resistance to 
desiccation, by maintaining cell turgor, likely 
supporting crop yield under stress [25]. Because this 
mechanism allows plant water uptake at lower soil 
water potentials it is considered a dehydration avoid¬ 
ance strategy. The alterations observed under OA com¬ 
prise increases in soluble sugars (like fructans and 
sucrose) [54], amino acids (e.g., proline, aspartic acid, 
and glutamic acid), methylated quartenary ammonium 
compounds (e.g., glycine betaine and alanine betaine), 
and some proteins, such as dehydrins [55] and cyclitols 
(e.g., D-pinitol, mannitol) ([56]; see also review [7]). In 
addition to the role played via the decrease in osmotic 
potential, these solutes may protect cell membrane and 
metabolic machinery under dehydration. According to 
Bohnert et al. [57] sequestration of H 2 0 molecules, 
reducing the solvent-protein interaction, may explain 
stabilization of protein complexes and membranes. 

The positive role of osmotic adjustment on yield 
has been subject to much discussion since benefits were 
often not observed (see review by [58, 59]). A possible 
explanation is that turgor maintenance in cells is often 
associated with slow growth. It is also likely that 
osmoprotection mechanisms are not functional until 
severe dehydration occurs, with the implication that 
OA may be critical to survival rather than to promote 
plant growth and crop yield under drought [7]. 

Mechanisms of protection against oxidative stress 
are also fundamental to cope with drought and co¬ 
occurring stresses under arid semi-arid environments, 
as it will be discussed below. 

Photoassimilates, Water Use, and Crop Yield 

As proposed by Monteith [60], yield potential (Yp) of 
a crop at a given location can be defined as: Yp = HI.Pn 
with Pn=Z(PAR*ABSc*cc) and PAR being the incident 
solar radiation in the specific location, ABSc the frac¬ 
tion of the radiation intercepted by the crop, cc the 
efficiency of the conversion of intercepted radiation 
into biomass and Pn the primary productivity (total 
biomass produced over the growing season) and HI the 
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SHOOT and BUD 
-Reduced shoot growth 
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-Flower bud development reduced 
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-Decreased germination potential 
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-Early maturity + delay ripening 
-Increase fruit drop + reduced size 
-Reduced quality 
-Metabolic activity disturbed 


LEAF 

-Root signal recognition 
-Stomatal closure 
-Reduced transpiration 
-Decrease C assimilation 
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-Reduce leaf expansion 
-Metabolic activity disturbed 
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-Increase root depth 
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-early maturity 
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-Decrease invertase activity 
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ROOTS 

-ABA synthesis, ROS 
-Root conductivity 
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Crop Responses to Available Soil Water. Figure 1 

Whole plant response to drought stress in fruit trees (left) and in annuals crops (right). Green boxes and letters correspond 
to shared responses between the species 
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harvest index (see also [61]). Under water scarcity 
a decrease in Yp may occur as a result of less intercepted 
radiation (due to lower total leaf area and smaller leaf 
angles) and a lower sc, as a result of a decrease in 
photosynthesis. 

On the other hand, as judiciously proposed by 
Passioura ([62]; see also [14, 22, 63]), crop yield 
under water-limited conditions can be estimated by 
the product of transpiration efficiency (biomass/water 
transpired) x water transpired (WU) x harvest index 
(HI). Therefore, optimizing yield under such condi¬ 
tions has to be performed by increasing either water- 
use (amount and pattern), transpiration efficiency/ 
WUE or partitioning of more biomass to grain. This 
can be achieved through better water management, the 
adjustment of crop phenology to the environment, or 
genetic improvement. The latter implies getting varie¬ 
ties that can give the so-called more crop per drop, 
either by improved carbon fixation under water deficits 
or having deeper roots to capture more water or 
converting more of the biomass into grain (increasing 
HI). Increasing WUE at the expenses o higher assimi¬ 
lated carbon rather than decreased WU is a desirable 
way to improve YP under water scarcity, but will gen¬ 
erally require biotechnological interventions [17]. 

In the last decades, increases in Yp under non¬ 
limiting water conditions have been achieved with 
genotypes of cotton, wheat, and rice that exhibit high 
stomatal conductance and transpiration. This trait 
allows a decrease in leaf temperature and a greater 
C0 2 fixation per unit of leaf and land area [25, 64]. 
Under water-limited environments crop production 
has relied mostly in dehydration avoidance traits as 
described above, which maximize soil moisture use 
(when it becomes available for example, under sudden 
rainfall), but is generally associated with lower WUE. 
On the contrary, drought resistance traits that reduce 
WU, and are therefore associated with high WUE, will 
unavoidably reduce YP. 

Few successful cases of breeding for water scarcity 
environments have been reported, one being the dry¬ 
land wheat grain yield improvement, with selection for 
high WUE in Australia [65]. This success has been 
explained by the fact that wheat is grown there mainly 
on stored soil moisture. Furthermore, as suggested by 
Blum [2], a major opportunity for yield improvement 
under water scarcity is the control of WU during the 


earlier part of the growing season in order to avoid lack 
of soil moisture later on, during the reproduction 
phase. An attempt to achieve this was done by Richards 
and Passioura [66] by selecting for reduced root xylem 
diameter. 

How Metabolic Processes Respond to Drought: 
Photosynthesis, Respiration, Photorespiration 

Photosynthesis 

Under water scarcity, restrictions in leaf carbon uptake 
take place as a result of increased resistance to C0 2 
diffusion induced both by decreased stomatal aperture 
(Fig. 2) [7, 9] and decreased mesophyll conductance 
[67]. At the whole plant level, total carbon uptake is 
further reduced due to the concomitant or even earlier 
inhibition of shoot growth [8]. Metabolic inhibition of 
photosynthesis usually occurs under more severe stress 
conditions or when various stresses co-occur, such as 
high light and temperature [8, 10], although claims for 
earlier alterations in photosynthetic metabolism are 
reported by Lawlor and coworkers ([12]; see a review 
by [68]). Interestingly, alterations in gene expression 
are also observed very early on, although these alter¬ 
ations are generally not turned into differential expres¬ 
sion of proteins or enzymes in the short term (see 
revision [9]). 

Stomata are able to detect a decrease in water avail¬ 
ability either in the soil or in the atmosphere by feed¬ 
back and feed-forward mechanisms. Feedback 
mechanisms include the response to dehydration in 
the leaf itself that is transmitted to the guard cells, 
either by hydraulic or by chemical signals. Feed¬ 
forward responses are generally mediated by hormonal 
signals and may take place before any alteration in leaf 
tissue water status occurred. They comprise responses 
of guard cells to high vapor pressure deficit, whose 
mechanisms area still not fully resolved [69-71] and 
to dehydration taking place elsewhere in the plant, 
namely in the roots [72]. Hormones, with particular 
relevance to ABA, but also cytokinins and ethylene, 
have been implicated in the root-shoot signaling, 
either acting in isolation or concomitantly. This long 
distance signaling by hormones may be mediated by 
reactive oxygen species [73]. 

Primary events of photosynthesis including elec¬ 
tron transport capacity are very resilient to moderate 
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Recovery Processes (e.g. Membrane fluidity) 

Crop Responses to Available Soil Water. Figure 2 

Direct effects of drought on stomata and mesophyll (g m ) conductance as well as on gene expression, resulting in 
alterations of photosynthetic metabolism and ultimately on plant acclimation (Adapted from [9]) 


drought [74], with the decline in PSII photochemistry 
being explained by a decrease in substrate availability. 
In fact, PSII activity often declines concomitantly with 
carbon uptake under water deficits, suggesting that 
photosynthetic electron chain is finely tuned by avail¬ 
able C0 2 [75]. However, under field conditions, when 
high light co-occur with water deficits, C0 2 depriva¬ 
tion at the chloroplast (driven by stomatal closure) may 
result in the production of excess reducing power and 
therefore in the decline of the quantum yield of PSII. 
These stresses also appear to inhibit the repair of PSII 
through suppression of the synthesis of PSII proteins, 
in particular, the production of D1 protein [76]. Pro¬ 
tection mechanisms against reactive oxygen species 
(ROS) that are formed under such conditions are an 
essential component of plant survival [77]. Such pro¬ 
tection may be achieved by the regulated thermal dis¬ 
sipation occurring in the light-harvesting complexes, 
involving the xanthophyll cycle [78, 79] and the lutein 
cycle [80]. Photoprotective mechanisms compete with 
photochemistry for the absorbed energy, thus leading 
to the downregulation of photosynthesis. If the 


limitation of the rate of C0 2 assimilation is accompa¬ 
nied by an increase in the activity of another sink for 
the absorbed energy, for example, photorespiration 
[81] or Mehler-peroxidase reaction [82], the decline 
in non-cyclic electron transport will be proportionally 
lower than the decrease observed in the rate of C0 2 
assimilation. It was estimated that, in the absence of the 
repair mechanisms, photodamage would lower the 
yield of photosynthesis to less than 5% of the yield 
achieved now [83]. Although these responses have 
mainly been documented in plants native to semi-arid 
regions they may also occur in crop plants, even irri¬ 
gated, when they are subjected to intense heat and 
irradiance, during the summer period. This is likely 
to increase its frequency in the near future. 

As for Rubisco activity it was shown to be very 
resistant to water deficits [84, 85], being generally 
affected only after severe stress [86, 87]. The same 
authors also found evidence that low C0 2 concentra¬ 
tion in the chloroplast (Cc) attained under severe 
water deficits could induce deactivation of Rubisco 
sites. It is further suggested that these effects are 
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species-dependent, with species adapted to low Cc being 
able to maintain active Rubisco longer in response to 
prolonged drought [87]. These findings are compatible 
with earlier data suggesting that decreased sink capacity 
(limited capacity to use photoassimilates), as induced, 
e.g., by shoot growth inhibition under drought, 
might feedback to decrease photosynthesis, namely by 
downregulating the enzymes of the photosynthetic 
carbon cycle [88]. 

When studying the effects of drought on photosyn¬ 
thesis it is also important to recognize that crop pro¬ 
ductivity is dependent on photoassimilates produced 
at the whole plant level, which partly explains why 
often there is no correlation between crop yield and 
leaf photosynthetic rate [61]. Therefore, the impact of 
drought on shoot and canopy growth may be as impor¬ 
tant to crop yield as the effects produced at the 
single leaf level. Indeed, water scarcity by inducing 
a significant decline in total plant leaf area via the 
inhibition of new leaf growth or the earlier senescence 
of older leaves will decrease total carbon uptake by the 
plant. Moreover, lower leaf angles induced by decreased 
leaf turgor will reduce total intercepted irradiance, with 
significant impact on crop yield (Pinheiro and Chaves, 
unpublished results). However, it must be emphasized 
that, at least in cereals (wheat, maize and soybean), 
yield is generally more limited by the sink than by the 
source [61, 89]. This explains why these crops have the 
potential to cope with moderate drought and still fill 
their grains. Of course, this is also possible due to the 
capacity of remobilizing photoassimilates, previously 
stored in the shoot, as was discussed above. 

Respiration and Photorespiration 

The effects of water deficits on dark respiration are still 
unclear, with reports of either decreases, maintenance, 
or increases in the rates of this process (see the review 
[90] ). Inhibition of respiration under drought has been 
observed in mature leaves of crops and herbaceous 
species as well as in roots, and is presumably related 
with a decreased availability of substrate to mitochon¬ 
dria under conditions of low photosynthesis and growth 
(e.g., [86, 91, 92]). This maybe explained by the need of 
herbaceous species to quickly respond to water scarcity, 
thereby lowering their respiration rates in order to 
optimize their carbon gain over shorter periods of time. 


On the contrary, trees and shrubs seem to show 
slower responses to drought than short-lived species. 
As suggested by Flexas et al. [86] and Atkin and 
Macherel [93], a higher demand for respiratory ATP 
(higher respiration rates) maybe required under severe 
water stress to compensate for the lowered ATP pro¬ 
duction in the chloroplasts. The increased maintenance 
respiration will support repair mechanisms needed 
under acclimation to drought that will ensure a better 
performance under extended periods of water scarcity. 

In general, the changes observed in respiration in 
response to drought are smaller than those observed in 
photosynthesis, thereby implying that respiration 
increases proportionally in relation to photosynthesis, 
with likely impact on leaf intercellular C0 2 and on 
plant carbon balance [68]. 

The role of photorespiration during drought stress 
has been scarcely studied [81, 94], partly due to diffi¬ 
culties in quantifying the rate of photorespiration [95]. 
Since photorespiration and photosynthetic metabo¬ 
lism are strictly linked and photorespiration depends 
on the recycling of RuBP in the Benson-Calvin cycle, it 
may be hypothesized that severe drought stress should 
result in lower photorespiration. As suggested by 
Osmond et al. [96], photorespiration and the Mehler- 
peroxidase pathway could protect the photosynthetic 
apparatus against photoinhibition in drought-stressed 
leaves, by sustaining photon utilization in non- 
assimilatory electron flow, when electron consumption 
by C0 2 assimilation is reduced due to low internal C0 2 
concentrations. However, other studies, such as the one 
by Brestic et al. [97] concluded that photorespiration 
was not important for photoprotection in drought- 
stressed French bean. 

In a recent work with glycine decarboxylase- 
deficient plants, Igamberdiev et al. [98] showed that 
photorespiration contributes to stomatal regulation. 
The data obtained with these mutants revealed that 
the photorespiratory mutants were able to decrease 
the rate of photorespiration, but only at the expense 
of increased water loss. Indeed, the necessity to 
maintain a high C0 2 concentration near the site of 
carboxylation in the chloroplasts of plants deficient 
in photorespiratory enzymes required an 
increased opening of the stomata, with a corresponding 
increase in water loss and decrease in water-use 
efficiency. 
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Yield and Quality Under Water Scarcity 

Agriculture depends to a large extent on the success of 
plant reproduction [99]. Drought affects crop produc¬ 
tivity as much as all other environmental factors com¬ 
bined and its impact on crops differs according to the 
attained developmental stage. The effects of water scar¬ 
city are quite different in annuals and perennial species 

[100] . Reproductive development of cereals, from mei- 
osis to seed set, is highly vulnerable to water deficit 

[101] and the meiosis stage appears to be the most 
stress sensitive period of reproduction in all studied 
species [102]. However, anthesis, pollen fertility, polli¬ 
nation, female fertility, and early zygote development 
were also reported to be susceptible and finally their 
failure altered the number and the final quality of grain 

[102] . Later in the development, water stress tends to 
reduce grain size [ 103] . In fruit trees, severe water stress 
during flowering was reflected in the final fruit number 
per tree, whereas water stress during the fruit-growth 
and maturity phases was reflected mainly in fruit size 
[104]. In addition, the timing and intensity of the 
drought period dictate the extent of alterations occur¬ 
ring in the final fruit quality [29]. 

Flower Initiation and Induction 

Grain crops are very sensitive to drought during floral 
initiation and floral pre-meiotic differentiation [105]. 
Water deficit at this stage causes pollen sterility, but 
when stress is severe, it usually affects also female fer¬ 
tility. In cereals, water stress during flower induction 
and inflorescence development leads to a delay in 
flowering or even to its complete inhibition [105]. 
The increase of abscisic acid (ABA) in response to 
water stress has been suggested to play a role in this 
delay [106]. 

In woody plants, flower bud initiation is an impor¬ 
tant feature in fruit tree cultivation. However, the effect 
of drought on floral meristems is among the least 
understood aspects of crop reproductive development 
under water deficit [107]. In species where flowering 
takes place prior to leaf emergence as Prunus trees, 
flower development occurs in the absence of new 
photoassimilates and at the expense of pre-stored 
reserves (starch), either in the flower itself or other 
plant organs [108]. This explains why in Prunus water 
stress effects are only detected in the subsequent 


reproduction event and not in the same year. This is 
the case of peach, apricot, and sweet cherry trees, where 
drought stress during flower bud initiation markedly 
slowed the progression of floral differentiation 
by delaying the differentiation of pistil primordia 
[109, 110]. 

In opposition, water stress has been demonstrated 
experimentally to induce flowering by a direct stimuli 
on stem and bud in citrus [111], apple [112], and pear 
[ 1 13] . In these species, water stress showed to stimulate 
polyamine accumulation, which is linked to floral ini¬ 
tiation, thus enhancing the production of floral 
primordia and re-flowering. However, severe water 
stress induces little flower production. Indeed, under 
severe stress during flower bud induction periods, gib- 
berellins level increased, inducing a low level of flower 
production per shoot [114]. 

Flowering and Pollen Development 

Loss of pollen fertility, spikelet death, and abortion of 
newly formed seed are associated with a decline in the 
water status of the reproductive structures and decline 
of carbohydrate availability [105]. By reducing photo¬ 
synthesis, drought directly interferes with inflorescence 
and flower number, flowers life span, flowers opening, 
and maintenance of floral organs in crops [115] as well 
as maintenance of nectar production in floral organs 
[116]. In addition, water stress reduces flower size and 
sucrose content in nectar [117] leading to an extensive 
loss in yield [118]. 

Drought also affects seed yield of plants through an 
effect on availability and viability of pollen grains 
[119]. The effect of drought on pollen fertility/avail - 
ability also depends on the species and it is considered 
a common symptom in angiosperms [120]. The 
hypothesis that ABA is a primary controlling agent in 
water stress induction of pollen viability still remains 
unanswered [102]. Recently in chickpea [119], water 
stress was shown to affect pollen growth in the pistil 
rather than affecting pollen viability. In fact, pistils 
from well watered plants pollinated with pollen of 
stressed ones showed fewer germinated pollen grains 
and fewer pollen tubes that reached the ovary, which 
suggested that drought has an effect on pollen tube 
growth, which is inhibited in the pistil [119]. Pollen 
viability is highly dependent on sugar unloading and 
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pre-anthesis stem reserve accumulation is considered 
a significant factor affecting flower development in 
water stressed plants (see Fig. 1) [121]. The decrease 
of acid invertase activity under low water potential 
impedes pollen to metabolize incoming sucrose in hex- 
ose in the developing pollen, which might lead to 
pollen sterility [122, 123]. This hypothesis is confirmed 
by the study of Koonjul et al. [124] who observed 
a downregulation of the soluble invertase gene Ivrl 
and Irv5 in wheat microspores, which correlates with 
accumulation of reducing (fructose and glucose) and 
non reducing (sucrose) sugars in the ear [125]. Never¬ 
theless, recent studies by Liu and Bennett [126] showed 
that stress induced sterility in rice is not only caused by 
disruption in sugar metabolism or by desiccation of 
reproductive tissue, but also by disturbance of anther 
pollen development and cell function. In wheat, micro- 
spores lost contact with the tapetum at first pollen 
grain mitosis and the filament degenerated in response 
to water stress, which resulted in total sterility [127]. 
The expression of various proteins related to anther 
wall degradation and cell wall modifications are mod¬ 
ified in stressed rice anther [126]. Furthermore, pollen 
from stressed plants might have a shorter life span 
and reduced vigor, which explain that pollen tube 
growth fails to reach the ovule [128]. The depletion of 
the adenosine triphosphate pool, increased concentra¬ 
tion of hydrogen peroxide, and downregulated tran¬ 
scripts in anthers of drought-stressed rice lead 
to a programmed cell death and may cause pollen 
sterility [129]. 

In woody species, as observed in apricot trees, post¬ 
harvest drought may induce an increase of aborted 
pistils in the subsequent year, decrease germination 
potential of pollen (see Fig. 1), and decrease the per¬ 
centage of fruit set due to an increase of fruit drop 

[130] . In almond, the number of fruiting positions 
per tree is negatively associated with water stress 

[131] . However, drought stress during flower initiation 
does not influence the percentage of spurs that flow¬ 
ered or set fruit during subsequent years. Although 
water stress had no apparent effect on spur mortality 
in the first year, more than a half of spurs died within 
three subsequent years. In addition, water stress 
reduces flower bud development in grapevine [132] 
and peach [133] and increases flower abscission [134] 
and young fruit drop [135]. 


Ovary Development 

Consequences of inhibited photosynthesis under 
drought and therefore insufficient assimilates are par¬ 
ticularly striking around the time of pollination when 
reproductive events occur rapidly, namely ovary 
growth [136, 137]. In maize the pollen does not loose 
availability even at severe water stresses (water poten¬ 
tial below —12.5 MPa) and low kernel number is 
explained by a poor receptivity of silks and/or by an 
ovary abortion. Maize ovaries are normally loaded with 
glucose and starch on the day of pollination [138, 139] . 
Under drought, sucrose, the main translocated product 
from the carbon fixed in photosynthesis, declines (see 
Fig. 1). The enzymes that convert incoming sucrose to 
glucose in ovary lose activity and starch starts to be 
hydrolyzed [137, 138]. Cell wall acid invertase hydro¬ 
lyses sucrose in the apoplast of the upper pedicel tissues 
and its activity creates a steep gradient in glucose con¬ 
centration between the upper pedicel and the nucellus 
of young ovary. Ovary abortion under drought was 
attributed to the decrease of glucose and invertase 
activity in upper pedicel, inhibiting sucrose transport 
[138]. Sucrose delivery decreased first in ovary and 
probably triggers an early downregulation of genes 
coding for sucrose processing enzymes ( INCW2 and 
IVR2). Glucose depletion occurs few days after and 
triggers an up-regulation of putative senescence genes 
as ribosome inactivation protein 2 [139], which suggests 
the beginning of failed ribosome function and later 
induces an up-regulation of phospholipase D1 gene 
leading to loss of plasma membrane integrity, indicat¬ 
ing the onset of senescence [137, 140]. This probably 
causes the irreversible loss in viability found during 
abortion [139]. Feeding sucrose during water stress 
largely prevents these changes, which confirm that 
senescence genes are “monitoring” the sugar status 
of the ovary cells and when sugars content decreases 
these genes turn on in sequence and orchestrate cell 
death [137]. 

A transcriptional study of placenta and endosperm 
of maize under drought showed that both tissues 
responded differentially to water shortage [141]. 
While most of the responding genes in placenta 
involved up-regulation, in endosperm these were 
downregulated. Downregulated genes relate to cell 
division and to endosperm growth, which may explain 
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arrested growth and thus decreased demand of photo- 
synthates [141]. 

These described events are common across crop 
species [137]. Although wheat requires water potentials 
to be much lower than maize to lose the same amount 
of photosynthetic activity [137], starch is also depleted 
in floral structures after drought (see Fig. 1) [142], 
inducing an impairment of ovule function [143]. Also 
in grapevine, drought induces lower availability of 
sugar due to decreased photosynthesis, which provokes 
ovary abnormalities and flower abscission [144]. 

Setter and coauthors [145] suggest an interaction 
between ABA and sugar that might induce a signal 
cascade that leads to the abortion process to initiate. 
The recent finding by Setter and Parra [146] supported 
the model of the interacting influence of carbohydrates 
and ABA at least at the pedicel-placenta tissues of basal 
kernels. Several studies in maize reported that water 
stress increases ABA in florets, suggesting it might 
trigger the abortion process [145, 147]. Young and 
Gallie [148] observed that regulation of programmed 
cell death during maize endosperm development 
involves the interaction of ABA and ethylene signaling 
pathways. However, Boyer and Westgate [99] explained 
that hormone effects are difficult to interpret because 
their expression on dry matter or water content basis 
can cause concentration to increase simply because the 
ovary dry mass or water content decreased, as it was 
shown in maize [149]. 

Yield 

All described abnormalities of flowering, fertilization, 
and zygote development that may occur under drought 
stress induce yield losses, and the timing of the occur¬ 
rence of drought stress determines the degree of dis¬ 
turbance. Water stress during ear formation and milk 
stages in maize was shown to induce early loss of lower 
leaves and a decrease in plant dry matter and in grain 
yield, as a result of reduced intercepted radiation [150] . 
Moreover, all the yield parameters were significantly 
affected. When water stress occurred at pre-anthesis 
a reduction in seed numbers was observed due to 
pollen sterility and ovary abortion. Post-anthesis 
water stress generally enhances whole-plant senescence 
and lead to a faster remobilization of carbon from 
vegetative tissues to seeds, thereby inducing earlier 


maturity and small seeds [119, 151-155]. This is 
a typical strategy of Mediterranean annuals, which 
exhibit a phenological drought-avoidance producing 
seed before water supplies are exhausted [45]. 
Remobilization of pre-stored carbon to the seed and 
acceleration of seed filling rate are associated to an 
alteration in the hormonal balance of grains, namely 
the decrease in gibberellins and the increase in ABA 
[48, 156]. The reduced duration of grain filling may 
partly result from reduced number and size of endo¬ 
sperm cells and therefore reduced capacity to accumu¬ 
late starch following drought stress [ 157] . Additionally, 
seed cell expansion is driven by water uptake [158] and 
when cell expansion in the seed stops, the end of seed 
growth and seed maturation are predetermined. 
Indeed, Yang et al. [152] suggested the enhancing of 
sink strength with water stress is promoted by alter¬ 
ations in sucrose synthase and starch branching 
enzyme activities taking place in grain during stress. 
Severe drought stress during seed filling generally 
resulted in seeds that are shriveled and deformed and 
with a reduced weight [ 152, 159] . The shortening in the 
duration of seed fill limits seed size because grain fill 
process fails to finish in a short maturation period 
[155, 160], consequently reducing seed yield [161]. 
Seed size reduction was highest in late maturing 
wheat genotypes, suggesting that early maturing geno¬ 
types partially escaped late-season water stress [162]. 
However if water stress occurs after cell division is 
completed it does not affect yield since sink size per 
kernel is already predetermined [ 152]. In sesame, most 
drought-tolerant cultivars abort a higher proportion of 
their seeds, and in that case more efficiently shunt 
available nutrient resources to the healthy growth of 
remaining seeds [163]. 

In woody plants and especially in fruit trees, it is 
generally accepted that post-harvest water stress may 
negatively influence fruit set and crop load in the next 
season, as it is the case in peach [164-167], in almond 
[168] and in sweet cherry [ 169] . This reduction in fruit 
set is due to the limitation placed by reduction in the 
accumulation of carbohydrate reserves. In fact, in cul¬ 
tivated Prunus species, floral bud-break and fruit set 
significantly precede net carbon export from leaves 
[167]. In citrus, severe water stress taking place during 
flowering led to decreased fruit number per tree, 
whereas in water restrictions occurring during the 
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fruit-growth and maturity phases, the effects were 
reflected mainly in fruit size [104]. Olive tree has 
a reputation of being a drought resistant crop. How¬ 
ever, drought incidence during winter, leading to a low 
level of soil moisture, may reduce vegetative growth 
[170] and have an important impact on flowering and 
fruit set, resulting in a drastic decrease on olive fruit 
number and size [171, 172]. In fact, severe water stress 
in phase I of fruit growth was shown to reduce meso- 
carp cell division and number [170, 173] suggesting 
a high sensitivity of olive mesocarp cell size to water 
stress. Grapevines are also well adapted to semi-arid 
climates, nonetheless severe water stress during the 
summer largely limit grapevine yield and cluster weight 
[27, 29]. This reduction is mainly observed in drought¬ 
avoiding genotypes, which optimize survival at expense 
of yield and reduced sugar allocation to reproductive 
tissues and thereby reducing fertility [174]. 

The effect of drought in crop plants is enhanced by 
high temperature and effects are usually difficult to 
separate [118]. Differences among genotypes are 
observed; for example barley is much less sensitive to 
short periods of very high temperature than wheat, and 
combined drought and high temperatures are more 
likely than high temperature alone to explain the 
reductions in grain weight observed in the field [175]. 

Fruit and Seed Quality 

Drought influences end-use quality of major food 
crops in the world, as it is the case of wheat [176], 
barley, and maize [177]. Indeed, water stress in wheat 
changes the patterns of proteins [178]; in barley, grain 
(3-glucan content and malt fine extract decreased with 
drought stress [179]; in soybean drought during seed 
development has a large effect on isoflavone concen¬ 
tration in the seeds [180]; in maize, water stress 
increases flavonoid content in seeds but reduces carot¬ 
enoids and phenol content inducing a decrease in the 
antioxidant capacity of kernel oil [181]. In lupin seeds, 
raffinose quantity and accumulation pattern are 
reduced by water stress, suggesting an increase in nutri¬ 
tive values of seeds since raffinose is considered to 
be a major cause of flatulence in animals and 
humans [161]. 

Starch granule shape, volume and structure are 
important factors contributing to starch quality in 


wheat [182]. Post-anthesis water stress affects the pro¬ 
portions of different types of starch granules and 
increases the percentage of small granules per seed 
[183]. 

Protein content has an important implication in the 
processing qualities of cereals since it affects the func¬ 
tional properties of processed wheat products [184]. In 
wheat, grain protein content decreases linearly with the 
severity of drought stress during grain development 
[ 158] . Recently Zhao et al. [184] showed that the inten¬ 
sity of drought dictates the intensity of the effect on 
protein accumulation and reported a differential accu¬ 
mulation of proteins (especially gliadins and glutenins) 
in mature grains. The decrease in the accumulation of 
oil due to water stress was reported in wheat [184], 
soybean [185, 186], lupin [187] and chickpea [188]. 

In fruit trees, drought influences fruit development, 
metabolism, and final composition, and its timing and 
intensity dictate the extent of alterations occurring in 
final quality [29]. With regard to the effect of water 
stress on organoleptic properties, fruit quality is signif¬ 
icantly affected by severe water stress during the fruit- 
growth and maturity periods [104]. Under water 
restriction, fruit maturity tends to be hastened, 
supporting the hypothesis that increasing water restric¬ 
tion might encourage early fruit maturity as observed 
in peach [189, 190]. Post-harvest water deficit was 
shown to increase fruit soluble solid concentration in 
the following season in peach [191, 192]. The opposite 
effect, reducing firmness and soluble solid concentra¬ 
tion, was reported in sweet cherry [169]. 

Water stress was shown to decrease oil content and 
yield in young olive trees and induce lower level of 
phenolics in oil [172]. Olive fruits import assimilates 
from the canopy, but also produce them in situ by 
photosynthesis in the mesocarp; the fruit is capable of 
retaining chlorophyll even when its color change [193] 
and this makes a significant contribution to oil pro¬ 
duction [194]. This “autonomy” of olive fruits may 
explain the maintenance of oil quality in trees under 
water deficits [172]. In fact, although irrigation led to 
decreased contents of some undesirable sensory quali¬ 
ties as phenols, some favorable intense green notes were 
also reduced, which suggests complex effects of water 
stress on olive quality [195]. In grapevine, severe water 
stress influences final composition of the berries by 
delaying their ripening but also through an indirect 
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effect on berry size [196, 197] that affects sugar and 
flavonoids metabolism in the berry (review [29]). 
However, moderate water deficit was shown to have 
beneficial effects on grape berries, for example by 
enhancing photoprotection mechanisms [198] and 
likely having a positive effect on wine quality [29, 
196]. Indeed, moderate water deficit promotes sugar 
accumulation by inhibition of shoot growth with 
a subsequent reallocation of carbohydrates to fruits 
[199] and by the activation of ABA-mediated uptake 
of hexoses [198]. Under such conditions, sugar accu¬ 
mulation in berries accelerates anthocyanin accumula¬ 
tion or/and biosynthesis [200, 201]. The enhancement 
of carotenoids content and the up-regulation of genes 
encoding enzymes involved in biosynthesis of berry 
aroma were also observed after mild stress imposition 
[198, 202] . Similar benefits of mild water deficit in fruit 
quality were also observed in apricot [203], citrus [104] 
and olive [204]. 

The Breeding Achievements 

Conventional Breeding 

To face drought, farm-management practices and plant 
breeding are used for the improvement of crop yield 
[18], being the second approach the most promising in 
the long term [64]. Drought is the most recalcitrant to 
breeders’ efforts due to the complexity of plant 
drought-tolerance mechanisms [205] and breeders 
have no reliable method of distinguishing sensitive 
from tolerant germoplasm other than by measuring 
yield [19]. Therefore, the main objective of a 
drought-tolerance breeding program is to select the 
variety presenting the better yield under stress condi¬ 
tions [206]. However, yield is a trait which is charac¬ 
terized by low heritability, polygenic control, and 
a high genotype x environment interaction (G x E) 
[21,207]. 

During the last 50 years, most of the progress has 
been derived from empirical (conventional) breeding 
[208, 209] by selecting desired traits recognized at the 
phenotypic level. Generally, there is a minimum of 
4 years extensive yield testing to further selection at 
multiple locations before a new variety is released to 
farmers. Empirical selection for grain yield was effec¬ 
tive during the last decades. However, when plants are 
crossed some undesirable traits may be transferred 


along with those of interest - including some with 
negative effects on yield potential. In addition, breed¬ 
ing can only be done between species inter-compatible, 
which limits the new traits that can be added to those 
that already exist in a particular species [206]. Further¬ 
more, the variability of rainfall from year to year 
increases G x E and reduces heritability for yield, 
thereby limiting yield progress [19]. The indirect or 
analytical approach based on an understanding of the 
crop physiology, may help to target the key traits that 
are limiting yield [64] . Such an approach may therefore 
complement conventional breeding programs and has¬ 
ten yield improvement [210-212]. 

A number of physiological studies have identified 
some traits which presence/expression are associated 
with plant adaptability to drought prone environment 
[205] and have been already successfully exploited in 
crop improvement [213]. Increasing water use by 
increasing early shoot vigor to escape dryer periods is 
a common strategy in Mediterranean region [214], but 
in very dry years this strategy may result in reduced 
yield. Early maturity by adjusting crop development 
with seasonal rainfall pattern in environments where 
they experiment terminal drought, or late flowering 
with short grain filling period in environments with 
early season drought, are examples of traits that are 
being used in breeding. Leaf cuticle waxes appear to 
play a key role in day and night transpiration and 
selecting for waxy leaves may reduce water loss and 
prevent leaf senescence during grain filling [215]. 
Enhancement of water extraction, by optimizing root 
architecture that resulted in greater water capture, has 
also the potential to significantly increase grain yield 
[63] . In addition, enhancement of root osmolytes accu¬ 
mulation was used in cereals, but a clear evidence of 
their beneficial role in crop yield was not observed [58] . 

Carbon isotope discrimination (A) is an attractive 
feature for C 3 plants breeders since it may provide an 
indirect sensing of transpiration efficiency (TE) [13]. 
In fact, genotypes showing higher TE (lower A) maybe 
more productive under certain environmental condi¬ 
tions [216]. However, in the Mediterranean region the 
opposite was found and positive correlation between A 
and yield are reported [217], which is probably due to 
the fact that genotypes with lower TE have maintained 
a higher C0 2 conductance due to a better water status 
or by a faster plant development or better access to 
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water [216]. Aerial infrared (IR) methods on vege¬ 
tative material offer the possibility to detect vari¬ 
ation on TE and present the advantage to reduce the 
cost of measuring carbon isotope discrimination 
[19, 216]. 

Fruit tree species propagation is possible only by 
vegetative methods. However, with the spreading of 
different species in new areas, the use of rootstock has 
become the first alternative to adapt scion cultivars to 
soils or climatic conditions that are otherwise not fully 
suited to their cultivation. Climatic areas or soils 
subjected to transient drought conditions require root- 
stock capable of inducing efficient water use by the 
scion cultivar. Unfortunately, there is only limited 
understanding of how rootstock provides tolerance 
to drought [218] and the majority of rootstock breed¬ 
ing programs addresses fruit quality. Traditionally, 
rootstocks have been produced using conventional 
hybridization breeding techniques, which are time 
and space consuming. Initial selection of main root¬ 
stocks is made from open pollinated fruit tree germ- 
plasm, generating thereby intra- and interspecific 
hybrids. The extended use of interspecific hybrids 
is due to the lower susceptibility of this type of root¬ 
stocks to biotic and abiotic stresses and may also 
overcome soil problems as drought [219], which 
might derive from heterosis (hybrid vigor) or from 
the elevation of ploidy levels of these rootstocks 
[220]. However, the use of known drought-tolerant/ 
resistant rootstocks may not fulfill grower’s intents in 
terms of fruit commercial traits, mainly because they 
may provide excess vigor. In addition, the problem of 
graft compatibility of hybrid rootstocks with commer¬ 
cial scion complicated the use of a wide range of culti¬ 
vars [221, 222]. 

New Breeding Approaches 

Compared to conventional approaches, genomics offer 
unprecedented opportunities for dissecting quantita¬ 
tive traits into their single genetic determinants, the 
quantitative trait loci (QTLs), thus paving the way to 
marker-assisted selection (MAS) and, eventually, clon¬ 
ing of QTLs and their direct manipulation via genetic 
engineering. Several studies have reported QTLs for 
roots architecture and investigated their effect on 
yield under water stress in rice [223] and also in 


maize, where QTL reported for leaf ABA concentration 
showed an effect on root size and grain yield [224, 225] . 
In sorghum QTLs related to £ stay green’ trait and yield 
were identified [226]. 

Some QTLs of one character were shown to overlap 
with QTLs of other characters. This is the case in maize, 
where QTLs of leaf growth sensitivity to water deficit 
overlapped with QTLs for leaf responses to evaporative 
demands, thus suggesting that hydraulic mechanisms 
are involved in that response [227]. Similarly, overlap 
between QTLs of leaf growth sensitivity with silk 
growth was observed [228]. QTLs were also reported 
for anthesis-silking interval in maize [15], for seed 
weight and yield under different water conditions in 
rice [229] and durum wheat [230]. It was also possible 
to identify WUE QTLs in brassica [231], rice [232], 
and wheat [233]. However, despite all the recent tech¬ 
nological breakthroughs, the overall contribution of 
genomics-assisted breeding to the release of drought- 
tolerant cultivars has so far been marginal [234] , in part 
due to the fact that a given QTL can have positive, null 
or negative additive effects depending on the drought 
scenario [235]. In addition, cloning QTLs for yield 
maintenance under drought conditions and drought 
physiological traits is limited because the unavailable 
information about the biochemistry of some of these 
traits [236]. 

In recent years, proteomic and transcriptomic 
studies have advanced the basic understanding of pro- 
tein/gene regulatory networks that are active during the 
exposure of plants to water stress [237, 238] and there 
have been several reports of apparently promising bio¬ 
technological manipulations emerging from these tools 
[239]. Remarkable dehydration tolerance has been 
obtained under laboratory conditions using bacterial 
RNA chaperones overexpression [16], NFYB2 class 
transcriptionals regulators [240], as well as modulating 
the expression of dehydration response element bind¬ 
ing (DBF/DREB1) transcription factor [241], using of 
detoxification of reactive oxygen species that accumu¬ 
late under stress [242, 243] and of hormone interven¬ 
ing in drought signaling [244]. Introducing RNA 
silencing in some crops to downregulate poly ADP 
ribose polymerase and thus inducing tolerance to 
wide range of stresses [245] and overexpression of 
cyanobacterial flavodoxin [246] were also reported. 
However, these reports and others have failed to deliver 
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drought tolerance in yield terms when the novel mate¬ 
rial is transferred from the growth room to the field [4] . 

Fruit trees also have benefited from new technolo¬ 
gies and it is now possible to reduce the breeding time by 
MAS. Very recently fruit breeders have begun to exam¬ 
ine the possibilities of using such techniques in the 
production of new rootstocks. The aim is to improve 
an already good rootstock for yield by modifying its 
gene expression or introducing new genes to improve 
resistance to abiotic factors. Unfortunately, the mecha¬ 
nisms by which rootstocks bring about their beneficial 
effects on the vigor and cropping of scions are still very 
poorly understood. Until more research is conducted in 
this field and genes that control such processes are 
identified, progress in this area may be slow. Recent 
studies showed that ectopic expression of the transcrip¬ 
tion factor Osmyb4 gene in transgenic apple trees, 
improved water stress tolerance and might result in 
long term ameliorated productivity. Further experi¬ 
ments are required to assess the agronomic value of 
the plants produced and to verify to what extent the 
expression of Osmyb4 may contribute to enhanced 
drought tolerance under field conditions [247]. 

Summarizing, to date increased drought resistance 
of the major crop plants (sustained yield under water 
scarcity) has been dependent upon the screening of 
a wide range of germoplasm in order to identify genetic 
variation in major traits involved in stress resistance 
[19, 248, 249]. In addition, the effective use by the crop 
of a limiting water supply has been achieved by 
adjusting crop phenology to its environment [2, 14] 
or by using agronomic practices aiming at an improved 
water use, such as deficit irrigation [4, 29]. A variety of 
approaches have been successful, particularly in the 
irrigation of top quality fruit and vineyards [3, 27, 29, 
250, 251] but also in annual crops [252-255]. In addi¬ 
tion to minimizing changes in shoot water status, def¬ 
icit irrigation is able to control the balance between 
fruit and vegetative growth, with likely positive impact 
on fruit quality [8, 256]. 

Future Directions 

Drought has a great impact on the vegetative and 
reproductive development of annual and fruit tree 
crops and thereby impacts on final yield and seed and 


fruit quality. The key factors responsible for high yield 
under drought in annual plants are an appropriate 
phenology that will enable escaping drought and get¬ 
ting the timing of flowering right, high water-use effi¬ 
ciency and a high harvest index. The impact of drought 
is highly complex and involves diverse processes as 
photosynthesis, respiration, reserve accumulation, fer¬ 
tilization, gamete and embryo development, and fruit 
and seed development. Due to the complexity of crop 
reproduction and the still incomplete knowledge of 
mechanisms underlying the response of reproductive 
events under water stress, it has been until now difficult 
to construct a model system to guarantee successful 
reproductive development and high yield under water 
scarcity. How resource availability (namely carbohy¬ 
drate reserve) influence the development of reproduc¬ 
tive organs under water stress is among the least 
understood aspects. On fruit trees, it is also urgent 
to understand (1) how rootstock confers resistance 
and/or tolerance to drought to the scion cultivar to 
better adjust breeding strategies; (2) the key events 
of fruit ripening. In addition, differences between 
species imply different strategies to improve/maintain 
yield and quality under such conditions. Spatial 
strategies also differ: a particular trait may be associ¬ 
ated with higher yield in one environment and may 
have no/or negative effect in other environment [13]. 

Conventional breeding still appears to be the 
most effective for a significant drought-tolerance 
improvement of new cultivars. The application of mod¬ 
ern molecular tools to understand responses of crops to 
water scarcity has only recently begun. In addition, to 
integrate them in breeding programs it is imperative 
to incorporate an accurate measurement of the pheno¬ 
type under water stress. However, the speed of molecular 
technologies is not matched by the speed on 
phenotyping, being the latter one of major limitations 
to improve selection methods for water-limited environ¬ 
ments [19]. Moreover, it is imperative to integrate dis¬ 
ciplines, as functional genomics, transcriptomics, 
proteomics and metabolomics with plant physiology 
to improve breeding strategies. This would allow 
researchers to address drought tolerance/resistance 
through an integrated approach based on the knowledge 
of expression of genes and their products at a particular 
phenological and developmental stage of the crop. 
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Glossary 

Critical plant N concentration Critical plant 
N concentration is defined as the minimum plant 
nitrogen concentration of a crop corresponding to 
its maximum crop mass. 

Critical crop N uptake Critical crop N uptake is 
defined as the minimum crop nitrogen uptake for 
achieving maximum crop mass. 

Harvest index (HI) Harvest index (HI) is the ratio 
between harvested biomass Y (grains, tubers) and 
aboveground crop mass W at crop maturity. 
Intercepted photosynthetic active radiation 
(IPAR) Intercepted Photosynthetic Active Radia¬ 
tion (IPAR) is the proportion of the incident PAR 
which is intercepted by the crop at a given time. 
This proportion is related to the size of the canopy, 


the Leaf Area Index, and depends also on canopy 
structure: leaf angle and geometry. 

Leaf area index (LAI) Leaf area index (LAI) is the total 
canopy leaf area of a crop per unit of soil area. LAI 
allows the estimation of the proportion of the inci¬ 
dent light which is intercepted by the canopy, and 
then which can be used for photosynthesis of the 
whole crop. 

Nitrogen absorption efficiency (NAE) Nitrogen 
absorption efficiency (NAE) is the increase in crop 
nitrogen uptake per unit of supplemental N supply 
rate. 

Nitrogen conversion efficiency (NCE) Nitrogen con¬ 
version efficiency (NCE) is the increase in crop dry 
mass (d W) or in crop yield (dY) per unit of sup¬ 
plemental crop N uptake corresponding to an 
increase in nitrogen supply rate. 

Nitrogen use efficiency (NUE) Nitrogen use efficiency 
(NUE) is the increase in crop dry mass (dW) or in 
crop yield (dY) per unit of supplemental N supply 
rate. So NUE = NAE x NCE. 

N dilution N dilution is the process corresponding 
to more rapid accumulation of nitrogen-free 
compounds than nitrogen compounds within 
plant as plant grows, leading to decline in plant 
nitrogen concentration with plant mass 
accumulation. 

Nitrogen nutrition index (NNI) Nitrogen nutrition 
index (NNI) is an index which allows the estima¬ 
tion of the crop nitrogen status. This index is cal¬ 
culated at any moment as the ratio between the 
actual plant nitrogen concentration of the crop 
and the critical plant N concentration (see this 
definition) corresponding to the actual crop mass. 

Photosynthetic active radiation (PAR) Photo syn¬ 
thetic active radiation (PAR) is the part of solar 
radiation spectrum corresponding to wavelengths 
that are active for photosynthesis. 

Radiation use efficiency (RUE) Radiation use effi¬ 
ciency (RUE) is the ratio between the quantity of 
biomass accumulated within a crop and the quan¬ 
tity of photosynthetic active radiation (PAR) 
intercepted by this crop during the same period of 
time. 

RuBPc-o Ribulose bisphophate carboxylase/ 
oxygenase, the enzyme located within chloroplasts 
which allows the carboxylation of C0 2 . 
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Definition 

A prerequisite for the analysis of crop responses to 
nitrogen (N) is the determination of the plant nitrogen 
content and repartition. How much N is incorporated 
within plants and crops? Within which plant tissue? For 
which physiological function? Thus, according to the 
answers to these questions, it is possible to determine 
a critical plant nitrogen status as the minimum plant 
N concentration that allows the maximum plant (or 
crop) growth rate. It has been demonstrated that this 
critical plant N concentration decreases as plant grows 
as the result of an ontogenetic plant architecture devel¬ 
opment leading to a dilution of N compounds within 
increasing proportion of free-N compounds as plant 
gets bigger. This N dilution process can be formulated 
through a negative power relationship between plant 
N concentration and crop mass. This critical N dilution 
curve allows the discrimination of situations of 
N deficiency (below the curve) and situations of 
N luxury consumption (above the curve). So 
a Nitrogen Nutrition Index (NNI) can be calculated 
for quantifying the intensity of N deficiency or 
N luxury of any crop at any stage of its life cycle. This 
possibility for determination of crop N status and for 
quantification of N deficiency either in terms of inten¬ 
sity or timing allows the complete inversion of the 
approach: instead of response of crop to N supply, the 
problem is to study the response of crop to 
N deficiency. By this way, the check treatment is the 
non-limiting N conditions where crop growth poten¬ 
tial is limited only by genetics and climate. Then the 
effect of intensity and timing of N deficiency period on 
the different plant growth processes and yield compo¬ 
nent elaboration can be studied with a more generic 
approach. This new method for analysis of the effect of 
plant N nutrition on crop yield allows the identifica¬ 
tion of physiological, agronomical, and genetical ways 
for improving nitrogen use efficiency of crops. 

Introduction 

Nitrogen is considered to be the most important lim¬ 
iting factor, after water deficit, for crop production 
worldwide. Over the last 50 years, the worldwide use 
of mineral fertilizers is one of the key elements for 
producing sufficient food to meet the demand of 
increasing human population [1, 2]. During this 


period, the use of mineral N fertilizers in agriculture 
systems increased sevenfold in parallel with the dou¬ 
bling of agricultural food production. This huge 
amount of mineral N is provided by industrial pro¬ 
cesses of chemical reduction of atmospheric nitrogen. 
So even if this resource can be considered as quantita¬ 
tively non-limited, the energy cost of the Haber-Bosch 
process necessary to obtain mineral N fertilizers and 
the large greenhouse gas emission associated with 
oblige to reconsider the sustainability of the use of 
mineral N in agriculture. Moreover, the use of large 
amounts of N in intensive agriculture production sys¬ 
tems has led to important environmental problems 
such as the eutrophication of freshwater [3] and marine 
ecosystems [4], pollution of ground water, and gaseous 
emissions of N oxides and ammonia in the atmosphere 
[5, 6]. In consequence, problems associated with sus¬ 
tainable development, global changes, environment 
protection, and global food security are now 
questioning the efficiency of use of N fertilizers in 
agricultural systems [7]. 

During the last decades, the relatively high prod¬ 
ucts/fertilizer price ratio, incited farmers to adopt an 
insurance strategy in fertilizer management: applying 
excess of N to avoid any restriction in crop N supply 
and then any penalty in crop yield. These practices, 
when they were generalized on large agricultural areas 
led to a progressive increase in soil N surplus accumu¬ 
lation and an elevated risk of N leaching with dramatic 
consequences on ground water quality [8] . Adoption of 
a more restricted strategy for supplying and timing of 
N fertilizers is now a prerequisite for a more sustainable 
agriculture development. But such a strategy is difficult 
to be adopted in practice because of the non- 
predictable variations in weather which determines 
both soil N mineralization and crop growth potential. 
In consequence, a reduction in N application rates to 
avoid surplus of N in soils would increase the proba¬ 
bility of temporary crop N nutrition deficiency, and 
then, as a consequence, an increased risk for a lower 
crop production and then a deterioration of the eco¬ 
nomic outputs the farmers are expecting. Therefore, 
optimizing crop production with the goal of reducing 
environment hazards requires an improvement in the 
understanding of the regulatory mechanisms by which 
crops absorb nitrogen from the soil and use it effi¬ 
ciently for yield component and quality elaboration. 
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But it requires also a better understanding of soil 
microbial processes and the interaction between 
C and N cycles in order to optimize crop N residue 
cycling, soil organic matter dynamics, and to improve 
the capacity to predict soil N supply. 

Fertilization management has to be conceived as 
a mean to match as closely as possible crop N supply 
with crop N demand in order to limit accumulation of 
mineral N within soils, and then emissions to hydro¬ 
sphere and atmosphere. By this way, matching timing 
of N supply with timing of crop N demand appears 
very necessary. It is fundamental to underline here the 
necessity for a high precision for adjustment of 
N supply to N demand: for an intensive wheat crop 
producing about 8 t ha -1 of grain, the total N demand 
is about 240 kgN ha -1 and the loss of only 20 kgN ha -1 
through leaching (i.e., less than 10% of the total 
N demand) can lead to pass over the admissible limit 
of nitrate concentration (50 mg l -1 ) in drainage water! 
Moreover, the loss of only 2-3 kgN ha -1 through emis¬ 
sion of N 2 0 to atmosphere (that corresponds to 1% of 
the total N demand) is considered as highly detrimental 
for greenhouse gas effect. So the problem to face now is 
not to understand how the yield of the different crop 
species can be improved by addition of N fertilizers, but 
(i) to determine the dynamics of crop N demands all 
along the crop development cycle; (ii) to determine the 
timing of the soil N supply according to soil character¬ 
istics, climate, and soil agronomic management; (iii) to 
determine the crop responses to different intensity and 
timing of N nutrition deficiency; and then (iv) to 
manage timing of crop N fertilization using diagnostic 
and decision tools for optimizing trade-off between 
minimizing crop yield reduction and minimizing 
environmental impacts. 

N in Plants 

For most of the crop species, plant life cycle can be 
roughly divided into two main phases: (i) the vegetative 
growth phase, when young developing roots and leaves 
behave as sink organs that absorb and assimilate inor¬ 
ganic nitrogen for amino acid and protein synthesis, 
and (ii) the remobilization phase when senescing tis¬ 
sues start to behave as source organs translocating 
organic molecules to ensure formation of new devel¬ 
oping and/or storage organs [9]. The first phase is 


dominated by the dynamic of leaf area expansion as 
a mean for light capture, and then the role of nitrogen 
in both leaf growth and leaf photosynthesis is capital. 
The second phase is dominated by the development 
of reproductive organs such as seeds, fruits, tubers, 
and bulbs, or by storage organs allowing survival for 
perennial species such as trunks for trees or roots and 
stubbles for herbaceous species like grasses. As 
a consequence, the same N absorbed by the plant can 
be used successively for different functions and then 
analysis of crop response to N nutrition cannot be 
simply reduced to an addition of different elementary 
functions, but as a complex and integrated adaptive 
system with strong interactions among different 
processes. 

In plants, N is required primarily for the synthesis 
of proteins, both structural and enzymatic, as the more 
important components of cells. There is large variation 
in the composition of the different cell types within 
a plant according to the different types of tissues. Cells 
which store carbohydrates or are part of support and 
transport tissues have relatively lower protein and 
nucleic acid concentration than metabolically active 
cells within meristems and photosynthetic parenchyma 
[10]. The relative numbers of each type of cells deter¬ 
mine the organ composition. So organs that have rela¬ 
tively slow metabolism but are specialized in support 
and transport, such as roots and stems, contain little 
protein and thus have small N concentration in dry 
matter. Organs which store starch or other carbohy¬ 
drate, such as grains, fruits, and tubers, have also low 
N concentration. On the contrary, leaves having a high 
metabolic activity through photosynthesis have high 
N concentrations. So, plants are very heterogeneous 
systems in terms of N concentration at every level of 
organization: cells, tissues, and organs. Therefore plant 
N demand, i.e., the quantity of N required for the plant 
to achieve its potential growth and development, 
depends on the morphologic and histologic plant 
composition. 

Leaves are essential organs by which plants capture 
light and assimilate carbon. Light capture, photosyn¬ 
thesis, and the associated respiratory processes require 
a large number of different enzymes, pigments, and pro¬ 
teins [11]. About 50% of the soluble protein of the leaf 
is in RuBPc-o enzyme alone, another 25% is in the light 
harvesting and electron transport components [11]. 
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Thus the chloroplast, as the location of RuBPc-o and 
other photosynthetic enzymes, and according to its 
high protein membrane component, contains the 
major part of the total leaf N. Composition of leaves 
and their N concentration varies with age, development 
stage, and environment. Leaves of cereals have the 
greatest concentration of N just after their full expan¬ 
sion that corresponds to their maximum in photosyn¬ 
thetic activity. After a period the N concentration in the 
leaves decreases in parallel with metabolic activity 
as senescence progresses [12]. Remobilization of 
N-components occurs during the senescence process, 
and then amino acids from proteolysis are transported 
to younger developing leaves, or reproductive organs. 
A consequence of this ageing and senescing processes is 
that older leaves situated in the lower canopy where 
light intensity is limiting for photosynthesis, provide 
N for new leaf production at the top of the canopy. This 
recycling process thus tends to optimize photosynthe¬ 
sis in relation to N supply and light at the level of the 
canopy [13]. 

To achieve large rates of photosynthesis in well- 
illuminated conditions, leaves require a large concen¬ 
tration of RuBPc-o and other N-components and, 
hence, large N concentration [14]. As N supply rises, 
the amount of N per unit area in leaves increases, 
enhancing the rate of C0 2 assimilation [12, 15]. Thus, 
there is a good correlation between C0 2 assimilation 
rate and leaf N content. However, with very large 
N supply, leaf N content may increase without any 
increase of photosynthetic rate, unless larger C0 2 con¬ 
centrations are used [16]. Plants with C 3 photosyn¬ 
thetic pathway contain more N per unit leaf area than 
those with C 4 metabolism. So C 4 crops use N more 
efficiently than C 3 ones [17]. Ample N increases the 
number of chloroplast compared with deficient N. Also 
the density of protein in the stroma is increased by large 
N supply. All these processes contribute to accumulate 
N within leaf tissues as plant N supply increases. 

Although the primary role of RuBPc-o in plants is 
to assimilate C0 2 , a storage function for N has been 
mooted for this enzyme [18, 19]. Thus, N of RuBPc-o 
have two successive functions within plants: C0 2 
assimilation and source of N supply for reproductive 
and storage organs. Storage of N within the plants for 
further reuse can be considered as an adaptive mecha¬ 
nism for buffering plant N nutrition in highly variable 


soil N supply conditions. In this way, plants can store 
N reserves during periods of vegetative growth and 
large soil N availability, and then to reuse these 
N compounds at the end of their life cycle when min¬ 
eral soil N is exhausted. Accumulation of RuBPc-o in 
excess within leaf tissues is not the only way for plant to 
store N. Accumulation of vegetative storage proteins 
(VSP) within different plant tissues or organs is 
a complementary way for storing N reserves, as dem¬ 
onstrated in alfalfa [20] and in perennial grasses [21]. 
For perennial plants, the storage of N reserves within 
perennial organs such as roots, rhizomes, tubers, or 
trunks is a prerequisite for regrowth after defoliation 
or after winter. 

Due to the importance of light capture and photo¬ 
synthesis for plant growth, much attention has been 
paid to allocation of N to leaves within canopies. How¬ 
ever, allocation of N to other vegetative tissues is also 
quantitatively important. Green leaf N content repre¬ 
sents only 53% of total shoot N in a lucerne crop [22], 
and only 30% of shoot N in a wheat crop at the 
beginning of the grain filling period [23]. Lemaire 
et al. [24] showed that for a large number of crop 
species there exists trade-off between accumulation of 
N in leaves for optimizing crop photosynthesis and 
accumulation of N within stem tissues for optimizing 
the plant architecture development and the elaboration 
of reproductive organs. 

N compounds as amino acid and proteins are used 
within meristematic tissues for initiation and expan¬ 
sion of new organs: leaves, stem internodes, tillers, and 
branches and roots, and also inflorescences, fruits, and 
seeds. The use of N for leaf area expansion during 
vegetative growth period is of first importance because 
it determines the dynamics of LAI expansion of the 
crop and then the capacity of the crop to intercept 
light and to accumulate biomass. The rate at which 
N is supplied to meristems greatly determines cell pro¬ 
duction rate while the final cell size is only little affected 
[25-27]. As cells enlarge in size, their N concentration 
decreases through a dilution by accumulation of 
greater quantities of N-free compounds until final size 
is reached. So any limitation of N supply to meristem 
tissues leads to a reduction in the cell production rate 
and in the size of the final organ produced. For repro¬ 
ductive meristems, shortage of N reduces more the 
number of grains than the grain size [28-30]. 
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When analyzed at the level of a plant population, it 
has been clearly demonstrated that N is allocated to 
individual plants according to their own contribution 
to interception of light [31]. N resources are allocated 
preferentially to dominant plants and then stressed 
plants cannot respond to N supply because of the lack 
of light. In consequence, competition for light among 
individual plants within a dense canopy determines 
greatly competition for N resources. 

In conclusion, plants and canopies are very hetero¬ 
geneous systems in terms of N content, concentration, 
and repartition. This heterogeneity can be analyzed at 
different levels of organization: organite, cell, tissue, 
organ, whole plant, and plant population. N com¬ 
pounds, mainly as proteins are involved in all meta¬ 
bolic processes and then they are used successively for 
different functions such as cell division, organ growth 
and development, light capture, photosynthesis and 
respiration, reserve formation and recycling. Once 
absorbed by the plant, N is used several times through 
internal recycling processes and reallocation to the 
different plant parts. So analysis of plant N nutrition 
is very complex because it involves several inter¬ 
dependent metabolic functions and due to constant 
feedback mechanisms, it can be studied only through 
a dynamic approach. 

Crop N Demand 

Crop N demand (N expressed in kg ha -1 ) at any time of 
the crop cycle can be defined as the result of critical 
crop mass (Wc) that is the maximum crop mass attain¬ 
able in a given environment without any limitation of 
nutrients and its critical plant N concentration % 
Nc [32]: 

N = %NcWc (1) 

The critical plant N concentration (%Nc) has been 
defined as the minimum plant N concentration 
corresponding to maximum crop mass [33]. This con¬ 
cept can be applied in dynamic terms, such that the 
daily crop N demand (or critical N uptake rate 
expressed in kg ha -1 day -1 ) is the quantity of N the 
crop has to absorb each day to maintain its potential 
growth rate over a given period of time. 

Many studies conducted on a large range of crop 
species (see [34] for a review) have shown that the 


critical plant N concentration (%Nc) is regularly 
decreasing as crop mass accumulates during the crop 
growth period. Moreover, it has been shown that this 
dynamic of decline in %Nc could be represented by 
a unique and constant relationship whatever the 
conditions and the genotypes for a same species [35]: 

%Nc = a(Wc) -b (2) 

Coefficient a represents the critical plant 
N concentration for Wc = It ha -1 . Coefficient b is 
dimensionless, and it represents the dynamic of the 
dilution of N within the dry matter. 

Mixing Eqs. 1 and 2 allows the expression of the 
dynamics of the critical crop N uptake, i.e., the crop 
N demand, in relation with the dynamics of potential 
crop biomass accumulation: 

Nc = a'(Wc) 1 ~ b (3) 

where coefficient d is the crop N demand (or the 
critical N uptake) for a potential crop mass accumula¬ 
tion of 1 t ha -1 . Value of d is equal to 10 a, when a is 
expressed in % and d is expressed in kg ha -1 . 

Values of a (and d) and b have been established for 
the main cultivated species according to the method 
developed on wheat by Justes et al. [36] . This method as 
illustrated on Fig. 1 requires a family of response curves 
of plant N concentration (%N) vs crop mass (W) 
across increasing N supply rates. For each response 
curve, the critical plant N concentration (%Nc) is esti¬ 
mated as the intersection point between the oblique 
line representing the response of both %N and W to 
increased rate of N supply, and the vertical line 
representing the increase in %N with further 
N supply rates once the maximum crop mass (Wc) is 
reached that correspond to luxury N accumulation. As 
the determination of %Nc is done at different stages of 
growth for a large range of value of W, then it is possible 
to fit the series of %Nc-Wc data points for getting the 
critical N dilution curve and to calculate the value of 
coefficients a and b. 

These values are presented in Table 1. For a given 
species, coefficients a (and d) and b remained constant 
whatever the climatic conditions (years and locations). 
Moreover, Femaire et al. [37] showed that for wheat, 
maize, canola, sorghum, and sunflower the same value 
of coefficients holds either in temperate or in 
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Crop Responses to Nitrogen. Figure 1 

Determination of the critical plant nitrogen (N) concentration of maize crops as the intersection point between the oblique 
line corresponding to the response of both plant nitrogen concentration (%N) and crop mass (I/I/) to increasing 
nitrogen supply rates, and the vertical line corresponding to the increases in plant N concentration without corresponding 
increase in crop mass that corresponds to luxury nitrogen accumulation [46] 


subtropical conditions. Comparison among species 
allows a clear distinction between C 3 and C 4 groups 
for the value of coefficient a (and a') while coefficient b 
is unaffected by the type of the metabolic pathway. 
Within each of the metabolic groups, it is difficult to 
establish clear differences between species because of 
the high correlation between the coefficient a (or a') 
and b. Then the fitted curves corresponding to Eq. 2 for 
the different crop species are relatively close to each 
other [38]. 

The derivative of Eq. 3 allows the expression of the 
crop N demand in dynamic terms: 


A theory has been developed for explaining 
the close relationship between crop N demand and 
crop biomass accumulation rate [33, 35]. The hypoth¬ 
esis is that plant mass W is composed of two compart¬ 
ments: Wm, the metabolic tissues involved directly in 
plant growth processes (photosynthesis and meristem 
activity) with a high N concentration %Nm; 
and Ws, the structural tissues involved in plant archi¬ 
tecture and transport with a low N concentration %Ns. 
Then: 

W = Wm + Ws (5) 


d N 
d t 


dNdW , h 



(4) 


Equation 4 shows that the daily crop N demand 
follows the daily crop growth rate, but for a similar 
daily crop growth rate, the daily crop N demand 
declines as the crop mass increases. 


and: 

%N = — (%NmWm + %NsWs) (6) 

Supposing that Wm increases allometrically with 
W then: 
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Crop Responses to Nitrogen. Table 1 Values of coeffi¬ 
cient a' and b of Eq. 3, N = cffWc) 1-6 , for different crop 
species 


Crop species 

a' (kgN 
ha ’) 

b 

(dimensionless) 

References 

Temperate 
grasses (C 3 ) 

48 

0.32 

[39, 40] 

Alfalfa (C 3 ) 

48 

0.33 

[41] 

Pea (C 3 ) 

51 

0.32 

[42] 

Wheat (C 3 ) 

53 

0.44 

[36] 

Canola (C 3 ) 

45 

0.25 

[43] 

Rice (C 3 ) 

52 

0.52 

[44] 

Tomato (C 3 ) 

45 

0.33 

[45] 

Maize (C 4 ) 

34 

0.37 

[46] 

Sorghum (C 4 ) 

39 

0.39 

[47] 

Tropical 
grasses (C 4 ) 

36 

0.34 

[48] 


Wm = kW a (7) 

and then: 

%N = Jfc(%Nm - %Ns) W a_1 + %Ns (8) 

This equation is very close to the empirical Eq. 2. 
The main difference is the asymptotic value of %Nc 
which is equal to %Ns and not to zero. But the 
difference is very little for the range of W from 1 to 
15 t ha -1 [35]. 

Following Hardwick [49] assumptions, it can be 
postulated that because Wm is associated mainly with 
photosynthesis it scales for plant area, and then to crop 


Leaf Area Index (LAI): 


Wm = p(LAI) 

(9) 

and then: 


k „ 

LAI = - W“ 

P 

(10) 


So Eqs 3 and 10 indicate that both N uptake and 
LAI are allometrically related to crop mass. For a large 
range of crop species, cultivated either under temperate 
or sub-tropical climate, the hypothesis for a common 
value of coefficients b and a cannot be rejected, but 


then a direct relation of proportionality could be 
established between crop N uptake and crop LAI [37]: 

N = LAI (11) 

/c 

So, at crop level, the dynamic of expansion of LAI is 
driving the crop N demand. 

Lemaire et al. [37] showed that the two coefficients 
b and a evolves in parallel during the time course of the 
crop growth. Just after seedling, and until a crop mass 
of approximately 1.5 t ha -1 is reached the value of b 
and a is high, close to 0.90-0.95 while it decreases 
rapidly to a value of 0.6-0.7 after this stage when 
competition for light between individual plants within 
the canopy is established. 

Lemaire et al. [37] tried to determine among the 
two parameters, crop mass or LAI, who is the funda¬ 
mental driver of crop N demand. They concluded that 
Eq. 3 was the more robust across genotypes and envi¬ 
ronments, but in fact the two Eqs. 3 and 11 each 
represent an incomplete expression of the feedback 
regulation by shoot growth of N absorption capacity 
of the roots and of N partitioning within the canopy as 
described within the above section. A more complex 
analysis of N allocation between leaf and stem and of 
remobilization of N from old leaves would be necessary 
to better capture genotypic differences in N uptake 
capacities. Nevertheless, the robustness of the relation¬ 
ship between critical crop N uptake (Nc) or critical 
crop N concentration (%Nc) with crop mass accumu¬ 
lation (W) across environments and genotypes allows 
the use of these empirical relationships as a base for the 
determination of crop N demand and for diagnosis of 
crop N status. 

Regulation of Crop N Uptake 

The theory developed above indicates that plant 
N uptake seems to be regulated by plant growth itself. 
In steady state N supply conditions, plant N uptake is 
feedback regulated by shoot signals, with a positive 
signal from photosynthetic C supply and a negative 
one from reduced N recirculation to roots [50-52]. 
So the relationship between N uptake and LAI as 
attested by Eq. 11 can be explained by the fact that 
LAI expansion allowing increased light interception 
provides larger C supply to roots, and also increases 
N storage capacities within expanding leaves as 
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RuBPc-o that avoids the depletion of N uptake by 
recirculating reduced N compounds in the phloem. 
This leads to the proportionality between crop 
N uptake and LAI for most of the species, but the slope 
of the relationships, i.e., the N uptake per unit LAI, is 
variable across species according to their morphology 
and, more particularly, their leaf/stem ratio. So leaf area 
expansion is not the only way for plant to store reduced 
N. Stem growth and leaf thickness increase are also 
a possibility for sequestrating N to avoid the repression 
of N uptake capacity of roots. In consequence, a more 
general and stable relationship between N uptake and 
crop mass can be obtained as attested by Eq. 3. Never¬ 
theless, this relationship is not linear because N uptake 
per unit of crop mass decreases as the LAI per unit of 
crop mass, i.e., the leaf area ratio (LAR) of the crop, 
decreases, that determines the N dilution effect. 

In the field, in a variable N supply condition, plant 
N uptake is co-regulated by both crop growth rate 
potential and the soil N availability. Two groups of 
transport systems, with low and high affinity for 
nitrate, operate in plants [53]. Devienne-Barret et al. 
[54] have proposed a model accounting for this co¬ 
regulation of plant N uptake by soil nitrate concentra¬ 
tion and plant growth: 


d N , _ h (d W\ 

— = a c (l - b) W~ (— ) 


Vh 
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K h + C 


v. 


max 
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k l + c 
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Crop Responses to Nitrogen. Figure 2 

Nitrogen (N) Uptake - Crop Mass (1/1/) trajectories for 
different steady states N supply: critical N uptake (Crit.), 
maximum N uptake (Maximum), non-fertilized (Soil), and 
suboptimal N application (Fert.). The two lines represent 
either (i) two growth stages of the same crop, (ii) two crops 
having different growth rate, or (iii) an environmental 
effect. The lines represent the response curves to increased 
N supply 

So whatever the cause of the variation in crop mass 
provided that N supply remains at steady state, any 
increase in crop mass (AW) is accompanied by 
a corresponding increase in crop N uptake (AN). As 
soil N supply increases, plant N uptake increases as 
a consequence of both (i) the increase in soil 
N concentration (C) and (ii) the increase of plant 
growth rate itself. So, at any moment, crop growth 
rate is the consequence of crop N uptake and vice versa. 


where V and K are coefficients of the Michaelis-Menten 
formula and subscripts describe the high (H) and the 
low (L) affinity transport systems for nitrate; C is the 
actual concentration of nitrate in soil solution, Wis the 
crop mass in t ha -1 and b is the allometric coefficient of 
Eq. 3. 

Figure 2 represents the crop N uptake vs crop mass 
trajectories for different steady-state N supply condi¬ 
tions, i.e., C = constant. Among these curves, it is then 
possible to identify the critical N uptake curve as 
defined above. It is also possible to imagine that the 
maximum N uptake curve corresponds to the higher 
quantity of N a crop is able to accumulate at a given 
crop mass. The difference between this curve and the 
critical curve correspond to the quantity of luxury 
N the crop is able to store. 


Diagnosis of Crop N Status: Nitrogen Nutrition 
Index 

The main consequence of the theory developed above is 
that neither the plant N concentration nor the crop 
N uptake per se can indicate unequivocally the crop 
N nutrition status. Eqs. 2 and 3 indicate that these 
two variables have to be interpreted in relation with 
crop mass. 

As shown in Fig. 2, the critical N uptake curve 
separate situations where N supply is limiting crop 
mass accumulation for situations where N is accumu¬ 
lated in excess without any supplemental increase in 
crop mass. For a given situation and at any time course 
of the growth period of the crop characterized by an 
actual crop N uptake (Na) corresponding to an actual 
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crop mass Wa, it is possible to determine a Nitrogen 
Nutrition Index (NNI) as the ratio between Na and the 
critical N uptake, Nc, corresponding to the same crop 
mass provided the critical N uptake of the crop species 
has been determined. NNI can also be determined 
directly from actual plant N concentration and dilution 
curves as illustrated in Fig. 3: 


NNI 


%Na 

%Nc 


(13) 


Values of NNI close to 1 indicate that at the date of 
the determination of Na or %Na the crop were in 
situation of non-limiting N supply. Values more than 
1 indicate a luxury consumption of N. Values less than 
1 indicate an N deficiency, the intensity of which can be 
estimated by the value of the NNI: a value of 0.6 
indicating that crop N availability was only 60% of 
the critical level. Such an index of crop N status has 
been used as a diagnostic tool for analyzing a posteriori 
agronomical data from field experiments or farm 
observations in order to explain variations in yield by 
differences in crop N status [55]. 

Nevertheless, this approach does not take into 
account that the minimum plant N concentration is 
not 0 but is equal to %Ns (Eq. 8), which is the mini¬ 
mum plant N concentration for the plant to stay alive 
as postulated by Angus and Moncur [56]. So a more 
complex nitrogen nutrition index can be calculated: 


NNI' = 


%Na - %Ns 
%Nc - %Ns 


(14) 


This new index, NNI', is therefore physiologically 
more relevant than NNI, but it involves a greater degree 
of uncertainty related to the value of %Ns which has 
not been documented for many crop species. Lemaire 
and Gastal, [35] by using Eq. 8 derived a value of 0.77% 
and 0.82% for %Ns for wheat and maize, respectively. 
So variation in %Ns across crop species could be not 
very important. Therefore, the use of Eq. 2 for deter¬ 
mining crop N status can be recommended owing to its 
simplicity. 

NNI estimates the instantaneous crop N status at 
the period of time when actual plant N concentration 
%Na and actual crop mass Wa are estimated. But under 
changing N supply in the field, it is necessary to deter¬ 
mine NNI several times during the growth cycle. An 
integrated value of NNI can be obtained by the 
weighted mean of NNI during the different growth 
periods, each time interval representing the duration 
in days or degree-days. Lemaire and Gastal [35] showed 
that it was possible to establish a linear relationship 
between the NNI int and the relative biomass accumu¬ 
lation as expressed as the ratio between actual (Wa) and 
potential (Wm) biomass: 


Wa/Wm = FC(NNI int - NNI 0 ) (15) 
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Crop Responses to Nitrogen. Figure 3 

The use of the critical N dilution curve for the determination of the nitrogen nutrition index 
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where K is the response of the crop to increment in its 
average N nutrition status and NNI 0 is the minimum 
crop N status to allow plant growth. This minimum 
theoretically corresponds to %Ns. 

Jeuffroy and Bouchard [57] characterizes the 
N deficiency period of a wheat crop by both its length 
(deficit N duration, (DD) and its intensity (ID) by 
means of the 1-NNI minimum observed value, and 
they calculated an integrated index of crop N status 
by the mean of the product ID x DD = IDD that 
represents about twice the area between the curve of 
NNIi dynamic and the horizontal line NNIi = 1 
(Fig. 4). They showed that IDD explained 96% of the 
variation in grain number of wheat within a large 
experimental sites x years data set. 

So it is clear that NNI is a good basic tool for 
analyzing actual plant status in crops, and then to 
interpret agronomical data in field conditions in 
order to detect if or not plant N deficiency periods 
occurred, with which intensity and timing and to ana¬ 
lyze the consequences on crop growth and the crop 
yield components. But despite its high informative 
value as diagnostic tool for crop N status, NNI is 
difficult to use practically in field conditions and it 
remains more a research than an agronomical manage¬ 
ment tool. NNI determination is time consuming 
because of the necessity to determine crop mass. Then 



Crop Responses to Nitrogen. Figure 4 
Estimation of NNIint from a sequential determination of 
NNIj: NNN int = 1/N ZNNIjrij The intensity of N deficiency (ID) 
is estimated by the lowest value of INNj, and the duration 
of the deficiency (DD) is equal to the number of days 
with NNIj < 1 


it is necessary to use noninvasive and cost-effective 
methods for a rapid and operational determination of 
plant N status, and then to use NNI as a reference for 
calibration of these indirect methods. 

The theory developed above showed that the 
N dilution effect and then the dependency of plant 
N concentration from the crop mass is the result of 
two processes: (i) the decline in plant leaf area ratio 
(LAR) as crop mass increases, and (ii) the preferential 
allocation of N to the well-illuminated upper layer of 
leaves as canopy develops. Therefore, Lemaire et al. 
[58] suggested that while plant N concentration 
declines with crop mass accumulation, the N content 
per unit of leaf area within the upper layer of leaves 
was more stable and would correlate well with the crop 
NNI. Then, it would be possible to use the 
N concentration of the well-illuminated leaf layer as 
an indicator of crop N status independently of the crop 
mass. Farrugia et al. (2004), using this correlation, were 
able to develop a method of diagnostics of grassland 
N status [59] and of maize crop N status [60]. The leaf 
N concentration can be measured directly from leaf 
samplings, but it can also be estimated indirectly by 
noninvasive methods. The leaf color chart (LCC) is an 
easy-to-use and inexpensive diagnostic tool for moni¬ 
toring the relative greenness of a rice leaf as an indicator 
of the leaf and then of the plant N status [61]. The 
estimation of chlorophyll content of leaves by portable 
systems based on leaf transmittance or leaf reflectance 
in specific wave bands is also well correlated with leaf 
N content, and can be a method for crop N status 
diagnosis [62, 63]. But these predictions are in general 
dependant on cultivars and years [64]. Houles et al. 
[65] demonstrated that through remote sensing it was 
possible to estimate both crop LAI and the quantity of 
chlorophyll per unit of soil area, and then the quantity 
of N within the canopy per unit of soil area. Therefore 
it would be possible to recalculate the NNI of the crop. 
Such an estimate can be intensively repeated in space 
and time that allows very precise information on the 
spatiotemporal dynamics of crop N status that is very 
useful for precision agriculture. 

Crop Responses to N Deficiency 

The use of the concept of critical crop N concentration 
and the possibility for diagnosis of the actual crop 
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N status totally reversed the approach of the response 
of crop to N fertilizers. The response of crop to N was 
studied by analyzing the response curve of yield to 
increasing fertilizer rates. 

The difficulty for such an approach is that (i) the 
actual N supply for the crop is not known, because the 
quantity of N provided directly by the soil, and then 
the yield in absence of any fertilization is very variable, 
and (ii) the potential yield, i.e., the yield achievable in 
a given climatic condition when N supply can be con¬ 
sidered as non-limiting is unknown. So, the crop 
N demand and the total crop N supply are unknown 
variables and it is possible to get only the regression 
between increment in yield and increments in 
N fertilization rates. This approach led to provide 
huge families of crop response curves to N fertilizers 
very variable to each other according to soils, climates, 
years, species, and cultivars, and the only way was to 
analyze this variability with statistic approaches with 
any possibility to identify and to quantify processes in 
order to develop prediction models. 

The possibility for diagnosis crop N status allowed 
the identification of situations where crop growth and 
development were not limited by N deficiency, and 
then to develop crop potential growth models as 
resulting from climate conditions. Then using the crit¬ 
ical N uptake concept, it was possible to derive from 
these models the dynamics of crop N demand all along 
the crop cycle. So instead of having the check treatment 
with no N fertilizer and to analyze response of crop to 
increments in N supply, the check treatment is now the 
crop with critical N status, and then the response of the 
crop to the occurrence of periods of N nutrition defi¬ 
ciency of different timing and intensity all along the 
crop growth cycle and the consequences on yield com¬ 
ponents and quality of the yield can be analyzed. 


N Deficiency Effects on Crop Mass Accumulation 

According to Monteith [66], crop mass accumulation is 
linearly related to the quantity of photosynthetic radi¬ 
ation (PAR) intercepted by a crop during its life cycle. 
The slope of this regression can be interpreted as the 
radiation use efficiency (RUE). 

As illustrated in Fig. 5, and as shown by Belanger 
et al. [67] , N deficiency affects both the quantity of PAR 
intercepted by the crop and the efficiency with which 


800 
700 

600 

7 

E 500 

Q) 

% 400 
cc 

o 300 
in 

200 
100 
0 

0 50 100 150 200 250 300 350 

Intercepted PAR MJ m -2 

Crop Responses to Nitrogen. Figure 5 

Relationships between the quantity of photosynthetic 
active radiation {PAR) intercepted by a tall fescue sward 
and the accumulation of aboveground biomass for three 
contrasted N supply rate: NO: no N application; N60:60 kgN 
ha -1 ; N240: 240 kgN ha -1 . The slope of the regression 
represents the radiation use efficiency [67] 



the intercepted radiation is used for biomass elabora¬ 
tion (RUE). These authors, using NNI, have shown that 
in relative terms, RUE was more affected by moderate 
N deficiency (NNI = 0.6-0.8) than the quantity of PAR 
intercepted while for more severe deficiency, the 
response of the two variables converged. This type of 
response has been confirmed on sunflower [68] and on 
sorghum and maize [69]. LAI expansion seems a little 
bit more responsive to N deficiency than RUE: for an 
NNI of 0.6, RUE was reduced by 30% while LAI was 
reduced by 40% (Fig. 6). 

RUE is an integrated variable accounting for pho¬ 
tosynthesis and respiration, but also the allocation of 
assimilates to root [70]. These authors showed that, in 
fact, canopy gross photosynthesis was less affected by 
N deficiency than the accumulation of total biomass 
reflecting higher respiration losses in N-deficient crops, 
and the accumulation of shoot biomass was more 
affected than the accumulation of total biomass 
reflecting then an important increase of allocation of 
biomass to roots in N deficient situations. The lower 
shoot/root ratio in N deficient crops is widely 
documented [71, 72]. In fact this increased allocation 
of assimilate to roots is the consequence of the lower 
activity of shoot meristems (leaf and stem extension) 
that allows a greater quantity of carbon to be used for 
root growth. So the more sensible plant growth process 
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Crop Responses to Nitrogen. Figure 6 

Effects of crop N status determined by the nitrogen 
nutrition index (A/A//) of tall fescue swards receiving 
different N application rates and (i) the relative quantity of 
intercepted PAR, PAR act /PAR max (•), (ii) the relative 
Radiation Use Efficiency, RUE act /RUE max (■), and the 
relative LAI, LAI act /LAI max (A). (Redrawn from [67]) 

to N deficiency appears to be the leaf area expansion 
rate, with two major consequences: (i) a reduction 
in the dynamic of light interception and then in 
the canopy photosynthesis and crop C supply and 
(ii) a preferential allocation of C for root growth that 
contributes to increase root foraging activity for a fur¬ 
ther increase in N uptake capacity. 

The response of leaf area of plants and canopies to 
N deficiency is brought about by a decline in the 
expansion rate and size of individual leaves combined 
with reduction of branching or tillering. The accumu¬ 
lation of nonstructural carbohydrates in N-deficient 
leaves indicates that C supply is not the cause of the 
leaf area expansion under N deficiency [38]. 

N deficiency decrease the rates of cell division and 
cell expansion with little effect on final cell length [26], 
so the reduction of leaf size in N-deficient plants is 
mainly due to a reduction in cell number. Gastal et al. 
[73] proposed a quantitative relationship between the 
leaf elongation rate (LER) of tall fescue and the NNI of 
the sward: 

LERactual = 1.39 - 1.9e -L49NNI (16) 

LER critica i 

Subscript “actual” corresponds to any suboptimal 
N condition, and subscript “critical” refers to a non¬ 
limiting N nutrition. For a severe limitation in 


N nutrition, NNI = 0.4, LER is then reduced to about 
30% of its maximum value in non-limiting conditions. 
That demonstrates the high responsiveness of leaf 
expansion to N deficiency. 

Lemaire et al. [24] used the relationship between 
LAI and crop mass (W) of Eq. 10 for studying the 
different types of response of crop species to 
N deficiency. This approach allows the separation of 
the reduction of LAI being directly associated with the 
reduction in crop mass (i.e., simply a crop size effect) 
from any specific reduction of LAI at similar crop mass 
(i.e., a modification of plant architecture through 
a reduction of leaf area ratio: LAR). They demonstrated 
that under a similar intensity of N deficiency as esti¬ 
mated through NNI, the reduction of LAI of maize is 
totally allometrically related to its reduction in crop 
mass (crop size effect), while for wheat N deficiency 
provokes in supplement a reduction of LAI at same 
crop mass, i.e., a reduction in LAR. These two types 
of responses represented two extremes and tall fescue 
behaves like maize while canola behaves like wheat; 
sorghum and sunflower having intermediate responses 
[24]. Hence classification of crops in either metabolic 
(C 3 vs C 4 ) or botanical (monocots vs dicots) groups 
does not correspond to any particular response type. 
In response to N deficiency, some species such as maize 
or tall fescue tend to maximize light interception by 
minimizing the reduction in LAI, at the expanse of 
N concentration per unit leaf area, and then leaf pho¬ 
tosynthesis, while other species such as wheat and 
canola tend to develop an inverse response. Do these 
two opposite strategies mean that there exists a trade¬ 
off between photosynthesis per unit of LAI and leaf 
area expansion as already proposed by Sinclair and 
Horie [74]? 

N Deficiency Effects on Leaf and Canopy 
Photosynthesis 

The response of leaf photosynthesis to irradiance 
largely depends on the leaf N content. Leaf photosyn¬ 
thesis at saturating light intensity (A max ) increases 
asymptotically with leaf N content [75]. This relation¬ 
ship shows a positive intercept on the leaf N content 
axis, indicating that when leaf photosynthesis rate 
becomes zero, leaves would still contain significant 
amount of N, corresponding the structural leaf 
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N (N s ) of Eq. 8. The variation in the A max /SLN (leaf 
N content per unit leaf area) relationship seems rela¬ 
tively limited among cultivated species of the same 
metabolic group [10, 76], despite Sinclair and Horie 
[74] showed a lower A max at similar SLN for soybean. 
This variation among species could be due to (i) dif¬ 
ferences in nitrogen costs of PEP-carboxylase and 
RuBPc-o and the relative amounts of these two 
enzymes in leaves, and/or (ii) the possible accumula¬ 
tion of vegetative storage proteins within leaves of 
legume species such as soybean. 

There is a large discussion whether A max has to be 
related to either leaf N content per unit mass or per unit 
leaf area [75]. Lemaire and Gastal [32] indicated that 
none of these two relationships are completely right. 
The two components of photosynthesis, light 
harvesting by chlorophyll and C0 2 reduction by 
RuBPc-o, are affected by leaf N status, and these two 
processes have to be expressed on a leaf tissue volume 
basis, and not only on an area basis. Then leaf thickness 
is an important parameter to take into account. Spe¬ 
cific leaf area (SLA) is the parameter allowing corre¬ 
spondence between leaf N content per unit area and per 
unit dry matter basis. But relationship between leaf 
thickness and SLA is weak because of the variations of 
nonstructural carbohydrate content within leaves. 

The quantum efficiency that is the response of leaf 
photosynthesis to light at low irradiance is only little 
affected by N deficiency [77, 78]. Moreover, the dark 
respiration of leaves seems to increase with increasing 
leaf N [78]. Hence, as the leaves are progressively 
shaded within the canopy the effect of N deficiency 
on leaf net photosynthesis becomes lower and lower 
and then negligible. 

Gastal and Belanger [77] showed that canopy gross 
photosynthesis of a tall fescue sward at high irradiance 
(CGP max ) only responds smoothly to N deficiency: 
a reduction in NNI from 1 to 0.4 reduced the relative 
CGP max from 1 to 0.6 only. This low responsiveness of 
canopy photosynthesis to N deficiency is due to the fact 
that as canopy develops, a greater number of leaves are 
shaded and then their photosynthesis does not respond 
to N shortage. When irradiance becomes more limited, 
the responsiveness of canopy photosynthesis to 
N deficiency becomes more limited. So when canopy 
photosynthesis is integrated over day and for a long 
period where crop LAI is high, the response of canopy 


photosynthesis to N deficiency appears relatively 
limited, which explains the limited influence of 
N deficiency on RUE. 

N deficiency Effects on Harvest Index and 
Components of Grain Yield 

Lor grain crops, yield is closely related to grain number 
per unit area of soil. The elaboration of grains depends 
on flows of C and N compounds to reproductive 
meristems during a narrow window period around 
flowering, anthesis, and very initial grain development, 
and also on environmental conditions (temperature, 
radiation, and water stress) during this period. This 
critical period coincides with the maximum rate of 
crop N uptake [79]. So any limitation of crop growth 
rate at this period by N deficiency decreases grain 
number and then grain yield on cereals such as maize 
[28], wheat [29, 80]. Jeuffroy and Bouchard [57] 
established for wheat a relationship between grain 
number and the severity and duration of the 
N deficiency before anthesis as calculated by NNI 
method. The effect of N deficiency on grain 
number is the consequence of two simultaneous effects: 
(i) a lower crop growth rate at anthesis restricting 
C supply to spikelets and then a spikelet abortion 
[81]; and (ii) a decrease in the N content in the 
spike stems [29] corresponding to a direct effect of 
N deficiency on floret fertility. 

The other grain yield component, grain weight, is 
generally less affected by N deficiency at anthesis than 
the grain number [47]. However, it is necessary to take 
into account the negative correlation between grain 
weight and grain number: a reduced grain number 
resulting from pre-anthesis N deficiency can lead to 
a more favorable source:sink ration during grain filling 
period and then to an unaffected grain weight. Grain 
filling in both carbohydrates and proteins depends on 
(i) the recycling of C and N compounds previously 
accumulated within vegetative organs during pre- 
anthesis growth period, and (ii) post-anthesis photo¬ 
synthesis and N absorption. The relative importance of 
these two components depends on plant species and 
their capacities to store C and N compounds in their 
vegetative organs. So some species such as wheat or rice 
are able to develop large LAI [7-9] and then to store 
large quantities of N, then they are able to feed 80-90% 
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of the N demand for grain development by recycling 
N stored within vegetative biomass, while other species 
such as maize, because they develop less LAI [4, 5] 
with lower N content because they have C 4 metabolism, 
are obliged to feed 40-60% of their grain N demand 
through post-anthesis N absorption. As a consequence, 
crop species like maize are more susceptible to terminal 
soil N shortage than crop species like wheat or rice. 

Delaying leaf senescence for species like maize or 
sorghum should allow these species to continue to 
maintain high N absorption rate after anthesis. The 
supplement of carbohydrates allocated to roots permit¬ 
ted by this delayed leaf senescence allows the mainte¬ 
nance of root absorption capacity [82]. So using 
stay-green genotypes in low N supply conditions 
seems to be beneficial for both sorghum [83] and 
maize [84]. 

Nitrogen Use Efficiency in Crops 

From an agronomic point of view, nitrogen use effi¬ 
ciency (NUE) of a crop represents its capacity to pro¬ 
duce a supplement of yield (dY) for each added unit of 
N fertilizer (dNf) that corresponds to the derivative of 
the crop yield response curve to the rate of N supply: 
Y = f(Nf). As this response curve is asymptotic, NUE 
generally declines with the higher rate of N supply, 
indicating that the first N unit applied is more effi¬ 
ciently converted into yield than later applications. 
According to the crop species types, Y can represent 
either the aboveground biomass, as for forage crops, or 
the grain part as for cereals, grain legumes, or oil seed 
crops. 

Such a global approach does not allow a clear mech¬ 
anistic analysis of the physiological traits controlling 
NUE between different species and cultivars because 
crops respond to total N supply (Nt) that include 
fertilizer supply (Nf) and soil N mineralization (Ns). 
So according to variations in Ns due both to soil and 
climatic conditions and to previous crop management, 
different value of Nt can be obtained with the same Nf, 
leading to large differences in NUE. For this reason, 
Moll et al. [85] propose to define NUE as the yield 
produced per unit of N available in the soil, considering 
two components: 

1. The Nitrogen Absorption Efficiency (NAE) which 

measures the ability of a crop to uptake N from soil 


2. The nitrogen conversion efficiency (NCE) which 

measures the ability of a crop to use absorbed 

N for dry matter and grain production 

NUE = (NAE) (NCE) (17) 

Nitrogen Absorption Efficiency 

Genotypic differences in N uptake at different levels of 
N supply have been shown in rice [86] , wheat [87] , and 
maize [88]. Modern cultivars have a higher N uptake 
capacity because they have a higher biomass produc¬ 
tion. This effect is accounted by Eq. (x) as it shows that 
the crop N uptake rate increases with crop growth rate. 
So any factor enhancing the potential crop growth rate, 
genotype or environment, increase de facto the 
N uptake capacity of the crop. More interesting from 
a plant breeding point of view would be to increase the 
N uptake capacity of crop at similar crop biomass 
production. Lemaire and Gastal [32] reported data 
comparing N uptake of tall fescue and cocksfoot swards 
at similar biomass. These data show that cocksfoot 
had a higher N uptake capacity than tall fescue 
under a sub-limiting N supply condition, while the 
two species had similar N uptake capacity under non¬ 
limiting N supply. Similarly, Lemaire et al. [89] showed 
that grain sorghum had a higher N uptake capacity 
than maize under limiting N supply, while the two 
species had similar N uptake under non-limiting 
N supply. 

The fact that N uptake capacity and then nitrogen 
absorption efficiency (NAE) does not vary too much 
among crop species is due to the fact that the critical 
N uptake curve, as described by Eq. 2 is not very 
different among species of the same metabolic group 
(C 3 vs C 4 ). Then differences between crop species in 
NAE under non-limiting N supply conditions would 
reflect their differences in their potential biomass 
accumulation according to their respective metabolic 
group. Under limiting N supply conditions, clear dif¬ 
ferences emerge among species. As for cocksfoot vs tall 
fescue, sorghum appears to have a greater root length 
density than maize. This would confer these species 
a better capacity of interception of soil mineral N. 
Soil N recovery that determines NAE is the result of 
the nitrogen balance between crop N uptake rate, 
immobilization by soil microbial communities and 
losses through leaching, denitrification, and 
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volatilization [90]. Root architecture and biochemical 
composition, and perhaps root exudates could play 
an important role in this balance, and then on NAE. 
So because of a permanent turnover of N in soil, 
plants having a dense root system are able to compete 
more efficiently against microbes for capturing min¬ 
eral N when the principal source of N is the minerali¬ 
zation of organic matter, i.e., in limiting N supply 
conditions. In non-limiting N supply conditions, 
there is ample mineral N in soil, the immobilization 
capacity of microbial community is saturated and 
then the differences among species tend to disappear. 
Only the difference in N uptake capacity linked to 
difference in crop growth capacity can then be 
observed. 

Some coincidences between quantitative trait loci 
(QTL) for root architecture and NAE have been 
detected on wheat [91]. A large genotypic variability 
has been identified across maize lines for the density 
and length of lateral roots [92] , suggesting that it would 
be possible to breed this species for improving its NAE 
under low N supply conditions. Similarly, genetic stud¬ 
ies showed that NAE was the most important compo¬ 
nent of the Nitrogen use Efficiency in rice [93] and 
wheat [87]. 


Nitrogen Conversion Efficiency 

The efficiency of conversion of absorbed N by crop into 
yield has to be analyzed at two levels: 

- The efficiency for crop biomass production (W), 
i.e., the supplement of biomass (dW) associated to 
the supplement of N uptake (dN) when N supply 
increases 

- The harvest index HI, i.e., the proportion of crop 
mass allocated to grain 

Figure 7 allows a more detailed analysis of the NCE 
for crop mass production. This figure represents two 
N uptake vs crop mass (IV) trajectories for crops at two 
level of N supply: a limiting N supply rate and a non- 
limiting one corresponding to critical N status. At each 
date the slope of the line joining data points 
corresponding to the two N supply rates is equal to 
AN upt /AW which is the reciprocal of NCE. The sup¬ 
plement of nitrogen taken up by the crop, AN upt , due to 
the increase in N supply rate, can be defined as the sum 
of two components (i) (AN^t)^ the quantity of N the 
limiting N supply treatment should have been absorb 
in supplement for reaching the corresponding critical 
level, and (ii) (AN upt ) 2 , the supplemental increase in 
N uptake associated to the accelerated growth rate 
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Crop Responses to Nitrogen. Figure 7 

Schematic representation of the components of the nitrogen conversion efficiency. The first component A(N upt )1 
corresponds to the quantity of N necessary for crop to reach its critical N status, and its NCE is 0. The second component 
A(N upt )2 corresponds to the quantity of N necessary for the synthesis the supplement of biomass AW and whose the 
NCE is dW/dN = l/ubfW) 1 5 that increases with crop mass 
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resulting to the higher N supply level. The NCE of 
(ANupJi is 0 as the increment in N uptake is made at 
a constant crop mass. The NCE of (AN upt ) 2 is not 
constant and can be approached only for a given crop 
mass by the derivative of Eq. 3: 

= ab(W) b ~ 1 (18) 

This equation indicates that the efficiency of con¬ 
version of N absorbed into crop mass dW7dN upt 
increases as crop mass increases. Such an analysis 
allows the identification of two different sources of 
variation for NCE: (i) when W is increased only by 
other factors than N supply, climate and/or genotype, 
then NCE increases with crop mass, and (ii) when W is 
increased by N supply, then NCE is lowered by the cost 
in N uptake for restoring the plant N status. This 
approach allows the study of NCE as a dynamic process 
where the time has to be explicitly taken into account 
through crop growth rate: the higher crop growth rate, 
the higher NCE of the crop. Then it allows the identi¬ 
fication of the trivial effect of plant mass per se: a small 
crop, either because genetically small or because of 
unfavorable environmental conditions will have 
a lower NCE as compared to a bigger crop. This reason 
explains why a good correlation is observed between 
genotypes growing either in high or low N supply con¬ 
ditions [88]. In consequence, breeding for a high NCE 
must lead to the selection of genotypes to higher 
growth potential. There is no clear indication until 
now if NCE variation among genotypes would persist 
when comparisons are made at same crop mass. Since 
coefficients a and b of Eq. 2 differ little or not at all 
between species within C 3 or C 4 groups, the chances to 
find intraspecific differences in NCE when comparing 
plants having a similar growth potential remains very 
limited. 

When considering yield (Y) and not only crop 
biomass (W)> NCE can be highly variable among crop 
species as resulting to large genotypic variations in 
harvest index (HI). For grain crops as cereals, grain 
legumes, and oil seed species, HI is an important source 
of variation in grain yield per unit of N uptake. The 
repartition of N between the harvested and non- 
harvested plant parts that is the nitrogen harvested 
index (NHI) is then an important aspect of the grain 
nutritional quality [95]. Grain produced per unit of 


N uptake and grain N concentration are in general 
inversely related [96], and a variable proportion of 
the variation in yield per unit N uptake is accounted 
for by grain protein concentration or by NHI, 
according to crop type. 

Grain protein N concentration is the result of two 
concomitant processes: (i) the rate of accumulation of 
proteins during grain filling and (ii) the rate of accu¬ 
mulation of free-N compounds (carbohydrates) within 
grains that lead to a dilution of N as grains develop. So 
a low grain N concentration can result in both a N 
deficiency during grain filling period and a large accu¬ 
mulation rate of starch. Thus, late application of 
N fertilizer to avoid any N shortage during grain filling 
can lead to an increase of the grain protein concentra¬ 
tion. But as the capacity of roots for N absorption at 
this stage of the crop is largely impaired by the begin¬ 
ning of leaf senescence and the shortage of C allocated 
to roots, the recovery of this N application is low, and 
then, large mineral N residues in soil increase the risk 
for subsequent N leaching. 

Conclusion 

Since soil N availability, N uptake and distribution 
within plant and crop and finally crop growth are 
permanently interrelated during crop development 
and growth, the traditional view where crop N uptake 
was totally regulated by soil N supply must be 
reconsidered and replaced by a more dynamic 
approach where plant N uptake rate at any moment is 
co-regulated by both soil N supply and plant growth 
capacity itself. This co-regulation lead to the concept of 
critical plant N concentration and N dilution that link 
plant and crop N status to plant and crop mass. Such an 
approach allows the determination of the dynamics of 
plant N demand and plant N status all along their life- 
cycle. Critical N-curves are now available for most of 
the crop species growing either in temperate or tropical 
conditions. This allows the determination of the nitro¬ 
gen nutrition index of any crop in any condition for 
diagnosis of its actual N status and estimating the 
necessity for applying N fertilizers. By this way, it 
would be possible now to adjust more precisely the 
quantity and the timing of N fertilizer supply for 
matching the crop N demand according to target 
yields. As a consequence, a reduction of the risk for 
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N losses to atmosphere and hydrosphere should be 
obtained while the crop productivity would be 
maintained. 

One other perspective is to improve the efficiency of 
use of N within agro-systems. It appears that improv¬ 
ing the ability of crops to absorb and accumulate 
N efficiently from soil (nitrogen absorption efficiency) 
is the first objective for high nitrogen use efficiency in 
cropping systems. This ability of plants and crops to 
take up mineral N from soil has to be investigated both 
at high or low N supply conditions. As demonstrated 
above, in both conditions, N uptake capacity of a crop 
is directly dependent upon its growth capacity as deter¬ 
mined by (i) its own genetic potential, (ii) environ¬ 
mental conditions such as soil and climate, (iii) 
cropping management techniques, and (iv) interactions 
between these variables. So any improvement in crop 
growth capacity by both breeding cultivars and crop 
management (irrigation, P, K, S fertilization, planting 
density) will increase NAE of the crop and then will 
reduce the risk of accumulation of soil N mineral sur¬ 
plus with environmental hazards. But more important 
would be to improve NAE of crop species at similar crop 
growth potential. This would allow achieving a given 
target yield with less N fertilizer. Such an objective 
requires a more efficient root system to increase the 
competitive ability of the plant for using soil mineral 
N against microbial communities. This objective could 
be reached through a combination of plant breeding 
strategy and crop management techniques such as soil 
tillage and soil structure conservation. 

Future Directions 

The improvement of the ability of plants and crops to 
absorb and accumulate N efficiently from soil, that is 
measured by NAE, appears to be the first objective for 
a plant breeding strategy. As demonstrated above, the 
N uptake capacity of a crop primarily depends on its 
growth potential under given climatic conditions. So 
adaptation of genotypes for fast growth potential is the 
first way for improving its N uptake capacity. However, 
it should be also possible to increase the plant N uptake 
capacity at similar growth potential. By this way, it 
could be possible to increase the quantity of N a crop 
is able to uptake from soil in low N supply conditions 
and then to reduce the quantity of N fertilizers 


necessary to obtain a given target yield. Under low 
N supply conditions, root development and architec¬ 
ture as well as the interactions with rhizosphere 
through root exudates may be of major importance 
for the determination of the N uptake capacity of 
crops. Under high N supply conditions, it is the down 
regulation of root absorption capacities by shoot sig¬ 
nals which determine the crop N uptake capacity. So 
for breeding genotypes with higher N uptake capacities 
both in low and in high N supply conditions, it is 
necessary to analyze more deeply all the regulation 
processes of N absorption at physiological and molec¬ 
ular levels. This kind of approach should be one of the 
major tasks in the next decade. The development of 
large-scale genomic, proteomic, and metabolomic 
studies is necessary for such an objective. However, 
the difficulty will be to integrate the huge amount of 
data generated by these studies. All elementary pro¬ 
cesses studied at cellular or molecular level have to be 
scaled up to the level of whole plant and crop where 
they are agronomically relevant. The nitrogen nutrition 
of crops and its efficiency for yield production is con¬ 
trolled by elementary physiological processes such as 
nitrate or ammonium transport through cell mem¬ 
branes in roots, nitrate reduction, ammonium assimi¬ 
lation, and protein synthesis. Each of these metabolic 
processes is regulated at molecular level, and then is 
susceptible to have more or less genetic variability. But 
when all processes are scaled up to whole plant or crop 
level, a large part of this elementary variability is buff¬ 
ered because of existing trade-off among elementary 
processes and feedback mechanisms. Quantitative 
genetics using both mutants and genetic engineering 
appears to be one of the most promising tools to allow 
the identification of key regulatory genes that are likely 
to control a variety of physiological and developmental 
processes involved in the determination of crop nitro¬ 
gen absorption efficiency. But this approach must be 
always accompanied by a modeling approach in order 
to analyze all the feedback and trade-off at whole plant 
and crop level. 

Breeding plants to increase nitrogen conversion 
efficiency for aboveground dry matter production 
appears to be more difficult because, apart from the 
difference between C 4 and C 3 groups, no clear inter¬ 
specific differences are observed among cultivated 
crops. So it seems that intraspecific variability should 
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be very low. However, if the N efficiency for grain yield 
is considered, it should be possible to detect inter- and 
intraspecific differences. Detailed analysis of grain 
development and grain-filling processes should pro¬ 
vide information about the genetic variability. The 
development of models describing both N partitioning 
and translocation within plant, and dynamics of 
grain development, taking into account the coupling 
between N and C fluxes between plant organs are nec¬ 
essary for detecting the key regulation processes for 
optimizing grain yield and quality. This approach will 
require a cooperative and integrated effort between 
plant molecular physiologists, geneticists, crop physi¬ 
ologists, and agronomists and the intensive use of 
bioinformatics. 
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Article Outline 

Transgenic Crops and Molecular Breeding 
Crop Physiology and Agronomy 

Population growth in the coming decades will put 
severe pressure on human food, animal feed, and 
fiber production. Bioenergy applications are already 
exerting increasing pressure on agricultural commod¬ 
ities and land use with severe economic consequences, 
particularly in the developing world. Any crop pro¬ 
ductivity increases must come necessarily from 
enhancing crop performance as further land expan¬ 
sion for agriculture is unlikely to take place. Environ¬ 
mental sustainability and social justice issues are 
becoming increasingly key elements in debates on 
how to assure adequate food for the ever-increasing 
global population. It is likely that the efficiency 
of increasing productivity would benefit by 
complementing the conventional empirical 
approaches with opportunities presented by the devel¬ 
opment of new knowledge and technologies in the 
field of Crop Science. Therefore, in this section a 
number of seminal articles by world experts in the 
field are featured. Through this collection of articles 
key advances in the field are highlighted and the 
reader is pointed to future directions in terms of 
opportunities and constraints for a more productive 
and sustainable agriculture. This is not meant to be an 
exhaustive list of topics; rather the aim is to highlight 
particular technologies and topics that have a real 
potential to make a substantial contribution to 
a more productive and environmentally friendlier 
agriculture in the short to medium term. This section 
comprises 45 articles that can be divided into two 
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parts: (a) molecular approaches and (b) crop physiol¬ 
ogy and agronomy. 

Transgenic Crops and Molecular Breeding 

World food and feed security are increasingly depen¬ 
dent on continuous crop improvement and in partic¬ 
ular the development of crops with increased resistance 
to abiotic stresses (►Abiotic Stress Tolerant Crops: 
Genes, Pathways and Bottlenecks). Plants unlike ani¬ 
mals are not mobile, consequently they need to develop 
strategies to combat natural enemies such as herbivores 
and environmental stresses. Drought, salinity, submer¬ 
gence, and temperature stresses amongst others are all 
important abiotic constraints which limit crop yields. 
Significant advances in our understanding of molecular 
mechanisms underpinning a plant’s ability to combat 
abiotic stresses has resulted in the creation of transgenic 
plants with improved resistance to such stresses. 

Sustainable, renewable resources are those derived 
from biological sources, primarily plant biomass which 
can be regenerated with minimal inputs using energy 
from the sun (► Biomass Crops for Biofuels and Bio¬ 
based Products). Biomass for biofuels includes many 
sources of material derived from agricultural harvests 
including grains, agricultural residues such as stalks 
and leaves, perennial crops such as hay and trees, ani¬ 
mal manures, building waste wood, municipal solid 
waste such as paper, and various food industry wastes. 
Humans currently consume at least 25% more raw 
materials every year than are replaced through biolog¬ 
ical growth. In order to sustain quality of life and 
maintain adequate environmental resources, those 
resources must be balanced and renewable. Pressure 
on those resources has never been greater with the 
world population currently at seven billion people, 
and estimated to plateau at 10.5 billion by 2050. 

Once a gene is introduced into a plant its expression 
may be controlled by a number of different factors 
(►Transgene Expression in Plants, Control of). 
Among these, promoters are most important as they 
regulate expression temporally and spatially. Transcrip¬ 
tional fusions of genes, with the Cauliflower Mosaic 
Virus 35S promoter when integrated into the plant 
genome (mostly dicotyledonous plants), result in 
transgenic plants with high and constitutive expression 
of the transgene. However, 35S-driven transgene 


expression is very variable with ca: 20% of the 
transformants exhibiting high expression levels, while 
the majority (ca: 80%) display an intermediate or low 
and unstable transgene expression. The possible causes 
of variation in transgene expression in a population of 
transgenic plants need to be identified and addressed 
in order to generate useful plants with stable and pre¬ 
dictable levels of expression of introduced transgenes. 

Plant transformation is a fundamental component 
of both basic and applied plant biology (► Crop Plants 
Transformation Methods). For basic research, transfor¬ 
mation allows scientists to study how genes function 
and allows the study of both endogenous genes and 
transgenes. This has increased our understanding of 
how plants grow, develop, and defend themselves 
against pests, diseases, and harsh environments; how 
photosynthesis is controlled; and the basis of primary 
and secondary metabolism. For applied research, trans¬ 
formation can be used to improve the agronomic per¬ 
formance of crops, making them hardier, more 
nutritious, more productive, or converting them from 
conventional crops into green factories producing 
chemical precursors, novel oils, industrial enzymes, 
and pharmaceuticals. Plants have been cultivated for 
food and animal feed, fibers and structural materials, 
and small molecules that can be used as dyes, scents, 
and medicines since the dawn of history, and for the 
same length of time people have sought to improve 
plants by breeding them and selecting the better¬ 
performing and most useful varieties. The limitations 
of this approach, that is, the fact that breeders are 
restricted to the existing gene pool in each group of 
sexually compatible species, and that breeding takes 
a long time to achieve its goals, can be overcome by 
plant transformation, thus accelerating the develop¬ 
ment of plants with novel, beneficial traits. Plant trans¬ 
formation includes both the uptake of naked DNA 
(direct DNA transfer) and the transfer of DNA by the 
conjugation-like method adopted by Agrobacterium 
tumefaciens and A. rhizogenes (Agrobacterium- 
mediated transformation). 

Genetics, molecular biology, genomics, and other 
disciplines have now provided us with an insight to the 
genes that encode important crop traits on which 
humans now depend. Knowledge of the molecular 
genetic basis of valuable crop traits will help provide 
solutions to our requirements for sustainable 
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existence, faced with a growing global population and 
diminishing natural resources (► Crop Traits: Gene 
Isolation). Isolated genes encoding useful traits and 
their use in plant transformation experiments allow 
the development of an in-depth understanding of the 
mechanisms that control such important crop traits. 
Thus, through such investigations that utilize isolated 
genes, one is able to understand and harness traits 
involved in plant domestication such as plant architec¬ 
ture, to those offering solutions for high crop produc¬ 
tion purposes such as growth under unfavorable 
environments or disease outbreaks. Cloning or identi¬ 
fication of the genes involved in crop traits provide an 
insight into factors which determine gene-trait rela¬ 
tionships. Understanding the molecular basis of crop 
traits provides a route to their controlled modulation 
ultimately leading to the development and implemen¬ 
tation of novel genetic engineering solutions to create 
plants with superior and sustainable characteristics. 

Conventional plant breeding practices alone will 
not be able to achieve sustainability in today’s agricul¬ 
ture. Advances in plant genomics research are opening 
up a new era in plant breeding where the linkage of 
genes to specific traits will lead to more efficient and 
predictable breeding programs (► Crop Breeding for 
Sustainable Agriculture, Genomics Interventions in). 
Plant genomics is a rapidly developing field, which is 
radically improving our understanding of plant biol¬ 
ogy by making available novel tools for the improve¬ 
ment of plant properties relevant to sustainable 
agricultural production. Recent advances in high 
throughput genomics technologies including next gen¬ 
eration sequencing and high-throughput genotyping 
have helped immensely in understanding the functions 
and regulation of genes in crop plants. The ever- 
increasing availability of genome sequences in crop 
plants has facilitated greatly the development of geno¬ 
mic resources that will allow us to address biological 
functions and a number of basic processes relevant to 
crop production leading to sustainable agriculture. It is 
therefore expected that genomics will be an integral 
part of the agricultural/plant breeding practices of the 
future for improving crop productivity to achieve food 
security and sustainable production. 

Plants have been used as sources of small molecular 
weight compounds (secondary metabolites) with 
applications as medicines, flavors, and fragrances for 


millennia. However, many plants containing such high- 
value secondary metabolites are difficult to cultivate or 
are becoming endangered because of overharvesting. In 
addition, the chemical synthesis of plant-derived com¬ 
pounds is often uneconomical due to their highly com¬ 
plex structures. The biotechnological production of 
valuable secondary metabolites is an attractive alternative 
to the extraction of whole plant material (► Medicinal 
Plants, Engineering of Secondary Metabolites in Cell 
Cultures). Functional genomics may open entirely new 
avenues to screen unexplored medicinal plant species 
for their pharmacological value. 

Molecular breeding (MB) allows the stacking of 
favorable alleles, or genomic regions, for target traits 
in a desired genetic background, thanks to the use of 
polymorphic molecular markers (MM) that monitor 
differences in genomic composition among cultivars, 
or genotypes, at specific genomic regions, or genes, 
involved in the expression of those target traits 
(► Molecular Breeding Platforms in World Agricul¬ 
ture). The use of MM generally increases the genetic 
gain per crop cycle compared to selection based on 
plant phenotyping only, and therefore reduces the 
number of needed selection cycles, hastening the deliv¬ 
ery of improved crop varieties to the farmer. In contrast 
to the private sector, MB adoption is still limited in the 
public sector, and is hardly used at all in developing 
countries. The situation is critical in developing coun¬ 
tries due to resource-limited breeding programs. As 
a result, the developing world has yet to benefit from 
the MB revolution, and most countries indeed lack the 
fundamental prerequisites for a move to informatics 
powered breeding. A sustainable web-based Molecular 
Breeding Platform (MBP) as a one-stop-shop for infor¬ 
mation, analytical tools and related services to help 
design and conduct marker-assisted breeding experi¬ 
ments in the most efficient way will alleviate many of 
these bottlenecks in the developing world. Such 
a platform will enable breeding programs in the public 
and private sectors in developing countries to acceler¬ 
ate variety development using marker technologies for 
different breeding objectives. 

With the advent of molecular techniques, plants 
have the potential to serve as production vehicles for 
natural or engineered products that were previously 
limited to other hosts. Plant molecular pharming of 
industrial proteins refers to recombinant proteins used 
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in industrial processes and produced in plants (► Plant 
Molecular Pharming, Industrial Enzymes). Enormous 
quantities of a variety of enzymes go into making 
products such as paper, leather, detergents, pharma¬ 
ceuticals, food, beverages, chemicals, and fabric, to 
name a few, and economical production of these indus¬ 
trially important enzymes is crucial to commerce. This 
production must be balanced with the need for sus¬ 
tainability and environmental stewardship. Sustainable 
production of industrial enzymes requires that 
resources are not overexploited and wastes are not 
polluting. The use of plants as “green” factories can 
meet both criteria. Combined with modern farming 
and containment methods, transgenic plants have the 
potential to produce large quantities of target material 
safely and sustainably. 

The demand for recombinant medical proteins has 
increased in recent years and modern biotechnological 
methods have, until recently, ensured the production of 
safe and effective biopharmaceuticals to meet this 
demand. Various production platforms are currently 
used in the pharmaceutical industry, most based on the 
fermentation of engineered pro- and eukaryotic micro¬ 
organisms, insect cells, or mammalian cells (► Plant 
Molecular Pharming, Pharmaceuticals for Human 
Health). The growth of the market for biopharma¬ 
ceuticals is predicted to outpace production capacity 
using these platforms in the next decade, so alternatives 
are necessary. The production of pharmaceutical 
proteins in plants began with a monoclonal antibody 
expressed in transgenic tobacco plants more than 
20 years ago. Since then many different plant species 
have been genetically engineered to produce valuable 
pharmaceutical proteins. Major progress has been 
achieved in transformation and expression technology, 
downstream processing of transgenic plant material, 
and the adaptation of regulatory procedures to encom¬ 
pass the new production platforms, allowing the first 
plant-made pharmaceuticals to begin clinical trials. 

Since the successful expression of complete anti¬ 
bodies in transgenic plants and the first report of 
plant-based vaccine production in 1992 a large number 
of different vaccines, antibodies, as well as antibody 
fragments have been produced in plants for medical 
or veterinary purposes (► Plant Molecular Pharming, 
Veterinary Applications). Novel processing methods 
have been developed over the past several years to 


facilitate the development of recombinant proteins 
for veterinary applications. 

Humankind has had an ever-increasing impact on the 
environment. With the increasing intensification of agri¬ 
culture, particularly during the twentieth century, this 
impact has become even more pronounced, often with 
undesirable or unacceptable consequences, including 
water pollution, soil erosion, and loss of habitat, often 
accompanied by a loss of biodiversity (► Transgenic 
Crops, Environmental Impact). Pests, particularly 
weedy plants, demonstrate a remarkable ability to 
adapt to agricultural production systems. The practice 
of growing monocultures, typically used in intensive 
agriculture, increases the number of pests; these are 
currently predominantly controlled through use of pes¬ 
ticides. However, with increasing exposure to pesticides 
many pest populations are evolving resistance to these 
compounds. An additional problem encountered with 
many pesticides, and particularly insecticides, are their 
nontarget effects on beneficial insects. Transgenic crops 
expressing genes conferring resistance to insect pests 
and/or herbicides are becoming increasingly more 
widely grown and have the potential to eliminate many 
of the problems in conventional agriculture thus reduc¬ 
ing agriculture s negative impact on the environment. 

Annual losses worldwide due to plant diseases are 
estimated to be ~14% of total losses and about $220 
billion. In addition, the need for measures to control 
diseases limits the acreage of land available for cultiva¬ 
tion; restricts the crops that can be grown in fields already 
contaminated with certain pathogens; and necessitates 
the use of agrochemicals for treating seeds, fumigating 
soils, spraying plants, and applying postharvest treat¬ 
ments. Such control measures add to the cost of food 
production and toxic chemicals can be harmful to 
human health and the environment (► Transgenic 
Crops Resistant to Fungal, Bacterial and Viral Patho¬ 
gens). Resistance to pathogens can be achieved by 
application of disease-suppressing cultural practices, 
use of plant defense-promoting substances, deploy¬ 
ment of biological agents antagonistic to the pathogens 
that cause disease, agrochemicals, conventional breed¬ 
ing strategies, and genetic engineering. The need for 
controlling plant diseases effectively is not only a major 
challenge, but also a necessity to reduce food losses 
while improving food quality and safeguarding the 
environment. 
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Genetic Engineering can increase the nutritional 
quality of crops by increasing the availability of 
essential nutrients, which are often limited in human 
diets and lead to specific deficiency diseases 
(► Biotechnology and Nutritional Improvement of 
Crops). Food insecurity is one of the most important 
social issues faced today, with nearly one billion peo¬ 
ple enduring chronic hunger and an additional two 
billion people suffering from nutrient deficiencies, 
most in the developing world. Strategies to address 
food insecurity must ultimately address underlying 
problems such as poverty and poor governance/infra- 
structure, but the improvement of agricultural pro¬ 
ductivity in the developing world is an important 
goal, and biotechnology is one of a raft of measures 
being considered to achieve it. 

The economic impact of transgenic crops has been 
immense (► Global Economic Impact of Transgenic/ 
Biotech Crops (1996-2008)). This has been a major 
driver in their rates of adoption amongst farmers in 
the USA and other industrialized countries but perhaps 
more importantly by small holders in the developing 
world. Analysis on farm income effects through exten¬ 
sive analysis of existing farm level impact data for 
biotech crops confirms this to be a major reason for 
their broad adoption worldwide. 

Biotechnology offers efficient and cost-effective 
means to produce a diverse array of novel, value- 
added products and tools. The first generation of 
commercialized biotechnology products were crops 
focusing largely on input agronomic traits whose 
value was often opaque to consumers. The coming 
generations of crop plants can be grouped into broad 
areas each presenting what, on the surface, may appear 
as unique challenges and opportunities (► Transgenic 
Crops, Next Generation). The present and future focus 
is on continuing improvement of agronomic traits such 
as yield and abiotic stress resistance in addition to the 
biotic stress tolerance of the present generation, 
crop plants as biomass feedstocks for biofuels and 
“bio-synthetics,” value-added output traits such as 
improved nutrition and food functionality, and 
plants as production factories for therapeutics and 
industrial products. From a consumer perspective, 
the focus on value added traits, especially improved 
nutrition, is undoubtedly one of the areas of greatest 
interest. 


The rapid development and deployment of modern 
biotechnology in the last decades has made biosafety 
a major issue. Although modern biotechnology can 
benefit agricultural productivity in developing coun¬ 
tries transgenic crops remain an issue with regard to the 
conservation and sustainable use of biodiversity, as well 
as to human health. The perceived risks, which relate to 
the release of transgenic crops into the environment as 
well as the placement of genetically engineered crops 
onto the market, have as much to do with social values 
as scientific concerns. Regulatory frameworks for 
transgenic crops and the underpinning legislative 
frameworks have been or are being developed world¬ 
wide (► Commercialisation of GM Crops: Compari¬ 
son of Regulatory Frameworks). These are viewed as 
essential components for the prudent deployment of 
transgenic crops worldwide. 

The safety measures associated with transgenic crop 
deployment are embedded in process- or product- 
based regulatory approaches. The EU regulatory 
approach is process-based, precautionary, and includes 
mandatory labeling and traceability requirements for 
transgenic crops and their derived food and feed prod¬ 
ucts (► Transgenic Crops, Risk Assessment and Regu¬ 
latory Framework in the European Union). During its 
development, the EU regulatory system has become 
increasingly more stringent and unduly onerous. In 
the EU, the risk analysis consists of three components: 
risk assessment, risk management, and risk communi¬ 
cation. When analyzing potential risks, it is important 
to bear in mind that the real choice is not between 
transgenic crops that are inherently risky and tradition¬ 
ally bred ones that are completely safe. The cultivation 
of existing crops and those with novel traits (including 
transgenic crops) will have both positive and negative 
consequences. To fully acknowledge the overall out¬ 
come of adopting specific crops, and to assess and 
manage more effectively the environmental footprint 
of agriculture as a whole, broader and more balanced 
legislative oversight is needed in the EU. 

A framework for a better communication about 
science and regulation and production of GM crops is 
described in ► GM Crop Risk Debate, Science and 
Socioeconomics 

While transgenic herbicide resistant crops have 
been a boon to agriculture, reducing both production 
costs and ecological impacts of farming, weeds have 
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rapidly evolved resistance to the major herbicide used 
in transgenic crops (glyphosate), rapidly rendering the 
technology less sustainable than had been thought 
(►Sustainable Herbicide-Resistant Crops). While no 
practice in agriculture has been sustainable forever, the 
period of sustainability can be extended. Methods to 
extend both the usefulness to crops where needed as 
well as the sustainability of transgenic herbicide tech¬ 
nologies such as rotations of crops and herbicides, 
increasing the targets of herbicide action, suppressing 
herbicide targets in rotation, are needed. 

Crop Physiology and Agronomy 

Seed dormancy is a means of restricting germination to 
the season when environmental conditions are suitable 
for plant establishment. From an agricultural perspec¬ 
tive, dormancy is a problem (► Seed Dormancy and 
Agriculture, Physiology). Many important challenges 
face agriculture in relation to dormancy and these 
apply to cultivated crops as well as noxious weeds. 
The physiological mechanisms responsible for the 
expression of the character are now better understood 
and molecular information underpinning the process 
is gradually being generated and incorporated into 
strategies to solve dormancy-related problems. 

One of the first decisions a farmer needs to make is 
to choose the particular genotype to be grown in the 
fields based on anticipated or projected economic 
returns. This is a critical choice that determines the 
sustainability of the agricultural system (► Genotype 
by Environment Interaction and Adaptation). Identi¬ 
fying breeding implications on specific adaptive traits 
and the different statistical approaches for genotype by 
environment interaction characterization is important. 

Attaining global food security by means of 
increased crop productivity will require an increase in 
gains from selection achieved through conventional 
breeding. The identification of molecular markers 
associated with loci controlling traits of agronomic 
interest coupled with the exploitation of marker- 
assisted breeding (MAB) approaches provides the 
opportunity to accelerate gain from selection 
(► Marker-Assisted Breeding in Crops). Genomic 
selection is already having a positive impact on the 
improvement of crop yield, mainly in the private sector 
where high-throughput infrastructures allow breeders 


to handle the large number of molecular data points 
that are needed for deploying genomic selection effec¬ 
tively. Ultimately, an effective exploitation of MAB to 
enhance crop performance will rely on a closer integra¬ 
tion between molecular approaches and conventional 
breeding. 

The next generation of highly productive crops in 
an increasingly variable and changing climate, will rely 
on genetic interventions based on process understand¬ 
ing, selection of target traits in managed environments, 
and high-throughput phenotyping and genotyping 
(► Plant Breeding Under a Changing Climate). There¬ 
fore, it is crucial to understand the recent advances in 
plant breeding for high yield potential environments 
and also those where abiotic stress is a major limitation 
to productivity. 

Agronomic systems are defined as site-specific 
management of soils and crops on the basis of eco- 
regional and physiographic characteristics, and in the 
context of socioeconomic and policy environments. 
These systems are strong determinants of agricultural 
production, sustainable use of resources, and their 
environmental impact. Agricultural soils and ecosys¬ 
tems can also be used for sequestration of atmospheric 
C0 2 by enhancing photosynthesis, increasing net 
primary productivity (► Agronomic Interactions with 
C0 2 Sequestration). 

Crop management comprises a set of agronomic 
practices such as tillage systems, methods of fertiliza¬ 
tion, and crop rotations (► Cropping Systems: Shaping 
Nature). Cropping system may vary among farms 
depending on availability of resources and particular 
constraints. The different cropping systems may deter¬ 
mine water and nutrient availabilities, carbon cycle, 
erosion, and the pathogen inoculum in the soil. 

Understanding plant development or the progres¬ 
sion of plants through their life cycle is important 
because of the need to know and predict when harvest- 
able products are at their optimum (► Crop Develop¬ 
ment Related to Temperature and Photoperiod). 
Current knowledge on how temperature (including 
vernalization) and photoperiod regulate crop develop¬ 
ment is of major interest and determines how crops 
adapt in a wide range of environments. Thus, it is 
critical to understand the physiology and genetic basis 
of crop development, and to predict as accurately as 
possible the timing of key developmental events. 
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Reduced crop productivity from the theoretical 
potential maximum commonly occurs because of 
high temperatures. In addition, air temperatures are 
predicted to increase during the twenty-first century. 
Crop physiological and developmental processes are 
sensitive to temperature so that high temperatures do 
frequently affect negatively crop productivity 
(► Sustainable Productivity, Heat Tolerance for). The 
effects of high temperature, elevated C0 2 and their 
interaction on crops, are therefore important to under¬ 
stand and subsequently to mitigate potential negative 
effects of genes that confer heat tolerance. 

Since plants are immobile, their distribution greatly 
influences the ability of a crop to capture and use 
environmental resources (radiation, water, and nutri¬ 
ents), which are necessary for growth and yield. The 
spatial arrangement of plants and the temporal devel¬ 
opment of their structures (mainly leaves and roots) 
define the ► Spatial Crop Structure in agricultural sys¬ 
tems. Density and spatial arrangement of crops may 
affect intraspecific competition and resource use effi¬ 
ciency, allowing full or partial use of available 
resources. 

The rate of accumulation of dry plant matter is 
entirely dependent on the interception of energy from 
the sun in the wavelength range 400-700 nm. This 
energy is utilized by photosynthesis to synthesize car¬ 
bohydrates and other biological molecules needed for 
essential plant processes (► Crop Radiation Capture 
and Use Efficiency). Given the current emphasis on 
global food security, there is currently much interest 
in raising the radiation use efficiency of key crops in 
important agro-ecosystems. 

Scarcity of water resources is an increasingly impor¬ 
tant issue since it will dictate global production of food 
and feed for the next generations (► Crop Responses to 
Available Soil Water). Key factors responsible for 
sustained plant growth and production under water 
scarcity, for annual as well as perennial (fruit) crops 
are of paramount importance. 

Irrigated agriculture is currently responsible for 
over 40% of total production on 17% of all cultivated 
land area. It is therefore imperative that irrigated agri¬ 
culture not only sustains its current rates of productiv¬ 
ity but also increases (► Irrigation Management for 
Efficient Crop Production). Irrigation expansion is 
currently under pressure from other sectors to reduce 


its share of the fresh water resources. Efficient crop 
production under irrigation in the future would be 
essential to produce more food with less water. This is 
an immense challenge, not easy to achieve without 
novel and innovative approaches in irrigation manage¬ 
ment and crop productivity. 

Sustainability of fertilizer use is very important 
(►Fertilizer Science and Technology), as fertilizers 
are indispensable because nutrient supplies from the 
soil are normally inadequate for high-yielding crops 
and compensate for nutrient removals by previously 
cultivated crops. In addition, fertilizer may also 
improve the quality of human food and animal feed. 

Nitrogen is the most important limiting factor, after 
water deficit, for crop production worldwide (► Crop 
Responses to Nitrogen). Therefore, understanding how 
the yield of different crops can be improved by addition 
of nitrogen (N) fertilizers is critical. There are several 
important issues regarding the dynamics of crop 
N demands during the crop development cycle, the 
timing of the soil N supply according to soil character¬ 
istics, climate and soil agronomic management, the 
crop responses to different intensity and timing of 
N nutrition deficiency, and the time management of 
crop N fertilization using diagnostic and decision¬ 
making tools to optimize trade-offs between 
minimizing crop yield reduction and minimizing envi¬ 
ronmental impacts. 

A substantial increase in the effectiveness with 
which available water and nutrients are used is required 
to ensure food security and environmental protection. 
An essential component of crop improvement is 
breeding for deeper or denser root systems. These char¬ 
acteristics promote soil moisture and nutrient capture 
and high dry matter production in cultivars subjected 
to water and/or nutrient stresses (► Roots and Uptake 
of Water and Nutrients). The current understanding of 
the structure and functions of crop root systems and 
the avenues for the optimization of root anatomy and 
morphology traits that could be applied to the genetic 
and agronomic improvement of crop root systems for 
more effective below-ground resource capture are thus 
very important. 

Lodging is the process by which the shoots of small 
grained cereals are permanently displaced from their 
vertical stance (► Lodging Resistance in Cereals). The 
reduced lodging risk of shorter varieties enabled them 
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to respond to greater amounts of fertilizers and this was 
a significant reason for the steady improvement in 
global cereal grain yields starting in the late 1960s. 
However, lodging is still a major problem in many 
countries and there is an urgent need to improve lodg¬ 
ing resistance to further increase the yield of cereal 
species. 

Plant growth and yield are severely affected by 
saline soils. High concentrations of salt in the soil 
make it difficult for plants to take up water, whilst the 
accumulated salts in cells, particularly sodium and 
chloride ions, are toxic to plant metabolism 
(► Increasing Salinity Tolerance of Crops). These two 
factors result in a reduction in plant growth, an 
increase in the rate of leaf senescence, and a loss in 
crop yield. Crop salinity tolerance can be improved, 
but a more in-depth understanding of osmotic and 
ionic stresses is needed. 

Agroecology provides the basic ecological princi¬ 
ples needed for studying, designing, and managing 
agroecosystems that are productive, sustainable, and 
economically viable (► Agroecological Basis for 
Managing Biotic Constraints). Rather than focusing 
on one particular component of the agroecosystem, 
agroecology emphasizes the interrelatedness of all of 
its components and the complex dynamics of ecologi¬ 
cal processes including all environmental and human 
elements. From a management perspective, the agro¬ 
ecological objective is to provide balanced environ¬ 
ments, sustained yields, biologically mediated soil 
fertility, and natural pest regulation through the design 
of diversified agroecosystems and the use of low-input 
technologies. 

Plant diseases cause substantial crop losses every 
year (► Crop Diseases, Management and Control of). 
Controlling plant diseases is therefore vital to 
maintaining crop productivity. Crop diseases can be 
controlled using a variety of methods; however, plant 
pathogens are genetically adaptable and can overcome 
plant resistance, and the toxic effects of pesticides. 
Ensuring that crops are adequately protected from 
diseases depends therefore on being a step ahead of 
the pathogens by improving existing control measures 
and developing new approaches. 

Increase of crop yields may be achieved by maxi¬ 
mizing the proportion of sunlight energy that is fixed 
by the crop or by reducing the amount of energy that is 


lost by insect pests, diseases, and weeds. More than 50% 
of the potential yield of agricultural crops is lost by the 
three constraints. ► Integrated Pest Management 
(IPM) aims to diminish losses caused by insect pests 
in agriculture, in an economically, ecologically, and 
sociologically acceptable manner. A major challenge 
for ecology is the development of a scientific approach 
to better understand processes in agroecosystems in 
order to implement more rapidly sustainable IPM 
systems. 

Yield increases will continue to play a dominant 
role in world food security (► Crop Yields Around 
the World: Closing the Gap and Raising the Potential). 
Understanding the gaps between potential and actual 
yield is of paramount importance in order to increase 
actual yields. These include several aspects with respect 
to the natural resource base of the plot (climate, soil 
type, topography) and long-term management 
investments. 

Grain quality of field crops is related to seed 
structure and composition. Grain composition is the 
major reason why only a limited number of plant 
species are used for food and fiber (► Grain Quality 
in Oil and Cereal Crops). It is impossible to put 
forward a unique grain quality definition for any 
species because this depends on the specific product 
end use. Therefore, understanding the physiological 
bases of seed composition and structure is essential 
to produce grains with a particular quality 
specification. 

Agricultural production takes place under erratic 
and unpredictable conditions, particularly the avail¬ 
ability and timing of radiation and rainfall patterns 
which are extremely difficult to predict. Their effects 
are compensated to some extend by the qualities of the 
land and the interventions of the farmer. Any method¬ 
ology that would improve the predictability of the 
availability of resources and their impact on the per¬ 
formance of the production system could in principle 
improve performance and reduce the level of uncer¬ 
tainty (► Simulation Models as Tools for Crop Man¬ 
agement). Crop growth simulation models are viewed 
as excellent tools for the reduction of this uncertainty. 
Advantages and limitations of such models need to be 
understood in the context of past experiences and the 
current state of the art in order to ascertain their best 
possible uses. 
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Glossary 

Backcross To cross the progeny of a hybrid to one of 
its parents, which when repeated for many genera¬ 
tions would yield advanced backcross progeny. 
cDNA Complementary DNA (cDNA) refers to the 
DNA copy of RNA transcripts, which can be cloned 
into vectors to generate a library (collection) of 
different cDNAs. 

Chromosomal recombination Breakage and rejoining 
of parental homologous chromosomes during 
meiosis, resulting in the exchange of chromosomal 
segments. 

Cloning vector DNA molecule into which another 
DNA fragment can be integrated and replicated to 
produce large quantities of the cloned DNA. Exam¬ 
ples are plasmids, lambda phage, cosmids, yeast 
artificial chromosomes (YACs), and bacterial arti¬ 
ficial chromosomes (BACs). 

Complementation A method of validating a gene clon¬ 
ing by using wild-type allele to rescue the function of 
mutant allele through genetic transformation. 

Crop trait Any morphological, physiological, or other 
biological feature measurable at the plant level, 
present in different forms in different individuals 
that enable genetic analysis. 

Domestication An artificial selection process 
conducted by humans to produce plants that have 
more desirable traits than wild plants. 

EST Expressed sequence tag is a short subsequence 
from a transcribed cDNA sequence. 


Forward genetics A strategy to identify or clone genes 
that are responsible for a phenotype of interest. 

Gene A gene is an ordered sequence of nucleotides that 
encodes a specific functional product (i.e., a protein 
or RNA molecule). 

Marker Molecular or genetic marker is a DNA 
sequence at a known location on a chromosome. 

Mutation Changes in a genomic sequence, occurring 
naturally or artificially, that can either have 
no effect, alter the product of a gene, or 
prevent the gene from functioning properly or 
completely. 

NIL Near-isogenic lines (NILs) refers to genotypes or 
lines that are genetically almost identical except for 
a small chromosome fragment or DNA sequence by 
which they differ. 

ORF Open reading frame (ORF) refers to a DNA 
sequence that does not encode a stopcodon and 
predicted to encode a protein. 

PCR Polymerase chain reaction, a method in which 
DNA is amplified or increased in number of copies, 
using oligonucleotide primers flanking the DNA 
sequence and an enzyme that carries out the 
reaction. 

Polyploid Organism that has more than two paired 
sets of chromosomes that is present in a diploid. 

Positional cloning A method of cloning genes based 
on their position in the genome, using molecular 
markers in a genetic map located close to the gene 
and then identifying cloned DNA fragments 
containing the gene of interest. 

Quantitative trait locus A region on the genome asso¬ 
ciated with a particular phenotype showing contin¬ 
uous and measurable variation such as height or 
weight. 

Reverse genetics A strategy to identify the function of 
genes revealed by DNA sequencing, by analyzing 
the phenotypic effects of the gene sequences by 
mutations or changes in expression. 

RFLP Restriction fragment length polymorphism 
(RFLP) is a molecular marker revealed by differ¬ 
ences in restriction fragments between homologous 
(similar) fragments of DNA. 

STS marker A sequence-tagged site (STS) marker is 
a short DNA sequence at known location in the 
genome, with nucleotide differences/polymor¬ 
phisms that make it a useful marker for mapping. 
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Transposon A piece of DNA that can move within the 
genome of an organism, and when inserted in genes 
can cause a mutation. 

Definition of the Subject and Its Importance 

Crop plants have been selected from their wild ances¬ 
tors by humans for food, fiber, health, recreation, 
industry, and other special purposes. Crop breeding 
has then added on desired traits for convenience in 
crop production. Genetics, molecular biology, geno¬ 
mics, and other disciplines have now provided us an 
insight to the genes that encode these very important 
crop traits on which the human race now depends. 
Knowledge of the molecular genetic basis of valuable 
crop traits will help provide solutions to our require¬ 
ments for sustainable existence, faced with the threats 
of growing population needs, climate change, and 
dwindling natural resources. This entry describes the 
historical and conceptual discoveries underlying the 
basis of important crop traits, leading from plant 
domestication such as in plant architecture to those 
offering solutions for high crop production purposes 
such as growth under unfavorable environments or 
disease outbreaks. Detailed descriptions are provided 
on the cloning or identification of the plant genes 
involved in crop traits that give an insight on the 
evidence required for establishing the relationship. In 
addition, the molecular basis of the crop trait provides 
an understanding of the biological processes involved, 
the interaction to other traits, and a framework that 
can be built on to engineer novel solutions to the future 
needs from crop plants. 

Introduction 

The domestication of the major crop plants from around 
10,000 to 4,000 years ago resulted in the selection of 
crops on which humans are now dependent [1]. The 
crop traits involved in domestication, transformed the 
wild species to a crop species catered to the needs of this 
paradigm shift of human civilization to an agricultural 
lifestyle. These early crop traits that made agriculture 
possible were selections from wild species that allowed 
humans to collect grain from the sown crop involving 
non-shattering, larger grain and fruit, determinate 
growth with more synchronous harvest, easier accessi¬ 
bility to the grain from protective hard outer coat, and 


an overall increase in edibility along with a plethora 
of many specialized traits. The genetic analysis 
and molecular isolation of key domestication traits is 
now uncovering the gene, the regulatory processes, 
and the selective sweeps from the nearest 
wild ancestors that accompanied the domestication 
process [2]. 

Gene cloning and DNA sequencing from the simple 
bacterial organisms to that of the complex polyploids 
such as wheat, has enabled researchers to examine gene 
sequences one at a time and make conclusions on 
similarity between close and distantly related species, 
propose functions based on molecular and biological 
properties. In fact the generation of gene sequences of 
a multitude of organisms is the fastest growing dataset, 
which promises to reveal the identity and function of 
all living forms. Genome sequencing started with 
model and standard genomes. The model plant 
Arabidopsis thaliana , a common weed, was selected 
for genome sequencing and molecular genetics analysis 
[3], with the DNA sequence of a specific ecotype 
Columbia [4]. Now, the 1001 genomes project intends 
to finish off high-quality resequencing of Arabidopsis 
[5] that will provide sequences of genes from the eco¬ 
types adapted to diverse environments, which might 
reveal the effect of natural selection and adaptation of 
this very effective weed species. This also opens up 
a new era of going beyond a “reference” genome 
sequence of an organism toward sequence-based 
maps of closely related genotypes, which can provide 
a more accurate landscape of differences between genes 
and genomes and the relationships to expressed traits. 
The power of this technology was demonstrated 
by revealing the low level of mutations that occur 
between plants following a few generations of self- 
fertilization [6]. 

Gene isolation and cloning in higher plants has now 
come a long way since the early days of struggling with 
genome size and complexity. Isolation of mRNA and 
characterization of translated proteins was one of the 
primary methods to demonstrate the function of spe¬ 
cific nucleic acids as shown for soybean leghemoglobin 
mRNA [7], which was then proven to be transcribed 
from the soybean genome [8]. One of the first 
approaches in plants to characterize the different com¬ 
ponents of a plant cell in terms of the complexity of 
nuclear and polysomal RNA, was using tobacco an 
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early amenable model for biological studies [9], and 
then analysis of the proportion of the genome and the 
genes that were transcribed [10]. One of the first plant 
genes cloned from genomic DNA was a soybean ribo- 
somal DNA (rDNA) gene from a lambda genomic 
DNA library [11]. Soybean continued to be a species 
of interest, resulting in the generation of a complemen¬ 
tary DNA (cDNA) library for the analysis of auxin 
response [12]. In maize, differential expression of the 
gene for the chloroplast encoded large subunit of 
ribulose bisphosphate carboxylase was shown in bun¬ 
dle sheath cells and absent in mesophyll cells [13], 
describing the regulation of enzymes distinguishing 
C4 plant cell types. These early studies showed the 
potential to address different plant traits by the analysis 
of cloned or isolated characterized genes, and the novel 
insights they provided. 

In the course of improvement in techniques for the 
isolation of plant genes, methods were developed to 
make a correlation or causative association between 
specific functions and gene sequences. These functions 
could be defined on the basis of specific proteins with 
a biological function or genes determining a phenotype 
as revealed by isolated mutants or naturally occurring 
variants. 

One of the important technologies that helped 
bridge the gap between genes and functions was the 
development of transformation technologies that 
allowed the expression of isolated genes in plants and 
the monitoring of the resultant phenotype. Following 
early demonstrations of the transfer and expression of 
the Agrobacterium tumefaciens Ti-plasmid into plant 
cells [ 14] , these Ti-plasmids were engineered to transfer 
DNA (T-DNA) and express genes in transformed plant 
cells, demonstrated by a number of research groups 
around the same time in 1983 [15-19]. In addition to 
Agrobacterium -mediated transformation systems, 
other methods were developed such as direct DNA 
transfer to plant protoplasts [20], electroporation 
[21], and particle bombardment [22]. 

Cloning of plant genes can presently be accom¬ 
plished by a variety of high-throughput ways. The 
cloning of expressed genes in the form of cDNA librar¬ 
ies has been done in plasmids, lambda vectors, and 
other expression systems that allow the selection of 
specific genes based on hybridization, DNA sequenc¬ 
ing, or functional expression assays. The plant genome 


can be fractionated into representative fragments and 
cloned in a variety of sizes in appropriate cloning 
vectors. Genome fragments, in progressively increasing 
sizes from a fewkilobases (Kb) to megabases (Mb), can 
be cloned in vectors such as plasmids, lambda phage, 
cosmids, bacterial artificial chromosomes (BACs), 
yeast artificial chromosomes (YACs), and modifica¬ 
tions to these. While the smaller clones are useful for 
characterizing single genes or fragments thereof, the 
larger clones are used in positional cloning methods 
to identify genes using molecular markers located close 
to the gene of interest. 

To enable the isolation of plant genes that are asso¬ 
ciated with plant biological functions/processes or crop 
traits, a causal relationship has to be made between 
DNA sequence and observed phenotype. Genetic anal¬ 
ysis of mutants or other genetic variants that are local¬ 
ized on the chromosome can be used to clone the 
corresponding genes by methods such as map-based 
cloning (Fig. 1) and transposon tagging (Fig. 2), which 
are the two most commonly used methods for cloning 
genes by a forward genetics strategy centered around 
the phenotype of interest. In positional or map- 
based cloning, with a general strategy described in 
Fig. 1, the mapped position of a gene on the chromo¬ 
some/genome, which specifies a specific trait or phe¬ 
notype is the starting point to identify molecular/ 
genetic markers close to the gene and then using inte¬ 
grated physical and genetic maps “walk” or “land” 
closer to the gene. The proof of isolating the gene is 
by recombination analysis, complementation by trans¬ 
formation, analysis of genetic variants by sequencing, 
or mutational analysis of the candidate genes. In trans¬ 
poson tagging, described in a scheme in Fig. 2, an 
insertion sequence from a native or heterologous trans¬ 
poson is used to generate mutants of a specific pheno¬ 
type or genetic locus, and analysis of the transposon/ 
insertion-tagged mutant by co-segregation analysis to 
a transposon probe, transposon revertant analysis or 
complementation of the mutant by transformation, 
and among the many ways evidence can be obtained 
to prove the identity of the tagged gene. With the 
availability of genome sequences of many plant species, 
these quite anonymous sequences need to be assigned 
a function, generally through a reverse genetics strat¬ 
egy. In reverse genetics strategies many tools are avail¬ 
able, in most models and important crops, to generate 
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Crop Traits: Gene Isolation. Figure 1 

Example of map-based cloning of a gene, (a) Locus mapped to a chromosome region between markers SSR604 and 
SSR605 via rough-scale mapping, with L0D (log of odds) score indicating linkage, (b) The gene further delimited to a 
100-kb genomic region between markers SSR7004 and SSR7005 via fine mapping and physical mapping, (c) Precision 
mapping identified break points flanking a region containing a single gene. Black bars represent DNA from parent 1 
(phenotype: high), white bars represent DNA from parent 2 (phenotype: low), and patterned bars represent DNA of 
heterozygote. The vertical lines show where recombination break points are positioned along the chromosome 


or select mutations in specific genes and thus attribute 
functions by the analysis of the mutant phenotypes. 

This entry describes the specific methods that have 
been used in the isolation and characterization of genes 
that determine important crop traits that agriculture is 
based on. The crop traits identified in many plants have 
homologues in the model plant Arabidopsis that can 
provide clues to the function. The examples described 
are described in some detail and others summarized in 
a table (Table 1) for the reader to refer to. 


Crop Traits: Gene Isolation 

Dwarfing Genes in Cereals 

The biggest increase in agricultural production in the 
modern era, around the middle of the last century, is 
attributed to the “Green Revolution” where grain yields 
of the major world cereals, wheat and rice were 
improved along with complementary crop production 
practices [23, 24]. The spectacular increases in wheat 
and rice yields during the Green Revolution were 
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Use flanking DNA sequence as probe to screen 
genomic or cDNA library of wild type plant 

3 



Sequence PCR product to get information on 
DNA Sequence flanking transposon tag 



Crop Traits: Gene Isolation. Figure 2 

Example of gene tagging using heterologous transposons. An "in cis two-component transposon construct" 
integrated into plant chromosome via Agrobacteriurn-mediated transformation. The construct contains 
a mobile nonautonomous transposon and an immobile transposase source. The transposase mediates transposition of 
the mobile nonautonomous transposon into new chromosomal positions. Segregation in the next generation creates 
stable mutants containing the transposon inserted in a gene with no transposase source. These stable mutants can be 
used in forward or reverse genetics. Identification of integration position is typically done by sequencing of DNA flanking 
the transposon isolated using techniques, for example, TAIL-PCR, Inverse PCR, or plasmid rescue. The flanking sequence is 
used as a probe to screen genomic and cDNA clones. Functional complementation is done by transforming wild-type 
genomic clone containing the tagged gene into the insertional mutant line 


enabled by the introduction of dwarfing traits into the 
crop varieties. The founder of this Green Revolution, 
Norman Borlaug received the Nobel Peace prize in 1970 
in recognition for his efforts in bringing food security 
to major parts of the developing world [25]. Identifi¬ 
cation of the genes responsible for these traits showed 
that they interfere with the action or production of the 
gibberellin (GA) plant hormones [26]. 

The dwarfing gene of wheat, Reduced Height-1 
( Rht-1 ), was isolated based on sequence similarity to 
the previously isolated Arabidopsis GIBBERELLIN 
INSENSITIVE (GAT) gene [27]. GAI is a member of 


the DELLA proteins and a repressor of GA responses 
[28]. The wheat Rht-Blb and Rht-Dlb dwarfing 
mutants had similar characteristics to the Arabidopsis 
gai mutant, including reduced plant height, reduced 
responses to gibberellin, and increased in planta gib¬ 
berellin levels. Compared to the wild-type Rht-1 allele, 
the Rht-Blb and Rht-Dlb dwarfing alleles each 
contained a nucleotide substitution that created 
a stop codon near the N-terminus of the protein [27]. 
It was proposed that translational reinitiation after these 
introduced stop codons might result in an N-terminally 
truncated product that confers the mutant phenotype. 
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Crop Traits: Gene Isolation. Table 1 Some isolated genes in crops 


Gene (s) 

Crop 

Molecular and phenotypic function 

Cloning 

method 3 

Validation 

method 13 

Causative 

change 0 

Reference (s) 

7. Dwarfing gene 

Rht-1 

Wheat 

Transcriptional regulator (SH2); plant height 

SS 

Mut, HE 

EStp 

[27] 

sell 

Rice 

GA20 oxidase; plant height 

SS, MBC 

Var 

Coding 

[30-32] 

d 

Tomato 

Cytochrome P450 enzyme; plant height 

TT 

FC 


[141] 

2. Flowering/heading date 

Hd6 

Rice 

Protein kinase; flowering time 

MBC 

FC 

EStp 

[33] 

Hdl 

Rice 

Transcriptional regulator (zinc finger); 
flowering time 

MBC 

FC 

Coding 

[34] 

Hd3a 

Rice 

Phosphatidylethanolamine-binding protein; 
flowering time 

MBC 

OE, Sil 

- 

[37] 

Ehdl 

Rice 

B-type response regulator; flowering time 

MBC 

FC 

AC 

[142] 

Vgtl 

Maize 

Transcriptional regulator (AP2); flowering time 

MBC, AM 

OE, Sil 

Reg 

[143] 

E3 

Soybean 

Phytochrome A; flowering time 

MBC 

Mut 

AC? 

[144] 

Vrnl 

Wheat 

Transcriptional regulator (MADS); 
vernalization 

MBC 

Var 

Reg 

[145] 

Vrn2 

Wheat 

Transcriptional regulator (ZCCT); vernalization 

MBC 

Sil 

AC 

[146] 

3. Fruit ripening, shape, and weight 

Rin 

Tomato 

Transcriptional regulator (MADS-box); fruit 
ripening 

MBC 

FC, Sil 

Coding 

[147] 

Ovate 

Tomato 

Regulatory protein (OVATE); fruit shape 

MBC 

FC 

EStp 

[148] 

fw2.2 

Tomato 

Cell signaling; fruit weight 

MBC 

FC 

Reg 

[40] 

4. Grain yield 

Gnl 

Rice 

Cytokinin oxidase/dehydrogenase; grain 
number 

MBC 

Sil 

Reg/EStp 

[42] 

GS3 

Rice 

VWFC module-containing protein; grain 
weight 

MBC 

Var 

EStp 

[44] 

GW2 

Rice 

RING-type protein with E3 ubiquitin ligase 
activity; grain width 

MBC 

OE, Sil 

EStp 

[47] 

5. Disease resistance 

FI ml 

Maize 

HC toxin reductase; resistance to the fungus 
Cochliobolus carbonum 

TT 

Co 

Reg 

[49] 

Cf-9 

Tomato 

Leucine-rich repeat family of proteins; 
resistance to leaf mold fungus Cladosporium 
fulvum 

TT 

Co 

Wild 

[54] 

RB/Rpi- 

blbl 

Potato 

CC-NBS-LRR-class R-gene analog; resistance 
to oomycete pathogen Phytophthora infestans 

MBC 

FC 

Wild 

[58, 59] 

Rpi-blb2 

Potato 

NBS-LRR protein; resistance to oomycete 
pathogen Phytophthora infestans 

MBC 

FC 

Wild 

[60] 
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Crop Traits: Gene Isolation. Table 1 (Continued) 


Gene (s) 

Crop 

Molecular and phenotypic function 

Cloning 

method 3 

Validation 

method 6 

Causative 

change 0 

Reference (s) 

mlo 

Barley 

Membrane-anchored protein; resistance 
against the fungal pathogen Blumeria 
gram inis f. sp. hordei (Bgh) 

MBC 

Mut, RA 

AC/EStp 

[62] 

Mi 

Tomato 

NBS-LRR protein; resistance to root-knot 
nematodes (Meloidogyne spp.) 

MBC 

FC 

Wild 

[72] 

Tm-2 2 

Tomato 

CC-NBS-LRR class of R proteins; resistance to 
tomato mosaic virus (ToMV) 

TT 

FC 

Wild 

[76] 

Pto 

Tomato 

Protein kinase; resistance to Pseudomonas 
syringae pv. tomato 

MBC 

FC 

Wild 

[149] 

Xa21 

Rice 

Protein kinase; resistance to Xanthomonas 
oryzae pv. oryzae 

MBC 

FC 

Wild 

[150] 

Cf-2 

Tomato 

LRR protein; resistance to leaf mold fungus 
Cladosporium fulvum 

MBC 

FC 

Wild 

[151] 

N 

Tobacco 

LRR protein; resistance to tobacco mosaic 
virus (TMV) 

TT 

FC 

Wild 

[152] 

Hsl pro ■' 

Sugar 

beet 

LRR protein; resistance to beet cyst nematode 

MBC 

FC 

Wild 

[153] 

I2C 

Tomato 

NBS-LRR protein; resistance to Fusariurn 
oxysporum f sp lycopersici 

MBC 

Sil, OE 

Wild 

[154] 

Ve 

Tomato 

Cell surface-like receptors; resistance to 
Verticillium dahliae 

MBC 

FC 

" 

[155] 

R1 

Potato 

Leucine-zipper/NBS/LRR protein; resistance to 
Phytophthora infestans 

MBC 

FC 

Wild 

[156] 

Rpgl 

Barley 

Receptor kinase; resistance to Puccini a 
graminis f. sp. tritici 

MBC 

Var 

EStp, AC, 

FS 

[157] 

Hero 

Tomato 

NBS-LRR protein; resistance to potato cyst 
nematodes Globodera rostochiensis 

MBC 

FC 

Wild 

[158] 

LrlO 

Wheat 

CC-NBS-LRR protein; resistance to Puccinia 
triticina 

MBC, HS 

OE 

CD 

[159] 

Lr2i 

Wheat 

NBS-LRR protein; resistance to Puccinia 
triticina 

MBC 

FC 

Wild 

[160] 

Pm3b 

Wheat 

CC-NBS-LRR protein; resistance to Blumeria 
graminis f. sp. tritici 

MBC 

STA 

- 

[161] 

Pi9 

Rice 

NBS-LRR protein; resistance to Magnaporthe 
grisea 

MBC 

FC 

Wild 

[162] 

Rpgs 

Barley 

Protein kinase; resistance to Puccinia graminis 
f. sp. secalis 

MBC 

Sil 

FS 

[163] 

Yr36 

(WKS1) 

Wheat 

Kinase-START protein; resistance to Puccinia 
striiformis f. sp. tritici 

MBC 

FC 

Wild 

[164] 

Rdg2a 

Barley 

CC-NB-LRR protein; resistance to 

Pyrenophora graminea 

MBC 

FC 

Reg 

[165] 
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Crop Traits: Gene Isolation. Table 1 (Continued) 


Gene (s) 

Crop 

Molecular and phenotypic function 

Cloning 

method 3 

Validation 

method* 3 

Causative 

change 0 

Reference (s) 

6. Plant and inflorescence architecture 

tbl 

Maize 

Transcriptional regulator (TCP); plant and 
inflorescence structure 

TT 

Co 

Reg 

[80] 

tgal 

Maize 

Transcriptional regulator (SBP); seed casing 

MBC 

Mut 

AC 

[85] 

Q 

Wheat 

Transcriptional regulator (AP2); inflorescence 
structure 

MBC 

Mut 

Reg/AC 

[92] 

vrsl 

Barley 

Transcriptional regulator (HD-ZIP); spikelet 
morphology 

MBC 

Mut 

AC/FS 

[98] 

nud 

Barley 

Transcriptional regulator (ERF); seed casing 

MBC 

Var, Mut 

CD 

[103] 

MOC1 

Rice 

Transcriptional regulator (GRAS); tillering 

MBC 

FC 

TE 

[107] 

PROG1 

Rice 

Transcriptional regulator (ZF); growth habit 

MBC 

FC 

Reg/AC 

[109] 

7. Seed quality and color 

opaque-2 

Maize 

Transcriptional regulator (bZIP); endosperm 
characteristic 

TT 

Co 

TE 

[HI] 

l/l/x 

Maize 

Starch synthase; sticky grains 

TT 

Rev 

TE 

[113] 

l/l/x 

Rice 

Starch synthase; sticky grains 

SS 

Var 

Splice 

[115, 116] 

Sh2 

Maize 

Pyrophosphorylase; supersweet sweet corn 

TT 

Rev 

TE 

[118] 

sut 

Maize 

Isoamylase; sweet corn gene 

TT 

Co 

AC 

[119] 

Rc 

Rice 

Transcriptional regulator (bHLH); seed color 

MBC 

Mut 

Coding 

[121] 

cl 

Maize 

Transcriptional regulator (MYB); kernel color 

TT 

Co 

Reg 

[166] 

yi 

Maize 

Phytoene synthase; carotenoid content 

TT 

Rev 

Reg 

[167] 

R 

Pea 

Starch branching enzyme; seed sugar content 

IS 

Co 

TE 

[168] 

Brix9-2-5 

Tomato 

Invertase; fruit-soluble solid content 

MBC 

IR 

Reg 

[169] 

8. Seed shattering 

sh4 

Rice 

Transcriptional regulator (Myb3); abscission 
layer formation, shattering 

MBC 

FC 

Reg/AC 

[128] 

qSHl 

Rice 

Transcriptional regulator (homeodomain); 
abscission layer formation, shattering 

MBC 

FC 

Reg 

[129] 

sh-h 

Rice 

CTD phosphatase; abscission layer 
differentiation, shattering 

MBC 

Mut, Sil 

Splice 

[131) 

Jointless 

Tomato 

Transcriptional regulator (MADS); abscission 
zone development, shedding 

MBC 

FC, Sil 

Coding 

[170] 

9. Tolerance to abiotic stresses 

SKC1 

Rice 

HKT-type transporters; salt tolerance 

MBC 

FC 

AC 

[134] 

Subl 

Rice 

Transcriptional regulator (ERF); submergence 
tolerance 

MBC 

FC 

CD 

[137] 
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Crop Traits: Gene Isolation. Table 1 (Continued) 


Gene (s) 

Crop 

Molecular and phenotypic function 

Cloning 

method 3 

Validation 

method 6 

Causative 

change 0 

Reference (s) 

Altl 

Wheat 

Aluminum-activated malate transporter; 
tolerance to aluminum toxicity 

SH 

Co, Var, HE 

Reg/AC 

[140] 

Snorkel !, 
Snorkel2 

Rice 

Transcriptional regulator (ERF); deepwater 
tolerance 

MBC 

Var, OE 

CD 

[171] 


a MBC map-based cloning, TT transposon tagging, AM association mapping, HS haplotype study, SS sequence similarity, SH subtractive 
hybridization, IS immunological screening 

b FC functional complementation, Rev transposon revertant analysis, Co co-segregation analysis, RA recombination analysis, Var analysis of 
genetic variants by sequencing, Mut mutational analysis of the candidate genes, OE overexpression, S/7 silencing, IR intragenic recombi¬ 
nation, HE heterologous expression, SLA single-cell transient assay 

c AC amino acid change, Coding disrupted coding sequence, FS frame shift, EStp early stop codon, Reg regulatory change, Splice intron 
splicing defect, CD complete deletion, Wild introgression from wild relative ,TE transposon insertion (Following Doebley et al. [1] with 
modification) 


The rice sd-1 dwarfing allele, with origin from cul- 
tivar Dee-Geo-Woo-Gen and used in the Green Revo¬ 
lution dwarf variety IR8, was mapped to the long arm 
of chromosome 1 [29]. Characterization and isolation 
of this gene was reported in 2002 by three different 
groups [30-32]. Biochemical analysis of the sd-1 
mutant showed that the activity of GA20 oxidase 
(GA20ox), a key enzyme in the biosynthesis of gibber- 
ellin, did not function effectively in the mutant. A wild- 
type GA20ox gene ( GA20ox-2 ) was amplified by PCR 
using primers based on the conserved domain of 
GA20ox genes. Linkage mapping showed that this 
gene mapped to the long arm of chromosome 1, tightly 
linked to the sdl locus. Compared to the wild-type Sd-1 
allele, the sd-1 dwarfing allele (in Dee-Geo-Woo-Gen 
and IR8) contained a 383-base-pair deletion which 
produces a frame shift that creates a stop codon [30]. 
Another study showed RFLP markers flanking the sd-1 
locus were positioned on a physical segment of chro¬ 
mosome 1 covered by contiguous BAC clones by using 
the physical map of the reference genome available in 
the database. A candidate gene search identified the 
GA20ox-2 gene in this region. This gene was amplified 
in the wild type and mutant, and sequence comparison 
revealed a 280-bp deletion in the coding region of this 
gene in the mutant allele [31]. A third independent 
analysis of 3,477 segregants using several PCR-based 
markers localized the sd-1 locus in a 6-kb candidate 
interval on chromosome 1, containing only one pre¬ 
dictable ORF, that of GA20ox-2. The 3,477 segregants 
were derived from selfing of a backcross inbred 


line (BIL) having close chromosomal similarity to 
Sasanishiki (normal-type parent) over the whole 
genome length with the only heterozygous sequences 
located on sd-1 locus. Sequence comparison showed 
that Habataki (semidwarf parent) has a 383-bp dele¬ 
tion from the middle of exon 1 to upstream of exon 2, 
including a 105-bp intron, resulting in a frame shift 
that produces a termination codon in exon 3 [32]. 
These evidence clearly defined the identity of the Sd-1 
dwarfing gene and the mutant sd-1 alleles. 

Flowering or Heading Date 

Heading date or flowering time is an important trait for 
the adaptation of crops to different cultivation areas. In 
rice, heading date is determined mainly by two factors: 
duration of the basic vegetative growth and photope¬ 
riod sensitivity (PS) [33]. Genetic analysis of heading 
date in rice cultivars Kasalath and Nipponbare showed 
that a number of quantitative trait loci, termed Hd 
(heading date) determine the genetic variation for 
flowering time [34]. Day-length treatment sensitivity 
tests with an NIL of Hd6 [NIL (Hd6)] revealed the Hd6 
PS phenotype, with the Kasalath allele increasing days- 
to-heading under natural and long-day conditions but 
not under short-day conditions. Linkage analysis of an 
advanced backcross progeny mapped Hd6 on the long 
arm of chromosome 3 as a single Mendelian factor 
[35]. High-resolution mapping using 2,807 segregating 
plants derived from an advanced backcross progeny, in 
which the region around Hd6 was heterozygous and 
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almost all other regions were homozygous for 
Nipponbare, delimited Hd6 to a 26.4-kb genomic 
region. The sequence analysis of this region identified 
one gene with deduced amino acid sequence having 
high homology (>90%) to the a subunit of protein 
kinase CK2 (CK2a) in maize and Arabidopsis. 
Sequence comparison showed a single-nucleotide sub¬ 
stitution within the coding region, which changed the 
lysine codon (AAG) in Kasalath to a premature stop 
codon (TAG) in Nipponbare. Functional complemen¬ 
tation was performed by introducing the Kasalath 
genomic fragment carrying the CK2a gene into 
Nipponbare and the transgenic plants scored under 
natural day-length condition showed late heading, 
indicating that the Kasalath allele of CK2a increases 
days-to-heading [33]. 

Another heading date locus Hdl was characterized 
by high-resolution mapping using 1,505 early heading 
BC3F3 segregants derived from a cross between 
Nipponbare and Kasalath, which resolved the locus to 
a genomic region of 12 kb. Sequence analysis of the 
region identified one putative gene with considerable 
similarity to the CONSTANS (CO) gene, known to be 
involved in photoperiod response in Arabidopsis. 
Sequence comparison showed a 2-bp deletion in the 
second putative exon of the Kasalath allele resulting in 
a premature stop codon and a predicted shorter protein 
than the wild-type Nipponbare protein. Functional 
complementation was performed by transferring the 
candidate Hdl region from Nipponbare into a NIL of 
Nipponbare carrying an introgression of Kasalath 
nonfunctional Hdl allele. Transgenic plants showed 
earlier heading under short-day conditions than the 
NIL control and null-segregants. These results provide 
clear evidence that the Hdl sequence in the candidate 
genomic region retains the function of photoperiod 
response [34]. 

The Hd3a locus was roughly mapped on chromo¬ 
some 6. NIL(LM3a), an NIL homozygous for the 
Kasalath allele at the Hd3a locus in the genetic 
background of Nipponbare, headed earlier than 
Nipponbare under SD conditions and headed at almost 
the same date as Nipponbare under LD and natural 
field conditions [36]. High-resolution mapping using 
2,207 segregants delimited Hd3a to a 20-kb genomic 
region, whose sequence revealed one putative gene with 
high similarity to the FLOWERING LOCUS T ( FT) 


gene, which promotes flowering in Arabidopsis. In 
this study, the nucleotide polymorphism that caused 
the allelic difference between Kasalath and Nipponbare 
Hd3a could not be clarified [37]. 

Fruit Weight and Grain Yield 

The domestication and improvement of crops has been 
accompanied by increase in size and yield of harvest- 
able products. Cultivated tomatoes ( Solarium 
lycopersicum) have hence undergone more than a 
100-fold increase in mass over their wild relatives. 
A major quantitative trail locus (QTL) for fruit weight, 
/w2.2, was mapped to the same position on 
chromosome 2 in an introgression line F2 derived 
from S. lycopersicum x Lycopersicon pennellii and 
a backcross 1 (BC1) population derived from 

S. lycopersicum x. Lycopersicon pimpinellifolium. The 
fw2.2 locus accounts for 30% and 47% of the total 
phenotypic variance in the L. pimpinellifolium and 
L. pennellii populations, respectively, indicating that 
this is a major QTL controlling fruit weight in both 
species. The small-fruit L. pennellii allele for fw2.2 is 
semidominant to the large-fruit S. lycopersicum allele 

[38] . High-resolution mapping using 3,472 F2 plants 
derived from a cross between S. lycopersicum and a NIL 
containing a small introgression from L. pennellii 
narrowed down fw2.2 to a less than 150-kb region 

[39] . A yeast artificial chromosome (YAC) containing 
fw2.2 was used to screen a cDNA library constructed 
from the small-fruit L. pennellii. Four unique tran¬ 
scripts were identified and used to screen a L. pennellii 
cosmid library, identifying four positive, 
nonoverlapping cosmids (cos50, cos62, cos69, and 
cos84), one corresponding to each unique transcript. 
These four cosmid clones were transformed into two 
tomato cultivars carrying the partially recessive large- 
fruit allele of fw2.2. R1 progeny of primary 
transformants carrying cos50, but not other cosmids, 
showed a statistically significant reduction in fruit 
weight compared to null-segregants, indicated that 
fw2.2 is contained within cos50. Sequence analysis of 
cos50 revealed two open reading frames (ORFs): one 
corresponding to cDNA44, which was used to isolate 
cos50, and another 663-nucleotide (nt) gene, ORFX , 
for which no corresponding transcript was detected in 
the initial cDNA library screen. Analysis on a single 
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recombination event used in the previous mapping 
showed that ORFX is the likely cause of the fw2.2 
QTL phenotype. Semiquantitative reverse transcrip¬ 
tase-polymerase chain reaction (RT-PCR) analysis 
showed that the relative level of the ORFX transcript 
in the carpels of the small-fruited NIL was significantly 
higher than in the large-fruited NIL. Analysis of the 
predicted amino acid sequence of ORFX indicated that 
it is a soluble protein with alpha/beta-type secondary 
structure, and has a structural similarity to the human 
oncogene c-H-ras p21. Sequence comparison of 
L. pennellii and S. lycopersicum ORFX regions indicated 
that the fw2.2 phenotype is probably not caused by 
differences within the coding region of ORFX , but by 
one or more changes upstream in the promoter region 
of ORFX [40]. 

Grain yield is a complex crop trait determined 
mainly by the three component traits, number of pan¬ 
icles, number of grains per panicle, and grain weight; all 
of which are typical quantitative traits. In rice, the 
developments in genome mapping, sequencing, and 
functional genomic research have provided the neces¬ 
sary tools for dissecting the genetic and molecular bases 
of these quantitative traits [41]. 

The rice grain number gene Gnl was first roughly 
mapped on the short arm of chromosome 1 using 96 F2 
individuals derived from heterozygote (Gnl/gnl) 
plants of NIL-Gnl carrying the Gnl region from culti- 
var Habataki in the Koshihikari background. Habataki 
plants have been known to produce more grains in 
their main panicle than Koshihikari plants. It was 
found that Gnl consisted of two loci, QTL-Gnla and 
QTL-Gnlb. High-resolution mapping using 13,000 F2 
plants derived from heterozygotes (Gnla/gnla) of NIL- 
Gnla narrowed down the Gnl a region into a 6.3-kb 
interval containing one reading frame with high simi¬ 
larity to cytokinin oxidase/dehydrogenase (CKX), 
named OsCKX2. Sequence comparison between 
Habataki and Koshihikari revealed several nucleotide 
changes, including a 16-bp deletion in the 5'- 
untranslated region, a 6-bp deletion in the first exon, 
and three nucleotide changes resulting in amino acid 
variation in the first and fourth exons of the Habataki 
allele. However, protein expression study in Saccharo- 
myces cerevisiae showed both Habataki and Koshihikari 
alleles of OsCKX2 encode functional enzymes. 
Sequence analysis of 5150, a rice variety from China 


producing higher grain number, detected 11-bp dele¬ 
tion in the coding region of this gene creating 
a premature stop codon. The coincidence of the 
OsCKX2 null allele and a higher grain number 
suggested that a reduction or loss of function of 
OsCKX2 enhanced grain production. Confirmation 
was done by introducing antisense strands of OsCKX2 
into an easily transformable cultivar Taichung 65 
(TC65), which possesses the Koshihikari allele of 
OsCKX2. Transgenic plants with reduced levels of 
expression developed higher grain numbers. Reverse 
Transcription (RT)-Southern blotting showed that 
the highest levels of OsCKX2 expression in inflores¬ 
cence meristems were found in Koshihikari, but were 
less abundant in Habataki and NIL-Gnl a and 
extremely low in 5150. These results suggested that 
the phenotypic differences observed might have been 
caused by differential transcription of OsCKX2 [42]. 

The rice GS3 grain weight gene was mapped to the 
pericentromeric region of chromosome 3 [43]. Near- 
isogenic lines (NILs) of GS3 were developed by succes¬ 
sive crossing and backcrossing Minghui 63 (large 
grain) with Chuan 7 (small grain), using Minghui 63 
as the recurrent parent. High-resolution mapping 
using 1,384 BC3F2 individuals with the recessive phe¬ 
notype (large grain) derived from one BC3F1 plant 
heterozygous for the GS3 region and containing the 
least genetic background from Chuan 7 delimited the 
GS3 region to 7.9 kb. A full-length cDNA was identified 
that matched well with the region, and sequence anal¬ 
ysis of the predicted GS3 protein revealed the presence 
of a combination of multiple domains. Comparative 
sequencing analysis using six cultivars including three 
with long grains and three with short to medium grains 
indicated that all the large-grain varieties tested share 
the same nonsense mutation in the second exon of the 
GS3 gene that causes a 178-aa truncation in the 
C-terminus of the predicted protein. These findings 
suggest that GS3 may function as a negative regulator 
to prevent the growth and size of the grain [44]. 

Two QTLs for grain width were mapped to the short 
arm of chromosome 2 [45, 46). Subsequently, a major 
QTL for grain width GW2, was mapped to the same 
region using an F2 population derived from a cross 
between WY3 (1,000-grain weight, 41.9 it 1.3 g) and 
Fengaizhan-1 (FAZ1) (1,000-grain weight, 17.9 it 
0.7 g). High-resolution mapping using 6,013 BC3F2 
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plants narrowed the GW2 locus to an 8.2-kb region. 
Only one predicted ORF was considered a viable can¬ 
didate for GW2 in this region [47] . Homology search in 
databases showed that GW2 encodes a previously 
unknown RING-type protein with E3 ubiquitin ligase 
activity, which is known to function in the degradation 
by the ubiquitin-proteasome pathway. Comparison of 
the nucleotide sequences of the FAZ1 and WY3 alleles 
of GW2 uncovered a 1-bp deletion resulting in 
a premature stop codon in exon 4 of the WY3 allele. 
The premature stop codon led to truncation of 310 
amino acid residues; the remaining portion of the 
protein consisted of a 115-residue polypeptide of 
~13 kDa. Sequence analysis of GW2 from Oochikara, 
another rice variety that has a wider grain width, sim¬ 
ilar to WY3, showed a nucleotide sequence identical to 
the WY3 allele. These data indicate that reduction or 
loss of function of GW2 results in increased grain 
width. Rice transformation was used to produce trans¬ 
genic plants expressing different levels of GW2. The 
transgenic plants with antisense strands of GW2 and 
with reduced levels of endogenous expression had 
a significantly wider grain width than plants containing 
the vector control. Transgenic plants overexpressing 
GW2 cDNA under the control of the 35S promoter, 
which produced high levels of expression, showed 
reduced grain width [47]. 

Disease Resistance 

Disease resistance loci (R genes) conferring resistance to 
specific races of the pathogen, carrying corresponding 
avirulence (Avr) genes, have been studied in light of the 
gene-for-gene hypothesis proposed by Flor [48]. Plants 
that contain such race-specific R-genes with resistance 
to specific pathogen races, offer an interesting system to 
study plant pathogen interactions, although the genes 
are very rarely useful in the crop as they are quickly 
overcome by the pathogen gaining virulence. 

In addition, there are non-race-specific disease 
resistance loci which provide plant resistance to 
a wider range or races of the pathogens. The first 
plant resistance gene cloned was Hml from maize, 
which belonged to this class of resistance genes, was 
isolated by transposon tagging [49]. Two RFLP probes 
were mapped to 5 centimorgan (cM) proximal and 
distal to the Hml locus, respectively, using progeny 


from the cross K61/Prl x K61 [50]. These two probes 
were used to classify segregating progeny of transposon 
insertion hml mutants to determine which HM1 alleles 
they had inherited. A transposable element probe was 
then used to identify a restriction fragment that 
cosegregated with hml mutant allele. This fragment 
was isolated and sequenced, and the DNA flanking 
the transposon insertion was used as probe in northern 
blot analysis. A 1.3-kb RNA band was detected in 
polyadenylated [poly(A) + ] RNA from the resistant 
inbred strain Prl ( Hml-Prl ), while the susceptible 
strain, K61 (hml -2), and the mutants either had no 
detectable hybridizing mRNA, or an mRNA of aberrant 
size. Difference at the transcriptional level between 
resistant and susceptible genotypes makes it unlikely 
that susceptible genotypes possess an alternative form 
of HM1 with specificity for a substrate other than HC 
toxin. A 1.6-kb cDNA clone was isolated by homology 
with the probe and sequenced. DNA sequence data¬ 
bases revealed homology between the HM1 cDNA and 
the NADPH-dependent dihydroflavonol-4-reductase 
(DFR) genes of maize, petunia, and snapdragon. This 
homology supports the prediction that HM1 encodes 
HC toxin reductase (HCTR) [49]. 

Among the race-specific R-genes, the tomato-leaf 
mold interaction has been well studied and yielded the 
first examples of cloned R-genes. The avirulence gene 
Avr9 in the leaf mold fungus Cladosporium fulvum was 
shown to specify a 28-amino acid secreted peptide that 
elicits a necrotic response when injected into tomato 
plants carrying the Cf-9 resistance gene [51]. The Cf-9 
gene was introgressed into cultivated tomato from wild 
species L. pimpinellifolium accession PI126915 [52]. 
The Cf-9 locus was mapped to the short arm of chro¬ 
mosome 1 [53], and a targeted transposon tagging 
strategy was employed to isolate the Cf-9 gene [54]. 
A tomato line homozygous for Cf-9 was crossed to 
a transgenic tomato line carrying a Ds element located 
3 cM from the Cf-9 locus, and additionally to a trans¬ 
genic tomato plant containing a stabilized Ac (s Ac) 
element. The FI plants were crossed and the progenies 
selected to produce tagging parents heterozygous for 
Ds and s Ac and homozygous for Cf-9. To tag Cf-9 the 
tagging lines were crossed as female parents to a tomato 
line homozygous for the Avr9 transgene and lacking 
Cf-9. The progeny of this cross, which were heterozy¬ 
gous for Cf-9 and Avr9, became necrotic and died 
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shortly after seed germination, but the mutants for Cf- 9 
survived. Approximately 160,000 progeny were germi¬ 
nated and 118 survivors were recovered. A total of at 
least 37 independent Ds insertions into Cf-9 were iden¬ 
tified. Specific Cf-9 primers were used in conjunction 
with Ds primers to map the Ds insertions on the basis 
of PCR product size. Twenty-eight Ds insertions were 
mapped to the same 3-kb region of the tomato genome. 
All stable mutants tested were susceptible to race 5 of 
Cladosporium fulvum , which indicates concordance 
between the loss of response to the Avr9 transgene 
and loss of resistance to a race of the fungus carrying 
Avr9. Isolation via plasmid rescue and sequencing of 
the flanking genomic sequence showed that Cf-9 
encodes a putative membrane-anchored 
extracytoplasmic glycoprotein with homology to the 
leucine-rich repeat family of proteins [54]. 

Late blight caused by the oomycete pathogen 
Phytophthora infestans , and responsible for the Irish 
Potato Famine of the mid-nineteenth century that 
induced vast migration, is still one of the most devas¬ 
tating of plant diseases causing more than $3 billion 
loss annually [55]. The P. infestans pathogen easily 
overcomes the R-genes crossed in from various wild 
relatives of potato. Diploid Solanum bulbocastanum 
from Mexico, which was not easily crossable to potato 
(Solanum tuberosum ) cultivars, had been characterized 
to have high resistance to P. infestans and was used to 
characterize the resistance gene loci. The RB (for 
“Resistance from S. bulbocastanum”) locus was 
mapped to chromosome 8 of S. bulbocastanum [56]. 
BAC walking by the reiterative screening of a BAC 
library using probes derived from the ends of previ¬ 
ously identified BAC clones, was initiated using linked 
RFLP markers and the contig subsequently used to 
develop PCR-based markers to enhance map resolution 
in the RB region. High-resolution mapping using 542 
BC2, 1,060 BC3, and 206 BC4 genotypes delimited the 
RB region to approximately 55 kb [57]. This region 
contained one truncated and four complete CC- 
NBS-LRR (coiled coil-nucleotide binding site- 
Leu-rich repeat)-class R-gene analogs (RGAs). Each of 
the four complete RGAs was amplified from 
S. bulbocastanum using Long Range-PCR and cloned 
into a binary vector for complementation studies. 
Katahdin, a late blight susceptible potato variety was 
transformed with the four complete R-gene analogs. 


Only the transgenic plants containing RGA2-PCR con¬ 
struct displayed resistance to all six isolates of 
P. infestans , including 126C18, a “super race” that over¬ 
comes all 11 major R genes identified in Solanum 
demissum , the complementation demonstrating that 
RGA2 represents the functional RB gene [58]. 

In a simultaneous effort to the analysis of the RB 
locus, the Rpi-blbl resistance locus was characterized in 
intraspecific crosses between S. bulbocastanum acces¬ 
sions segregating for resistance, and mapped closely to 
marker CT88 on chromosome 8 [59]. A BAC contig 
was isolated across the Rpi-blbl locus, and sequence 
analysis revealed 4 RGAs of the CC-NBS-LRR class. 
One of the genes termed Rpi-blbl complemented 
potato and tomato cultivars to confer resistance 
to complex races of the P. infestans pathogen. In 
subsequent analysis of resistant complex inter¬ 
specific hybrids designated ABPT derived from 
S. bulbocastanum , the Rpi-blb2 locus was identified and 
mapped to a position on chromosome 6 at a similar 
position as the tomato Mi locus (see below) conferring 
resistance to nematode, aphids, and white flies. The 
Rpi-blb2 locus harbored 15 Mi-1 gene homologues, one 
of which conferred P. infestans resistance in tomato and 
potato. The Rpi-blb2 protein shares 82% sequence 
identity to the tomato Mi-1 protein and exemplifies the 
evolution of a resistance gene cluster to confer diverse 
resistance specificity to a wide range of organisms such as 
nematodes, insects, and oomycetes [60]. 

The barley recessive mlo locus conferring resistance 
against the fungal pathogen Blumeria graminis f. sp. 
hordei (Bgh) has been mapped on chromosome 4 [61]. 
High-resolution mapping using 2,022 F2 segregants 
identified a DNA marker cosegregating with mlo and 
two flanking markers at a distance of 0.24 and 0.4 cM, 
respectively. Screening of a large insert yeast artificial 
chromosome (YAC) library identified three YAC clones 
containing the cosegregating marker and two flanking 
markers. Subcloning experiments of one YAC clone 
into bacterial artificial chromosome (BAC) vector and 
further mapping and sequencing delimited the mlo 
region to 30 kb. Sequence analysis identified one 
sequence contig of 5.8 kb, including the cosegregating 
marker, revealing an extensive region of high coding 
probability. Reverse transcriptase-PCR using a series of 
primers deduced from this region and sequencing 
revealed a single extensive open reading frame (ORF) 
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of 1,599 bp. The deduced 60 kDa protein was predicted 
to be membrane-anchored by at least six membrane- 
spanning helices. Genomic PCR-based sequencing of 
11 mutagen-induced mlo resistance alleles and their 
corresponding wild-type DNAs identified nucleotide 
alterations (point mutations or deletions) in all tested 
mutant alleles that at the amino acid level result either 
in single amino acid substitutions or truncated versions 
of the predicted wild-type protein. A comparison 
among the wild-type gene sequences of seven tested 
barley cultivars indicated not a single amino acid dif¬ 
ference. Inter-mutant crosses were performed using 
lines containing different mutant alleles. F2 seedlings 
were screened for rare disease-susceptible individuals 
after inoculation with an isolate of powdery mildew 
which is virulent on each of the parental Mlo wild-type 
cultivars. Homozygous susceptible F3 progeny were 
isolated and used for molecular analysis using RFLP 
markers tightly linked (<4 cM) on each side of the Mlo 
locus. Seven susceptible individuals exhibited flanking 
molecular marker exchange, indicating reciprocal 
crossover (CO) events. Genomic PCR-based sequenc¬ 
ing demonstrated that all seven CO type recombinants 
restored wild-type sequences [62]. 

Root-knot nematodes ( Meloidogyne spp.) are endo- 
parasites of thousands of crop species and are impor¬ 
tant pests of tomato worldwide [63]. The root-knot 
nematode resistance gene (Mi) was introduced into 
cultivated tomato, S. lycopersicum , from its wild relative 
Lycopersicon peruvianum in the early 1940s [64], and 
today it provides the only form of genetic resistance 
against this pathogen. The Mi locus was localized to the 
short arm of chromosome 6 [65] and multiple markers 
linked to Mi were identified [66-69]. In these studies, it 
was found that the nematode-resistant tomato line 
Motelle, and related lines, retained only a small 
introgressed region (650 kb) from L. peruvianum. Efforts 
to localize the Mi gene were hampered for many years 
because of the severe repression of recombination near 
this gene in Lycopersicon esculentum lines carrying 
the introgressed L. peruvianum DNA [67, 68, 70]. 
This handicap was circumvented by screening 
large populations of tomato for recombinants and by 
identifying recombinants within L. peruvianum 
populations [71]. After data from S. lycopersicum and 
L. peruvianum recombinant analyses were combined, 
Mi could be localized to a genomic region of <65 kb. 


Subcloning experiments of one YAC clone containing 
the Mi locus identified four overlapping BACs hybrid¬ 
izing to the Mz-flanking DNA probes. Large-scale 
sequencing of two BACs identified six open reading 
frames; three of these are homologous to each other 
and to previously identified R genes of the nucleotide 
binding-LRR class. Two of them, Mi-1.1 and Mi-1.2, 
appear to be intact genes; the third is a pseudogene 
predicted to not encode a functional product. Comple¬ 
mentation studies were performed by transforming 
a nematode-susceptible tomato line using constructs 
containing the Mi-1.1 and Mi-1.2 genes. Eighty-seven 
percent of transformants carrying Mi-1.2 were resistant 
to the root-knot nematode, whereas all transformants 
carrying Mi-1.1 were completely susceptible, 
confirming Mi-1.2 as the functional Mi-1 gene [72]. 

In cultivated tomato, tomato mosaic virus (ToMV) 
infections are controlled by the introgressed Tm-1 , Tm-2 , 
and Tm-2 2 resistance ( R ) genes [73, 74]. Among these 
resistances the Tm-2 2 resistance was shown to be 
remarkably durable and, therefore, of ongoing practical 
importance. The Tm-2 2 locus has been localized to 
tomato chromosome 9, but map-based cloning of this 
gene has been shown to be difficult, especially due to 
the lack of recombination in the centromeric region 
[75]. The two-component Ac/Ds transposon system 
was utilized to isolate the tomato Tm-2 2 gene. 
A tomato line (homozygous Tm-2 2 ) was crossed with 
another line (homozygous for sAc). One F2 plant from 
this cross (homozygous for both sAc and Tm-2 2 ) was 
subsequently used in a cross with a tomato line having 
Ds transposon on chromosome 9 with distance 2 cM 
from the resistance gene. About 100 independent 
plants with the genotype Ds/-; sAc/-; Tm-2 2 /Tm-2 2 
were selected and used as males and females in a cross 
with a transgenic tomato line which is homozygous for 
the ToMV MP (Movement Protein) gene for a large- 
scale tagging experiment. Previous observation had 
shown a lethal combination of the Tm-2 2 gene and 
the ToMV MP transgene in the seedling stage of tomato 
plants. From about 30,000 seeds obtained from these 
crosses, five putative mutants were obtained. All 
contained the Ds element and the MP gene and only 
one mutant plant still contained sAc. To test whether 
these five putative mutants were really mutants in the 
Tm-2 2 gene, cuttings of these plants were inoculated 
with ToMV. All the five putative mutants were 
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susceptible to ToMV infection. Plasmid rescue was 
used to obtain transposon-flanking plant DNA from 
the mutant plants. Sequencing of 9.8 kb of rescued 
plant DNA identified an open reading frame (ORF) 
of 2,586 bp, which corresponds to a polypeptide of 
861 amino acids. In silico analysis of the ORF did not 
predict the presence of introns. The polypeptide 
encoded by the 2,586-bp ORF shows the characteristics 
of the CC-NBS-LRR class of R proteins, and the 5'- 
and 3'-RACE experiments confirmed the prediction of 
the gene structure. Functional complementation was 
performed by introducing either Tm-2 2 gene under the 
control of its own promoter and polyadenylation signal 
or Tm-2 2 gene under the control of the CaMV 35S 
promoter and the NOS polyadenylation signal into 
a susceptible tomato line. The transgenic plants of the 
two constructs were resistant to ToMV [76]. 

Plant and Inflorescence Architecture 

Maize Domestication Traits Most crop plants differ 
considerably from their wild ancestors in major mor¬ 
phological phenotypes that have probably resulted 
from visual selection of desired plant type. This is 
most dramatic in maize, where the major difference 
between maize and its probable wild ancestor teosinte 
is that teosinte typically has long branches with tassels 
at their tips whereas maize possesses short branches 
tipped by ears [77]. Most of the variation for the dra¬ 
matic differences in inflorescence morphology between 
maize and teosinte is explained by five quantitative trait 
loci (QTLs) [78]. Complementation tests indicated 
that one of these QTLs, which is on chromosome arm 
1L, is the locus for the maize mutant teosinte branched l 
(tbl) [79]. To isolate the tbl gene, homozygous tbl-ref 
(a spontaneous mutant in a maize population) plants 
were crossed to an active Mu transposon line, and 
26,000 FI plants were grown. Among these, three new 
tbl mutants ( tbl-muml , tbl-mum2 , tbl-mum3) were 
observed. Each new mutant was crossed to maize 
inbred A532, and 14 progeny from each of these crosses 
were used for Southern blot analysis with marker 
loci that closely flank tbl to discriminate progeny that 
possessed the tbl-ref versus the new Mu alleles. 
The progeny were also screened by Southern blot 
analysis with Mu element probes, and a Mu3 element 
that cosegregated with tbl-muml was identified. 


Overlapping genomic restriction fragments carrying 
the cosegregating Mu3 element were cloned into X 
vectors. Subclones of these were used for Southern 
blot analyses of the 20 progeny of each of the three 
mutant plants, showing that each of these mutants has 
an insertion into this region of the genome that did not 
exist in their sibs and the progenitor stocks. These 
observations provided the crucial evidence that the 
tbl has been newly tagged. Portions of the X clones 
were sequenced and oligonucleotide primers were 
designed to be used in conjunction with a Mu- specific 
primer to amplify the insertion fragments for each of 
the three Mu alleles. DNA sequence analysis of these 
fragments identified the Mu insertion point of each 
allele. A 0.8-kb restriction fragment flanking the Mu 
element was used to screen a cDNA library derived 
from immature ear. A single clone with a 1,360-bp 
insert, excluding the poly (A) tail was obtained. The 
cDNA sequence is fully collinear with the genomic 
sequence of maize inbred A619, without any evidence 
for introns. BLAST analysis with the sequence showed 
that tbl shares two short regions of homology with the 
snapdragon cycloidea gene. The pattern of tbl expres¬ 
sion and the morphology of tbl mutant plants suggest 
that tbl acts both to repress the growth of axillary 
organs and to enable the formation of female inflores¬ 
cence. The maize allele of tbl is expressed at twice the 
level of the teosinte allele, suggesting that gene regula¬ 
tory changes underlie the evolutionary divergence of 
maize from teosinte [80]. Analysis of nucleotide poly¬ 
morphism in tbl gene of maize and teosinte 
populations showed that during maize domestication 
the effects of selection were limited to the gene’s regu¬ 
latory region and could not be detected in the protein¬ 
coding region [81]. Fine mapping showed that the 
intergenic sequences approximately 58-69 kb 5' to the 
tbl cDNA confer pleiotropic effects on Zea mays mor¬ 
phology. Moreover, an allele-specific expression assay 
showed that sequences >41 kb upstream of tbl act in 
cis to alter tbl transcription [82]. 

The most critical step in maize ( Zea mays ssp. mays) 
domestication was the liberation of the kernel from the 
hardened, protective casing that envelops the kernel in 
the maize progenitor, teosinte [83]. This evolutionary 
step exposed the kernel on the surface of the ear, such 
that it could readily be used by humans as a food 
source. A large effect QTL for glume induration 
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(husk hardening) was mapped to chromosome 4 using 
two F2 populations derived from crosses between maize 
race Reventador (Nay 15) and Balsas teosinte, Z mays 
ssp. parviglumis or between maize race Chapalote and 
Z mays ssp. mexicana [84]. This large-effect QTL seg¬ 
regates as a single Mendelian locus in an isogenic back¬ 
ground, and has been designated tgal [83]. Maize 
Mo 17 bacterial artificial chromosome (BAC) libraries 
were screened using a marker on maize chromosome 4, 
which is tightly linked to tgal and a BAC contig near 
tgal was identified. The BAC end and other sequences 
from this contig were used to BLAST search the rice 
genome sequence, and a region on rice chromosome 
8 that is collinear with the region near tgal on maize 
chromosome 4 was identified. Subsequent BLAST 
searches using the collinear rice sequence identified 
a second maize Mo 17 contig near tgal. Marker analysis 
showed that these 2 contigs flank tgal. Maize B73 
contigs that correspond to the two Mo 17 contigs were 
subsequently identified. DNA sequence analysis 
revealed that the two B73 contigs overlap, and could 
be merged to a single supercontig of ~ 1.5 megabases. 
Fine mapping using 3,106 F2 plants delimited tgal to 
a 6-kb region. Further BLAST searches using the 6-kb 
region at tgal revealed that it has homology to SBP 
(squamosa-promoter binding protein) transcriptional 
regulators. DNA sequence analysis of the SPB gene for 
the tgal-emsl stock, another tgal allele generated by 
ethyl methanesulfonate mutagenesis of maize line W22 
that matched the phenotype of the teosinte allele in 
homozygous state, revealed that it differs from its 
parental (W22) allele by a nonconservative amino 
acid substitution of a phenylalanine for a leucine at 
position 5. This mutation in the tgal-emsl allele con¬ 
firmed the conclusion from positional cloning that tgal 
is the SBP gene, and demonstrated that a single amino 
acid substitution was sufficient to confer the difference 
between the maize and teosinte phenotypes. Northern 
blots, real-time PCR, and in situ hybridization did not 
show any quantitative or qualitative differences in tgal 
expression between the isogenic lines (W22 and W22: 
tgal) or between the maize inbred W22 and teosinte 
itself, suggesting that differences between the maize and 
teosinte proteins may be critical to phenotype. Further 
mapping using seven additional recombination events 
within the 6-kb region narrowed the location of the 
causative site for the functional difference between 


maize and teosinte phenotypes to a 1,042-bp segment. 
DNA sequence analysis of this 1,042-bp segment using 
16 diverse maize and 12 teosinte individuals identified 
seven fixed differences between maize and teosinte. Six 
of these seven are single base-pair polymorphisms that 
lie just 5' of the coding sequence and potentially affect 
tgal expression. The seventh difference encodes an 
amino acid substitution of lysine (K) in teosinte to 
asparagine (N) in maize at position 6. Western blot 
analysis showed that tgal protein encoded by the teo¬ 
sinte allele is more abundant than the one encoded by 
the maize allele over a range of developmental stages 
that might underlie the phenotypic differences. 
The K —> N substitution might alter protein stability, 
or it might affect translation efficiency or protein 
function [85]. 

Wheat Inflorescence In wheat, the Q allele confers 
the square-headed phenotype and free-threshing char¬ 
acter, and is possessed by most of the cultivated wheats, 
but most wild wheats have the q allele and, therefore, 
speltoid spikes that are not free threshing [86]. Early 
experiments involving the cytogenetic analysis of aneu- 
ploid (abnormal chromosome number) plants located 
the Q gene on the long arm of chromosome 5A [87, 88] . 
Using chromosome deletion lines, Endo and Gill 
(1996) physically mapped the Q gene to 
a submicroscopic deletion interval on the long arm of 
chromosome 5A [89]. Comparative mapping of anon¬ 
ymous RFLP clones, AFLP (amplified fragment length 
polymorphism) markers, and mRNA differential dis¬ 
play analysis of lines which have deletion break points 
that flank the Q locus identified 18 markers within the 
Q gene deletion interval. These markers were used to 
construct a genetic linkage map of the region in F2 
populations derived from chromosome 5A disomic 
(homologous chromosome pair) substitution lines. 
The genetic map corresponding to the deletion 
segment was 20-cM long, and markers as close as 
0.7 cM to the Q gene were identified [90]. The closest 
marker to the Q gene was used to screen four high- 
density Triticum monococcum BAC filters resulting in 
the identification of one BAC clone. Chromosome 
walking involved the reiterative screening of the BAC 
library using probes derived from the ends of previ¬ 
ously identified BAC clones. A BAC contig was then 
constructed that spans a physical distance of 300 kb 
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corresponding to a genetic distance of 0.9 cM. The 
physical map of T. monococcum had perfect colinearity 
with the genetic map of wheat chromosome arm 5AL. 
Analysis of fast neutron q mutants using markers 
derived from the BAC contig at the Q locus confirmed 
that the T. monococcum BAC contig spanned the Q 
locus and narrowed the region for prospective Q gene 
candidates to a 100-kb segment, which contains an 
APETALA2 (AP2)-like gene that cosegregates with Q 
[91]. Sequence analysis showed that this AP2 gene 
consists of ten exons and nine introns. The M2 gener¬ 
ation of a population of Triticum aestivum cv. Chinese 
Spring (CS) EMS mutants was screened for the speltoid 
phenotype to identify putative knockouts. Three 
speltoid mutants harboring point mutations within 
the AP2 gene were identified. One mutant had 
a single base substitution in an AP2 DNA binding 
domain (exon 5) that resulted in the change of 
a cysteine to a tyrosine. Two other mutants had point 
mutations in the donor site of intron 2 and the acceptor 
site of intron 7, respectively, resulting in alternate splic¬ 
ing in these mutants. The sequence analysis of the 
mutants validated that the AP2 gene is the Q locus. 
Comparison of the genomic sequences of the AP2 gene 
revealed six conserved differences between Q- and 
^-containing genotypes. Four of these differences, 
including a variable microsatellite (DNA sequence 
repeat), were present in introns and one was in the 
3' UTR. One conserved nucleotide difference changed 
a predicted amino acid where, at position 329, all 
Q-containing genotypes possessed an isoleucine while 
all ^-containing genotypes possessed a valine. Yeast 
two-hybrid analysis showed that the full-length 
Q protein has the ability to form a homodimer, 
whereas the point mutation of q greatly reduced 
homodimer formation of the full-length q protein. 
Rachis fragility, glume shape, and glume tenacity mim¬ 
icked the q phenotype in transgenic plants exhibiting 
posttranscriptional silencing of the transgene and the 
endogenous Q gene. Variation in spike compactness 
and plant height were associated with the level of trans¬ 
gene transcription due to the dosage effects of Q [92]. 

Barley Inflorescence Barley spikes have a unique 
structure consisting of three one-flowered spikelets at 
each spike node. Cultivated barley, Hordeum vulgare 
subsp. vulgare , can be divided into two forms according 


to the morphology of the spikelets: two-rowed and six- 
rowed. The two-rowed condition is believed to be 
primitive, because wild barley, Hordeum vulgare 
subsp. spontaneum , is two-rowed. Row type is con¬ 
trolled by multiple alleles at the vrsl locus (formerly v 
for vulgare) on chromosome 2H [93, 94]. A recessive 
mutation from Vrsl to vrsl changes two-rowed barley 
to six-rowed barley. High-resolution mapping using 
373 BC7F1 plants and 278 BC6F2 plants identified 
four RFLP-derived STS markers closely linked to the 
vrsl locus [95]. The orders of four marker loci and vrsl 
locus were the same in six different mapping 
populations developed from nine different barley cul- 
tivars ( H . vulgare subsp. vulgare) or mutant and wild 
barley ( H . vulgare subsp. spontaneum) [96]. Five AFFP 
markers within the 0.9-cM region associated with the 
vrsl locus were subsequently developed using well- 
characterized near-isogenic lines as plant materials 
[97]. Recombinants within the 0.9-cM region were 
analyzed further using STS markers generated from 
ESTs (expressed sequence tags) and BAC DNA 
sequences. PCR screening of BAC clones of cv. Morex 
using an STS marker located 0.01 cM proximal to the 
vrsl locus isolated one BAC clone. Chromosome walk¬ 
ing identified six bacterial artificial chromosome 
(BAC) clones covering completely the candidate geno¬ 
mic region. The contig containing vrsl is composed of 
518,343 bp. Annotation showed three predicted genes: 
HvHoxl and HvEP2 appeared to be intact genes, 
whereas HvEPl is highly degenerated and interrupted 
by several insertions of transposable elements. HvHoxl 
is the only gene that lies between two markers that 
define the break points and is thus a likely candidate 
for Vrsl. The ORF of two-rowed barley encoded 
a polypeptide composed of 222 amino acid residues, 
including a homeodomain-leucine zipper (HD-ZIP) 
motif. Expression of Vrsl was strictly localized in the 
lateral-spikelet primordia of immature spikes, 
suggesting that the VRS1 protein suppresses develop¬ 
ment of the lateral rows. Foss of function of Vrsl 
resulted in complete conversion of the rudimentary 
lateral spikelets in two-rowed barley into fully devel¬ 
oped fertile spikelets in the six-rowed phenotype [98]. 

The wild progenitor of barley, H. vulgare subsp. 
spontaneum , has covered (hulled) caryopses in which 
the hull (outer lemma and inner palea) is firmly adher¬ 
ent to the pericarp epidermis at maturity. In cultivated 
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barley, the hulled or naked caryopsis is one of the most 
important agronomic traits because of the direct link to 
its use. Most cultivars have the hulled caryopsis and are 
mainly used for animal feed and brewing malts. In 
contrast, naked barley has a caryopsis with easily sep¬ 
arable husks upon threshing and is suitable for edible 
purposes. The naked caryopsis is considered a key 
domestication character in barley because extensive 
pearling to remove the hull is unnecessary [99]. The 
covered/naked caryopsis in barley is controlled by 
a single locus ( nud , for nudum) located on chromo¬ 
some arm 7HL [100]; the covered caryopsis allele 
(Nud) is dominant over the naked one (nud). Bulked 
segregant analysis on an F2 population derived from 
a cross between Kobinkatagi (naked type) and Triumph 
(hulled type) was performed using 1,894 primer com¬ 
binations, and 12 AFLP markers were selected. Among 
them, five closely linked and two cosegregating AFLP 
markers were mapped around the nud locus using 151 
F2 individuals [101]. High-resolution mapping using 
2,380 segregants derived from five cross-combinations 
identified AFLP-derived markers flanking the nud 
locus at the 0.6-cM proximal and the 0.06-cM distal 
side, respectively [102]. Further mapping using 2,828 
progeny segregating for the trait from two cross¬ 
combinations delimited the nud locus to a 0.64 cM 
interval. Integration of the flanking markers into 
a high-density barley expressed sequence tag (EST) 
map selected two barley ESTs flanking the nud locus. 
BLASTN analysis identified their respective homolo¬ 
gous rice ESTs 370 kb apart on rice chromosome arm 
6L. Two rice genes within the collinear region were used 
as vehicles to develop closer barley markers. A BAC 
library of the covered barley cultivar Haruna Nijo was 
screened using the closest marker to the nud locus. 
Seven rounds of chromosome walks selected 
20 BAC clones and a 500 kb-contig spanning the nud 
locus was constructed. In the physical map, the 
nud locus was covered completely with four 
overlapping BAC clones. An ethylene response factor 
(ERF) family transcription factor was the only gene 
that lies in the region delimited by the genetic and 
physical mapping and, therefore, is considered as 
a Nud candidate gene [103]. Sequencing of the nud 
region obtained from two naked lines [Kobinkatagi 
(a Japanese landrace) and nud -Bowman (an isogenic 
line carrying the nud allele in the genetic background of 


the covered cultivar Bowman)] revealed a deletion of 
16,680 bp relative to the corresponding region of the 
Haruna Nijo BAC contig sequence. The 16,680-bp 
deletion included the entire ERF gene. Thus, the gene 
structure analysis of naked cultivars supports the can¬ 
didacy of the ERF gene. Sequence analysis of the can¬ 
didate gene in two X-ray-induced naked mutants 
showed that each of the two mutants carried 
a different single base mutation in the putative func¬ 
tional motif of the ERF gene, but their wild-type vari¬ 
eties (Haisa’s and Ackermanns Donaria, respectively) 
had a nucleotide sequence that is identical to that of 
Haruna Nijo. RNA in situ hybridization using the anti- 
sense probe revealed that, in Bowman, Nud was 
expressed strictly in the testa where adherence occurs, 
while no signal above background was detected in 
nud- Bowman. The Nud gene has homology to the 
Arabidopsis WIN1/SHN1 transcription factor gene, 
whose deduced function is control of cuticular wax and 
cutin-related lipid biosynthesis pathways [104-106]. 
Staining with a lipophilic dye (Sudan black B) detected 
a lipid layer on the pericarp epidermis only in covered 
barley [103]. 

Rice Plant Architecture Tillering in rice (Oryza 
sativa L.) is an important agronomic trait for grain 
production. Screening of a collection derived from 
spontaneous mutations identified monoculm 1 (mod) 
mutant. The mod plants nearly completely lose their 
tillering ability, producing only one main culm, in 
contrast to the multiple tillers in wild-type plants. 
Genetic analysis with reciprocal crosses between mod 
and wild-type plants revealed that mod is a single locus 
mutation. The MOC1 locus was mapped to the long 
arm of chromosome 6 of O. sativa using 280 F2 mutant 
plants generated from the crosses between mod and 
Minghui 63. Fine mapping using 2,010 F2 mutant 
plants and newly developed molecular markers 
delimited the MOC1 locus to a 20-kb region. Annota¬ 
tion of the 20-kb sequence identified an ORF that 
encodes a protein highly homologous (44% identity) 
to the tomato LATERAL SUPPRESSOR (LS). The 
corresponding ORF from mod and wild-type plants 
were amplified by PCR and sequenced. DNA sequence 
comparison revealed a 1.9-kb retrotransposon inserted 
in this ORF in the mod mutant. Transformation of 
a binary plasmid carrying a 3.2-kb wild-type genomic 
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fragment containing the entire ORF plus a 1.5-kb 
upstream sequence and the 316-bp downstream 
sequence, but not the one carrying a 3' truncated 
MOC1 gene, was able to rescue the monoculm pheno¬ 
type of the mod mutant [107]. 

Typical common wild rice ( Oryza rufipogon ) tends 
to have a prostrate growth habit during the vegetative 
phase and develop erect panicle-bearing stalks during 
the reproductive phase. Cultivated rice has an erect- 
growth habit throughout the entire growth phase, 
which may increase plant density, enhance photosyn¬ 
thesis efficiency, and improve grain yield. A set of 
introgression lines was constructed using an accession 
of Yuanjiang common wild rice (YJCWR, Oryza 
rufipogon) with prostrate growth habit as a donor, 
and an elite indica cultivar Teqing (O. sativa) with 
erect-growth habit as a recipient [108]. One introgres¬ 
sion line (YIL18) displaying prostrate growth was 
obtained, which harbored two YJCWR chromosomal 
segments on the long arm of chromosome 3 and the 
short arm of chromosome 7. The tiller angle of YIL18 
was larger than that of Teqing. The grain number on 
the main panicle (GNP) in YIL18 was only 57.6% of 
that in the recipient Teqing, a result of the lesser num¬ 
ber of primary branches and secondary branches on the 
main panicle. Genetic linkage analysis within 246 F2 
individuals derived from the cross between YIL18 and 
Teqing showed that prostrate growth was completely 
associated with a marked decrease of GNP and con¬ 
trolled by a single semidominant gene, PROG1 
(PROSTATE GROWTH I), located on short arm of 
chromosome 7. High-resolution mapping using 3,600 
recessive homozygote plants with erect growth from 
the F2 population delimited progl within an 8.8-kb 
region. Only one ORF was identified within this region 
that encodes a putative single Cys2-His2 zinc-finger 
protein. Transformation of a binary plasmid carrying 
the entire O. rufipogon PROG1 with 596-bp or 2,914-bp 
5'-flanking regions, but not the one carrying only the 
2,914-bp 5'-flanking region, showed complementation 
of the prostrate growth phenotype. Comparison of the 
coding sequences of PROG1 in YJCWR and progl in 
Teqing showed 15 SNPs and 6 insertion/deletions 
(indels) that encoded 23 amino acid changes between 
PROG1 and progl. Sequencing of the progl coding 
regions of 182 erect-growth varieties of cultivated rice, 
including 87 indica and 95 japonica cultivars from 


17 countries showed that all the cultivars contained 
identical mutations as progl in Teqing, including 15 
SNPs and 6 indels [109]. 

Cereal Seed Quality and Color 

Cereal grain is the staple food of most of the world, 
representing the major carbohydrate energy source in 
the diet. One of the most important crop traits is grain 
quality, which determines the price and variety of food 
that can be prepared from the cereal crop. Thus the 
major cereal crops such as rice, wheat, maize, barley, 
and oats have undergone selection from the onset of 
the crop domestication process. In comparison to 
other crop traits, seed quality and color are primarily 
biochemical traits and have been studied at the protein, 
gene, and metabolite or grain component level. 

Maize Lysine Content The protein nutritional qual¬ 
ity of maize is improved by increase in the essential 
amino acid lysine in the opaque-2 ( o2 ) mutant locus, 
which was localized on the short arm of chromosome 7 
[110]. To isolate the gene by transposon tagging, nor¬ 
mal maize strains (02/02) carrying a mutable allele of 
Cl containing an autonomous Spm ( cl-m5 ) or 
a mutable allele of Wx containing a nonautonomous, 
defective derivative of Spm ( dSpm) ( wx-m8 ) were used 
as pollen donors for opaque plants ( o2/o2 ). Three 
opaque-mutable o2/o2-m kernels were selected from 
approximately 530,000 FI seeds. These kernels were 
grown to maturity and self-pollinated. DNAs prepared 
from leaf samples of F2 individual plants were used for 
Southern blot analysis using a Spra-specific probe. The 
opaque-mutable plants contained a novel 8.4-kb band 
absent in both parents and missing from at least some 
of the individuals that had been classified as opaque. 
Those kernels that had been classified as opaque but 
possessed the fragment were evidently not mutable 
because of failure of the Spm to transpose, or else 
transposition occurred so late in endosperm develop¬ 
ment that revertant sectors were undetectable, giving 
the kernels an opaque rather than an obvious opaque- 
mutable phenotype. The fragment was cloned and 
restriction enzyme mapping of the clone revealed the 
presence of a full-length, autonomous 8.3 kb Spm and 
an adjacent sequence of about 150 bp. This non- Spm 
region was sequenced and used as a probe on Southern 
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blot using a BC1 population derived from a cross 
between opaque ( o2/o2 ) and normal (02/02) plants. 
All the plants derived from opaque ( o2/o2 ) seeds 
showed a single 6.5-kb fragment, whereas plants from 
seeds with a normal phenotype (o2/02) had a 10-kb 
fragment in addition to the 6.5-kb fragment. This 
demonstrates that the 150-bp Spm-fanking sequence 
derived from the opaque-mutable plant represents 
a single-copy sequence that cosegregates with alleles 
of the o2 locus. This fragment was subsequently used 
as a probe to clone a 17-kb HmdIII fragment of the 
wild-type 02 allele from the cl-m5 parent [111]. 
A cDNA library prepared from wild-type endosperms 
was screened with probes derived from the 02 genomic 
clone. Analysis of 02 cDNA sequence showed that the 
deduced 02 protein sequence contains a “leucine-zip¬ 
per” domain characteristic of some mammalian and 
fungal transcription activation factors. DNA binding 
assays demonstrated that the 02 protein or only 
a fragment specifying the leucine-zipper domain 
bound to two specific regions on the 5' side of the 
coding sequence in a zein genomic clone [112]. 

Cereal Amylose Content The maize Waxy locus 
determines starch quality by the level of amylose and 
amylopectin. Isolation of the gene was carried out by 
a combination of identification of the protein and the 
gene associated with the kernel mutant phenotype. The 
mutant wx-m6 and wx-m9 alleles have Ds transposon 
inserts at the Wx locus, while the wx-m8 allele is attrib¬ 
utable to insertion of a defective transposon of the Spm 
transposon family. Starch granules from immature 
endosperm tissue of kernels carrying the wx-m6 , wx- 
m9 y and wx-m8 alleles were isolated and the starch 
granule-bound proteins were analyzed on SDS- 
polyacrylamide gels [113]. When the kernels did not 
contain an autonomous controlling element (either Ac 
or Spm ) and showed no somatic reversion of the 
wx-m6 , wx-m9 , or wx-m8 mutations, the 58 kd protein 
was either not detectable or present at a very low level. 
When the Spm element was present with the wx-m8 
allele, sectors of endosperm tissue showed the Wx phe¬ 
notype and a 58 kd protein was present in isolated 
starch granules. By contrast, when Ac was present 
together with the wx-m6 allele, endosperm tissue 
showed sectors that were intermediate between the 
Wx and the wx phenotypes and a novel 60 kd protein 


was detectable on starch granules. Hence, reversion of 
the wx-m6 mutation gives a protein that is structurally 
abnormal and appears to have an altered enzymatic 
activity. Only the 58 kd protein is present in starch 
granules from kernels that are heterozygous for the 
wx-m6 and wx-m8 alleles, but contain only the Spm 
element. Therefore, the structurally altered protein 
detected upon reversion of the wx-m6 allele in the 
presence of Ac is not detected in the presence of Spm. 
Were the Wx locus a regulatory locus, both the 58 kd 
and the 60 kd proteins would be present in a kernel that 
is heterozygous for the wx-m6 and wx-m8 mutations 
and contains the Spm element. It was concluded that 
the Wx locus is the structural locus for the 58 kd 
protein [113]. 

Poly(A) + RNA purified from Wx and wx endo¬ 
sperms was translated in a rabbit reticulocyte in vitro 
translation system, and an antigenically related poly¬ 
peptide was identified using antiserum raised against 
the Wx protein. No major 58 kd polypeptide was evi¬ 
dent among the translation products, but a major 65 kd 
polypeptide was translated from Wx , and not from wx 
poly(A) + RNA. The 65 kd in vitro translation product 
may be a precursor that is synthesized, but not 
processed in the rabbit reticulocyte extract. Poly(A) + 
mRNA fractions substantially enriched for Wx mRNA 
were used to construct cDNA clones. Screening of 
cDNA clones was performed by selective hybridization 
to the mRNA sequence encoding the immunopre- 
cipitable 65 kd Wx polypeptide. Further testing showed 
that the Wx cDNA hybridized to an abundant 2.3-kb 
poly(A) + RNA present in immature endosperm of 
plants homozygous for the Wx allele, but absent from 
immature endosperm tissue of a plant homozygous for 
a stable recessive wx allele. Finally, the Wx cDNA was 
shown to hybridize to a unique sequence in maize 
genomic DNA. That the unique sequence corresponds 
to the Wx locus was deduced from the results of hybrid¬ 
ization analyses of DNA isolated from maize strains 
with and without a Ds insertion mutation at the Wx 
locus. The presence of the insertion at the Wx locus was 
correlated with the presence of a 2.4-kb insertion in the 
sequence homologous to the cloned Wx cDNA, 
unequivocally establishing the homology of the cDNA 
to the Wx locus [113]. Sequence analysis of genomic 
and cDNA clones of the wild-type Wx gene showed that 
the coding region comprises 3,718 bp and is composed 
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of 14 exons and 13 small introns. N-terminal sequenc¬ 
ing of the mature Wx protein led to the identification 
of a maize amyloplast-specific transit peptide of 72 
amino acid residues [114]. 

The Wx gene homologues were isolated from 
a number of species by isolating homologous clones 
or identification of DNA/protein sequences homolo¬ 
gous to the maize Wx gene/protein. A rice genomic 
library was constructed in EEMBL-3 vector and 
screened using the maize Waxy gene DNA as probe. 
Two overlapping genomic clones containing the Wx 
gene sequence were identified. Sequencing and align¬ 
ment with Wx gene of maize and barley revealed that 
the Wx gene in rice contains 13 introns and 14 exons. 
The full-length of rice waxy preprotein is 609 amino 
acid residues [115]. An analysis of Wx transcripts, Wx 
protein, and amylose content of 31 rice cultivars 
revealed that endosperm amylose and Wx protein con¬ 
tents are correlated with the ability of the cultivar to 
excise intron I from the leader sequence of the Wx 
transcript [116]. Sticky or glutinous rice quality is 
related to the low amount of amylose due to the mutant 
wx allele. 

Sweet Corn The maize sh2 mutant used in 
“supersweet” corn has negligible starch. The Sh2 gene 
was predicted to be involved in carbohydrate metabo¬ 
lism, since the mutant sh2 endosperm contains reduced 
starch. Candidate Sh2 clones were identified from 
a cDNA library prepared from endosperm by differen¬ 
tial screening with labeled cDNA of near-isogenic W64 
Sh2 and W64 sh2 mRNA. The 1.3-kb cDNA clone 
pES6-66 hybridized to wild-type cDNA and not sh2 
cDNA. Clone pES6-66 was subsequently shown to 
hybridize to a restriction fragment length polymor¬ 
phism (RFLP) associated with normal Sh2 function in 
F2 segregants of Sh2 x sh2 crosses in three backgrounds 

[117] . To determine whether this clone was indeed Sh2 
or simply a closely linked gene, a series of maize stocks 
that differed only at the Sh2 locus were used, including 
a wild-type Sh2 allele (progenitor), sh2-ml (mutable) 
allele containing Ds at the sh2 locus, and Sh2 revertants 

[118] . Southern blot analysis using 500-bp fragment 
derived from pES6-66 as probe showed that sh2-ml 
yielded a fragment that is approximately 1,600 bp 
larger than that seen in the progenitor or revertant. 
The data are compatible with sh2-ml containing 


a 1,600-bp insertion. RNA gel blot analysis indicated 
that the size of the wild-type transcript is approxi¬ 
mately 2.0-2.2 kb. A cDNA library constructed from 
poly A + RNA of shrunken-1 , bronze-mutable4 (shl bz- 
m4) was screened using probes derived from pES6-66. 
Eight of the 14 clones isolated had the 1.95-kb insert 
and were almost full length. The 900-bp and 1,050-bp 
EcoRl fragments from one of these clones were 
subcloned and sequenced. Examination of the 1.95-kb 
cDNA sequence identified only one open reading frame 
(ORF) coding for 542 amino acids and making up 
a polypeptide of 59,845 Da. Homology search 
identified the Escherichia coli glucose-l-phosphate 
adenyltransferase (ADP-glucose pyrophosphorylase or 
ADP-glucose synthetase, EC 2.7.7.27) was the only 
protein having significant similarity to the amino acid 
sequence of the Sh2 cDNA [118]. 

The maize sugary 1 (sul ) mutant accumulates twice 
as much sugar as sweet corn, having reduced starch. 
Mutations at the sul locus were generated by crossing 
active Mutator (Mu) plants with standard lines. Mutant 
alleles were identified following the self-pollinations of 
the FI plants. Normal and sugary sibling kernels from 
a population segregating 1:1 for the nonmutant allele 
Sul and sul-R4582::Mul were germinated, and geno¬ 
mic DNA used for Southern blot analysis using the 
Mul fragment as probe. A 4.0-kb EcoRl fragment 
containing sequences homologous to the transposon 
Mul that cosegregated with sul-R4582::Mul was iden¬ 
tified. This 4.0-kb EcoRl fragment was subsequently 
cloned by screening a BAC library, based on hybridiza¬ 
tion to a probe internal to Mul , and the genomic DNA 
insert was subcloned as part of plasmid pMJ60. The 
nucleotide sequence of both termini of the transposon 
were determined and found to match the known 
sequence of Mul A 1.0-kb BamHl-EcoRl genomic frag¬ 
ment flanking Mul in the cloned DNA was purified and 
used as a hybridization probe (termed BE1000). This 
genomic probe detected a 4.0-kb EcoRl fragment that 
was present in all sul-R4582::Mul/sul-Ref plants but 
was missing from all Sul/sul-Ref plants. Thus, the 
cloned Mu transposon is the same element that is 
within or tightly linked to the sul gene locus. Probe 
BE 1000 identified a second EcoRl fragment of 2.6 kb 
that was present in all plants examined, and represen¬ 
tative of the nonmutant progenitor allele. Presumably, 
insertion of the 1.4-kb transposon Mul within this 
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2.6-kb region resulted in the 4.0-kb EcoRl fragment 
that cosegregated with sul-R4582::Mul. Mutations of 
the sul gene locus other than sul-R4582::Mul were 
analyzed to determine whether they also cosegregated 
with physical alterations in the cloned region of the 
genome. Populations segregating for the nonmutant 
allele Sul and either sul-R2412 , sul-R7110 , or sul- 
R3162 was digested with EcoRl and probed in DNA 
gel blot analysis with genomic fragment BE 1000. In the 
sul-R7110 and sul-R3162 families, the 4.0-kb EcoRl 
fragment was present in all the seedlings grown from 
sugary kernels but was not observed in any seedlings 
grown from nonmutant sibling kernels. A different 
EcoRl fragment, 4.6 kb in length, was found to 
cosegregate with sul-R2412. The mutation sul-R2412 
is likely to have occurred via insertion of a 2.0-kb 
element into this same region. Approximately 200,000 
lambda clones from a maize endosperm cDNA library 
were screened with genomic probe BE 1000. DNA from 
eight different hybridizing clones was digested with 
EcoRl; the largest cDNA insert was 2.4 kb in length. 
Rapid amplification of cDNA ends (RACE) protocol 
was used to obtain the 5' end of the sul cDNA. Sequenc¬ 
ing of sul cDNA and BLAST search of the deducted 
amino acid sequence showed that sul cDNA specified a 
polypeptide of at least 742 amino acids, which is highly 
similar in amino acid sequence to bacterial enzymes that 
hydrolyze a-(l —> 6) glucosyl linkages of starch [119]. 

Rice Seed Color Red pericarp is ubiquitous among 
the wild ancestors of cultivated rice ( Oryza rufipogon ), 
in which it is closely associated with seed shattering and 
dormancy. On the other hand, most rice ( Oryza sativa) 
that is grown and consumed throughout the world has 
white pericarp, showing that white pericarp is associ¬ 
ated with domestication and remains under strong 
selection in most rice breeding programs today. 
A QTL associated with red pericarp, rg7.1 , was mapped 
on chromosome 7 using two independent BC2 
populations derived from crosses between an accession 
of O. rufipogon (IRGC-105,491) from Malaysia and, in 
one case, a US tropical japonica cultivar, Jefferson, and 
in the other case, a widely planted tropical indica cul¬ 
tivar, IR64. The log of the odds scores associated with 
the rg7.1 QTL peaks in these two populations were 99 
and 33, respectively [120, 121]. The peak of both QTLs 
corresponded to the previously mapped position of the 


mutant locus, brown pericarp, Rc [122]. Fine mapping 
using 1,410 BC2F3 plants and high-resolution map¬ 
ping using 4,000 BC2F6 plants narrowed the rg7.1 
QTL to an 18.5-kb region. Two genes encoding 
CACTA type, En/Spm subclass, transposon proteins, 
and one gene encoding bHLH protein were detected 
within the 18.5-kb target region. Sequence comparison 
of bHLH locus between cv Jefferson and H75, an Rc 
mutant stock belonging to the japonica subspecies, 
carrying a functional allele, but much more closely 
related to cv Jefferson than to O. rufipogon , showed 
that the coding sequence of the bHLH allele in H75 
was identical to the cv Jefferson sequence except for 
a 14-bp indel in exon 6. This 14-bp sequence was 
present in the H75 stock as well as in O. rufipogon , 
but was deleted in cv Jefferson and cv Nipponbare. 
The deletion induces a frame shift in the sequence, 
resulting in two premature stop codons before the 
end of exon 6. The stop codons truncate the protein 
before the bHLH domain. Sequence analysis of bHLH 
of Surjamkuhi, an indica line that carries a third allele, 
Rc-s , conditioning light red seed pigmentation, showed 
that the sequence of this line differed from the 
O. rufipogon allele at only four sites (positions 96, 
660, 1353, and 1833-1844). The first two changes 
proved to be synonymous substitutions. The change 
at position 1353 consisted of a C-to-A change in exon 6. 
This single-nucleotide polymorphism (SNP) was inde¬ 
pendent of any change seen in previous comparisons 
and represented a premature stop codon before the 
bHLH domain, truncating the protein and rendering 
the effect of the remaining indel immaterial. Conven¬ 
tional reverse transcriptase-PCR showed no expression 
of Rc in leaf tissue; however, expression was seen in 
panicles before fertilization, pericarp from grains in 
the milk or dough stage of filling, and pericarp from 
mature seeds. Similar expression levels of Rc were 
detected in cv Jefferson and O. rufipogon [121]. 

Seed Shattering 

Wild plant species shatter their seed from the mature 
infloresence and thus disperse seed for the next gener¬ 
ation to be seeded, while during domestication humans 
have selected for non-shattering plant types from 
which the seed can be harvested or gathered from the 
crop. Study of shattering has been very fruitful in 
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Arabidopsis, setting up the genetic models for 
shattering phenotypes [123] that have been also 
revealed in other crops like the cereals. 

In rice, a major QTL for seed shattering, sh4, has 
been mapped on chromosome 4 using segregating 
populations derived from crosses between O. sativa 
ssp. indica and the wild perennial species O. rufipogon 
[124, 125], between O. sativa ssp. japonica and 
O. rufipogon and two other closely related wild species 
O. glumaepetula and O. meridionalis [126, 127], and 
between O. sativa ssp. indica and the wild annual spe¬ 
cies Oryza nivara [128]. High-resolution mapping 
using 12,000 F2 (O. sativa ssp. indica x O. nivara) 
delimited sh4 to a 1.7-kb region of a gene with 
a previously unknown function. The comparison of 
the 1.7-kb sequences between the mapping parents 
revealed seven mutations located in the intron, exon, 
or 5' upstream of the start codon. Sequencing of this 
1.7-kb region from an additional 14 rice cultivars of 
O. sativa , 21 accessions of O. nivara , 6 accessions of 
O. rufipogon , and 1 accession of each of the four 
remaining wild A-genome species and their association 
with shattering phenotypes showed that a nucleotide 
substitution of G for T or an amino acid substitution of 
asparagine for lysine in the first exon was selected for 
the development of non-shattering cultivars during 
rice domestication. Annotation of the sh4 protein 
identified a Myb3 DNA binding domain and a 
nuclear localization signal, suggesting that sh4 is 
a transcription factor. Transformation of a binary plas¬ 
mid carrying sh4-GFP fusion under the control of Ubi 
promoter into rice cv. Taipei 309 determined the 
nuclear localization of GFP-tagged sh4. Transformation 
into rice cv. Taipei 309 of a binary plasmid carrying 
O. nivara sequence from the 3' nontranslated region to 
the inclusion of the G-T mutation site and O. sativa 
sequence for the rest sequence until the 5' regulatory 
region, but not the one carrying a shorter O. nivara 
sequence excluding the G-T mutation site, showed sig¬ 
nificantly reduced strength of grain attachment to ped¬ 
icel [128]. 

A major QTL for seed shattering, qSHl , was first 
mapped on chromosome 1 using 182 F2 plants of 
a cross between Kasalath (a shattering-type indica cul- 
tivar) and Nipponbare (a non-shattering-type japonica 
cultivar). A near-isogenic line (NIL) was developed 
that contained a short chromosomal segment from 


Kasalath at the qSHl region in a Nipponbare genetic 
background. The NIL had a stronger seed-shattering 
phenotype than either Kasalath or Nipponbare. High- 
resolution mapping using 10,388 plants from the 
BC4F2 and BC3F2 populations segregating at the 
qSHl region succeeded in mapping the functional nat¬ 
ural variation in 612 bp and only one single-nucleotide 
polymorphism (SNP) was found within this region. No 
distinct open reading frame (ORF) was identified in the 
SNP region based on gene prediction for the qSHl 
region in both Nipponbare and Kasalath genome 
sequences. However, one ORF for a rice ortholog of 
the Arabidopsis REPLUMLESS (RPL ) gene was found 
12 kb away from the SNP. The RPL gene encodes 
a BEL 1-type homeobox and is involved in the forma¬ 
tion of a dehiscence zone (or abscission layer) alongside 
the valve in the Arabidopsis fruit (silique). Transforma¬ 
tion of 26-kb Kasalath genomic fragments scanning the 
predicted ORF and the SNP regions into the non¬ 
shattering Nipponbare cultivar showed complementa¬ 
tion of the complete seed-shattering phenotype. The 
other fragments were not able to complement the phe¬ 
notype, even if they contained the entire ORF region or 
the SNP region. These results indicated that both the 
ORF and the SNP regions were required for full 
shattering function. In situ hybridization analysis 
revealed that in the NIL the ORF was expressed at the 
inflorescence meristem in the stage of rachis meristem 
establishment [inflorescence stage 1 (Ini)]. It was also 
expressed at both the anther region and the provisional 
abscission layer at the base of the spikelet in the stage of 
floral organ differentiation (In7) and in the stage of 
rapid elongation of the rachis and branches (In8). On 
the other hand, in Nipponbare, the ORF was expressed 
in the same way as in the NIL, except that it was not 
expressed at the provisional abscission layer in either 
In7 or In8. These results, together with the comple¬ 
mentation results, indicated that this RPL ortholog was 
the qSHl gene and that the identified SNP affected only 
the spatial mRNA expression pattern of qSHl at the 
abscission layer [129]. 

A shattering mutant line of rice, Hsh, was derived 
from a non-shattering japonica variety, Hwacheong, by 
N-methyl-N-nitrosourea (MNU) treatment. Optical 
microscopy revealed that Hsh had a well-developed 
abscission layer similar to the wild rice Oryza nivara , 
while Hwacheong did not produce an abscission layer. 
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Genetic analysis showed that the easy shattering of Hsh 
was controlled by the single recessive gene sh-h. Using 
an F2 population consisting of 240 individuals derived 
from the Hsh/Blue&Gundil cross the sh-h was mapped 
on chromosome 7 [130] . Fine mapping using six newly 
developed SSR markers delimited the sh-h locus to 
a 150-kb region. Further mapping using three newly 
developed SNP markers on five F3 recombinant lines 
derived from F2 heterozygous plants narrowed the sh-h 
locus to a 34-kb region. Analysis of genomic sequences 
from Hwacheong, Hsh, and Blue&Gundil lines identi¬ 
fied eight genes in this region. Among these, a gene 
which encodes a protein containing a conserved 
carboxy-terminal domain (CTD) phosphatase domain 
was considered to be a strong candidate for the sh-h 
locus because of the presence of a point mutation at the 
3' end splice site of its seventh intron. Sequencing of the 
RT-PCR products of Hwacheong and Hsh revealed 
that, compared with Hwacheong, a 15-bp deletion 
was induced by altered splicing in the mRNA isolated 
from Hsh. Because the consensus sequence for splicing 
at this “AG” site was changed to “TG,” the next “AG” 
sequence, 15-bp downstream, was used as a new splice 
site. The 15-bp deletion in the Hsh mRNA resulted in 
the deletion of five amino acids in the C-terminal 
region of the CTD-like phosphatase domain (CPDc), 
which are situated within the phosphatase active site. 
Two transferred DNA (T-DNA) insertion mutants and 
one point mutant exhibited the enhanced shattering 
phenotype, confirming that this CTD phosphatase-like 
gene is indeed the sh-h gene. RNA interference (RNAi) 
transgenic lines with suppressed expression of this gene 
exhibited greater shattering [131]. 

Tolerance to Abiotic Stresses 

Wild species are adapted to the environment where they 
are found; survive over long periods of time by creating 
diversity where environmental selection pressures allow 
the fit to reproduce and survive. The environmental 
stresses include abiotic stresses such as heat, cold, 
drought, salt, light, radiation, heavy metals, soil pH, and 
others. Plant response to these stresses has been studied 
and shows common and unique responses [132]. 

Salinity Stress Tolerance Salinity tolerance has been 
amenable to study for phenotypic variation and the 
genetic dissection into quantitative traits. In rice, 


a major QTL for shoot K + content under salt stress, 
SKC1 , was mapped to chromosome 1 using 133 lines of 
an F2 and an equivalent F3 population derived from 
a cross between a salt-tolerant indica variety Nona 
Bokra, and a susceptible elite japonica variety 
Koshihikari [133]. Fine mapping was performed 
using 192 BC2F2 plants, and a high-resolution map 
was subsequently generated with 2,973 BC3F2 plants 
using markers newly developed on the basis of the PAC 
clone sequences. Progeny testing of fixed recombinant 
plants (BC3F4) delimited the SKC1 locus to a 7.4-kb 
region, and only one predicted open reading frame 
(ORF) was identified in the region. The SKC1 promoter 
region (2,554-bp upstream of the initial codon) and the 
coding region of the cDNA clone (1,665 bp) were 
ligated and subcloned into a plant binary vector. This 
construct was transferred into the japonica variety 
Zhonghua 11, which contained the same SKC1 allele 
as Koshihikari and is salt susceptible. Shoot K + con¬ 
centrations were substantially higher in all six T1 prog¬ 
eny containing the Nona Bokra SKC1 transgene 
compared with plants containing the vector control 
under salt stress but not under normal condition. 
Database searches showed substantial similarity 
between SKC1 and the HKT-type transporters found 
in plants, bacteria, and fungi. Comparison of nucleo¬ 
tide sequences between Nona Bokra and Koshihikari 
alleles showed six nucleotide substitutions in the cod¬ 
ing region that lead to four amino acid changes, which 
may be responsible for the functional difference 
between the two alleles. RT-PCR analysis in both 
Koshihikari and NIL ( SKC1 ) showed that SKC1 tran¬ 
script levels were upregulated by salt stress in the root 
but not in the shoot. Under normal condition, K + and 
Na + contents in NIL ( SKC1 ) shoots were not substan¬ 
tially different from those in Koshihikari. But under salt 
stress, NIL ( SKC1 ) shoots had a higher K + content and 
lower Na + content than Koshihikari shoots. No sub¬ 
stantial differences in K + and Na + contents were 
observed in the roots under either normal or stress 
condition. Consistent with the results of shoot analysis, 
K + and Na + contents in xylem sap were not different 
between NIL ( SKC1 ) and Koshihikari; under salt stress, 
however, the xylem sap of NIL ( SKC1 ) contained more 
K + and less Na + than that of Koshihikari. These results 
indicated that SKC1 is involved in regulating K + / 
Na + homeostasis in the shoots [134]. 
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Submergence Tolerance A major QTL for submer¬ 
gence tolerance, Subl , was mapped to chromosome 9 
using 169 F2 plants and the resulting F3 families of 
a cross between a tolerant indica rice line, IR40931-26 
(a descendant of FR13A), and a susceptible japonica 
line, PI543851 [135]. High-resolution mapping using 
2,950 F2 individuals and the resulting F3 families of 
a cross between DX18 -121 (a tolerant F3 plant from the 
first population) and M-202 (a submergence- 
susceptible japonica cultivar that is widely used in 
California) identified 2 AFLP markers localized within 
0.2 cM of Subl [136]. The Subl locus was delimited to 
an interval of 0.06 centimorgan using 4,022 F2 individ¬ 
uals from the cross between DX18-121 and M-202. 
Physical mapping with five overlapping bacterial arti¬ 
ficial chromosome (BAC) clones derived from submer¬ 
gence-intolerant indica rice varieties and a nearly 
complete contig of 13 binary clones from IR40931-26 
showed that Subl region physically spans over 182 
kilobases (kb). This interval encodes three genes 
containing ethylene response factor (ERF) domains 
and designated Subl A, SublB , and SublC, ten non- 
ERF genes including four transcribed and six hypothet¬ 
ical protein-coding genes, and >50% retrotransposon- 
related sequences. The corresponding region of the 
japonica genome represented by the sequenced variety 
Nipponbare spans 142 kb and is considerably 
rearranged. Notably, SublA is absent from the rice 
reference Nipponbare genome. Accumulation of 
SublA and SublC messenger RNAs was strongly but 
transiently promoted by submergence in seedling 
leaves of tolerant FR13A, while SublB transcripts 
increased only slightly during submergence. The ten 
non-ERF genes in the indica Subl region showed no 
evidence of expression in seedling leaves before or 
during submergence in IR40931-26 or the intolerant 
variety M-202. A survey of the Subl locus haplotypes in 
17 indica and 4 japonica varieties identified 2 SublA , 9 
SublB , and 7SublC alleles on the basis of variation in 
amino acid sequence. The SublA-1 and SublC-1 alleles 
are limited to all six submergence-tolerant accessions. 
There was no SublB allele identified as being specific to 
submergence tolerance. In the tolerant SublA-1 allele, 
a single-nucleotide polymorphism at position 556 is 
responsible for a Pro 186 (intolerant) to Ser 186 (tol¬ 
erant) substitution in a MAPK site. Conversely, the 
SublC-1 allele of tolerant lines lacks a MAPK 


phosphorylation site present in the alleles of the intol¬ 
erant accessions. A Subl A-1 full-length cDNA under 
the control of the maize Ubiquitinl promoter was 
transformed into an intolerant japonica variety 
Liaogeng. A screen of seedlings after 11 days of sub¬ 
mergence identified four T1 families, derived from 
independent TO Ubi:SublA + lines, with submer¬ 
gence-tolerant transgenic individuals, and progeny 
from two families were examined in detail. T1 families 
one and three showed a correlation between high 
expression of the SublA-1 transgene and submergence 
tolerance. As observed in the FR13A descendant 
IR40931-26, tolerant SublA-1+ plants showed a signif¬ 
icant impairment of shoot elongation under submer¬ 
gence compared with the intolerant parent Liaogeng 
and non-transgenic siblings [137]. 

Aluminum Tolerance In wheat ( Triticum aestivum L.) 
mechanisms that minimize the harmful effects of A1 
ions have been investigated using near-isogenic lines 
[ET8 (Al-tolerant) and ES8 (Al-sensitive)] that differ in 
A1 tolerance at a single dominant locus designated as 
Altl. The Altl locus cosegregates with an Al-activated 
malate efflux from root apices [138, 139]. One cDNA 
clone more highly expressed in the root apices of ET8 
compared to that of ES8 was isolated by subtractive 
hybridization. The full-length cDNA was obtained by 
rapid amplification of cDNA ends (RACE)-PCR. The 
predicted protein is hydrophobic having six to eight 
putative transmembrane regions. Heterologous expres¬ 
sion of this gene in Xenopus oocytes indicated that this 
gene encodes an Al-activated transporter that facilitates 
the efflux of malate but not citrate. Based on this result, 
the gene was named ALMT1 (aluminum-activated 
malate transporter). Sequencing the ALMT1 cDNAs 
derived from ET8 and ES8 showed that the sequences 
differed at six nucleotides that resulted in the deduced 
proteins differing at two amino acid residues. Sequence 
of the ALMT1 coding region in Atlas66 (Al-tolerant 
cultivar) was identical to that of the ET8 allele, while 
the sequence in Scout66 (Al-sensitive cultivar) was 
identical to that of ES8. As found for the ET8 and ES8 
lines, the expression of ALMT1 in the root apices was 
greater in Atlas66 compared to Scout66. Expression 
analysis of ALMT1 in the 57 F2 individuals derived 
from a cross between ET7 and ES5, the near-isogenic 
progenitor lines of ET8 and ES8, showed that all the 
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Al-sensitive seedlings expressed the ES8 allele only, 
whereas Al-tolerant seedlings expressed either ET8 
allele only or both ET8 and ES8 alleles. Thus, the ET8 
allele was expressed only in the Al-tolerant seedlings, 
indicating that, in this population, the Al-tolerant phe¬ 
notype was correlated with the higher expression of this 
allele. Evaluation of A1 tolerance and genotype analyses 
using an RFLP marker and a PCR-based assay on 204 
F3 families derived from a cross between ET8 and ES8 
showed that the ET8 allele of ALMT1 completely 
cosegregated with the Al-tolerant phenotypes and the 
Al-tolerance locus, Altl. Heterologous expression of 
ALMT1 in cultured tobacco cells increased the toler¬ 
ance of tobacco cells to A1 treatment [140]. 

Future Directions 

The present era of plant sciences is distinguished by 
an integration of multiple disciplines toward a 
transdisciplinary/interdisciplinary approach of doing 
science, utilizing the technology and knowledge from 
computer science and bioinformatics, genomics, dedi¬ 
cated instrumentation for high-throughput automa¬ 
tion of laboratory and field tasks, which can all 
together provide integrated models and technologies 
for the improvement of crop traits. The paradigm shift 
in genetics, due to the reduced cost in genome sequenc¬ 
ing, is to be able to generate the genome sequence of 
any plant genotype of interest that will totally revolu¬ 
tionize the approach to use this information. With 
a few genetic crosses followed by whole genome 
sequencing of all genotypes, it would be possible to 
identify genes and useful alleles that can be incorpo¬ 
rated into plant improvement programs. This would 
make the traditional ways of isolating or identifying 
gene sequences for crop traits obsolete. The lag will be 
in phenotyping individual genotypes, which will 
remain the challenge for the biologist. The association 
of genes to a trait will still require that traits be charac¬ 
terized, dissected into components and parameters 
that can be measured, and the candidate gene(s) 
validated by some reverse genetics strategies using 
methods such as mutant analysis, transformation, and 
complementation, at least till the function of every 
gene/allele is known and how these genes and their 
alleles interact together to determine the phenotype 
of a crop trait. 
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Glossary 

Allocative efficiency Efficiency with which given var¬ 
iable input(s) is used. 

Attainable yield Yield reached by farmers with eco¬ 
nomically optimum management and reasonable 
risk aversion. 

Ecotilling Seeking related alleles across germplasm 
collections based on DNA homology. 

Farm yield Average grain yield across farms in a region. 

Harvest index Ratio of grain dry weight to total 
above-ground dry weight at maturity. 

Molecular markers Short sequences of DNA that are 
homologous with sequences close to or within 
important genes. 

Potential yield Grain yield in the absence of manage¬ 
able abiotic and biotic stresses. 

Potential yield water limited Grain yield in the 
absence of manageable abiotic and biotic stresses 
apart from that imposed by water supply through 
inadequate rainfall. 

Radiation use efficiency Above ground biomass pro¬ 
duced per unit of total solar radiation intercepted 
by green crop tissue. 

Technical efficiency Practices, timing, and technical 
skills adopted by farmer in using inputs. 

Yield gap Difference between farm yield and potential 
yield. 
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Definition of Subject 

The entry assumes that yield increase will continue to 
play a dominant role in world food security, as it has 
over the last 60 years. It is restricted to annual grain 
crops, since these dominate the world’s arable land¬ 
scape (>70%) and humankind’s food supply 
(>70%), including grain used as livestock feed. Crop 
yield is the weight of grain, at some agreed standard 
moisture content, harvested per unit of land area per 
crop (note, there can be two or even three annual crops 
per year in some favored environments, meaning 
a cropping intensity of 200% or 300%, respectively). 
The starting point for yield is usually the field, district, 
regional, or national average yield in kg or t per hectare, 
as reported in surveys or local or national statistics. 
Here this is referred to as farm yield (FY, t/ha). This and 
many related cropping statistics are collated annually 
for all countries by FAO (http://faostat.fao.org/site/ 
567/default.aspx#ancor). FY is usually expressed rela¬ 
tive to harvested land area (note: this can fall well below 
planted area in some situations). Although FY is 
quoted and used widely, it may not be as accurate as 
it appears due to uncertain grain admixtures and/or 
poor collection of the statistics. With survey data sam¬ 
pling error can also arise. 

At the highest end of the yield scale it is useful to 
define potential yield (PY), which is the yield to be 
expected with the best adapted cultivar, the best man¬ 
agement of agronomic inputs, and in the absence of 
manageable abiotic and biotic stresses [ 1 ]. Many com¬ 
plications are hidden within this apparently simple 
definition. PY is usually determined in plots, with of 
course sampling error. In order to be relevant to the 
surrounding district, the natural resource base of the 
plot (climate, soil type, topography) needs to be com¬ 
parable, including any long-term management invest¬ 
ments to improve this aspect of the site (e.g., liming, 
tile drainage). Water supply must be adequate for PY 
determination as defined, and this can come from well- 
distributed rainfall close or equal to crop potential 
evapotranspiration (crop water use from sowing to 
harvest without water limitation) or from full or sup¬ 
plemental irrigation; in addition, pests, weeds, and 
diseases must be held at negligible levels through the 
use of biocides if necessary. Finally, crops experiencing 
relatively rare weather damage such as crop lodging or 


unseasonal frosting are excluded from PY measure¬ 
ment. Taken overall, it would seem PY might be impos¬ 
sibly difficult to measure, but it is reported often in the 
crop science literature, although not always with 
proper attention to the above caveats. 

Since PY is usually measured in plots, edge effects 
arising from extra solar radiation or soil moisture or 
nutrients reaching outside plants must be avoided by 
discarding the plot edges. Two types of PY plots are 
commonest. Often they are well-managed yield trials to 
compare new varieties or advanced lines against older 
ones, or even historic ones, to give measures of breed¬ 
ing progress by plotting variety yield against year of 
release. The most useful such trials, for example those 
for wheat conducted by the UK Home Grown Cereals 
Authority (HGCA http://www.hgca.com/content.tem- 
plate/O/O/Home/Home/Home.mspx), measure yields 
with and without fungicide protection. Only the for¬ 
mer yields are a measure of true PY, but fungicide 
protection is still not very common in such yield trials 
around the world, although visible disease levels are 
usually reported and can be negligible. The second 
source of PY data comes from experiments conducted 
by crop physiologists to calibrate and/or validate crop 
simulation models: the models, driven largely by vari¬ 
ous aspects of solar radiation and temperature, can 
then be used to predict PY in other environments 
(e.g., sowing dates, years, locations). For such purposes 
modeling accuracy has steadily improved, but models 
need to be updated with the latest varieties every few 
years, since breeders are steadily altering varieties (e.g., 
phasic development, improving PY). Sometimes, crop 
contest yields or crop record yields are considered to be 
synonymous with PY. But they need to be treated with 
caution, because they usually exceed PY measured as 
defined here, probably because they refer to very favor¬ 
able circumstances (e.g., soils, weather, management), 
relative to the district average; nevertheless, we can 
learn from such yields if all site and management vari¬ 
ables are quantified. For example, properly verified 
world record yields, invariably higher than PY values, 
not only extend our simulation models but also reveal 
that there is no anatomical limitation to very high 
yields, the main limitation evident with such yields 
being the stem strength needed to support them. 

Since much of the world’s grain crops are grown in 
rainfed situations where water supply from 
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precipitation plus starting soil storage falls well below 
the potential evapotranspiration, it is also useful to 
define a water-limited potential yield (PYw, t/ha): it is 
the yield obtained with no other manageable limitation 
to the crop apart from the water supply. Obviously, it 
will depend on the amount of water, so PYw is usually 
plotted relative to water supply (or use), the slope being 
the crop water use efficiency or water productivity, 
commonly reported in kilograms per hectare per 
millimeter. Complications can arise due to variation 
in rainfall distribution with respect to crop develop¬ 
ment stages, but PYw, defined as a linear function of 
the water supply, is a very useful simple benchmark, as 
argued in a recent in-depth review by Passioura and 
Angus [2], while simulation modeling has been espe¬ 
cially useful in dealing with expected deviations due to 
variation in the distribution of water supply. 

In any given region, between FY and PY (or PYw), 
there is another useful yield notion, namely attainable 
yield (AY, t/ha), which is defined here as the yield 
attained by a farmer with average natural resources 
adopting economically optimal practices and levels of 
inputs. Since risk of financial loss is almost always part 
of a farmer’s decision to invest in increased inputs, the 


AY definition must temper “optimum level” with “pru¬ 
dent attention to risk”; as an example this could mean 
input investments must be expected to return at least 
50% in net profit at the margin. Of course, AY will 
reflect the economic circumstances of the crop and 
region, in particular grain prices relative to input 
ones, all measured at the farm gate. Thus, it is not 
easy to know AY, but general experience suggests that 
it lies around 20-30% below PY in situations where 
world prices and reasonable transport costs operate. FY 
for wheat in the UK probably meets these conditions 
(Fig. 1). Where this does not occur, for example, in 
much of sub-Saharan Africa where infrastructure and 
institutions are weak, the AY as defined above may be 
much lower; alternatively where inputs and grain prices 
are heavily subsidized, AY could approach closer to PY. 
Because of these uncertainties, it is easier to talk of the 
yield gap in terms of the FY to PY gap, bearing in mind 
that even in the most advanced cropping situations in 
developed economies, operating at close to world 
prices, FY will remain significantly below PY. Since it 
is more appropriate to express this gap as a percentage 
of FY, it can be assumed from the literature that this 
minimal gap is reached when the PY-FY gap is about 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 1 

Change in farm yield (FY) and potential yield (PY) with time for wheat in the UK. Source: FAOStat and HGCA. Note that 
in this and other figures (except Fig. 4), PY is plotted against year of variety release, for varieties are compared side by side 
in recent yield trials under modern management 













Crop Yields Around the World: Closing the Gap and Raising the Potential 


C 


2723 


30% (of FY). Any larger gap that this, which is usually 
the case, is often defined as an “economically exploit¬ 
able yield gap,” but bearing in mind that the expected 
exploitation can be as much a task for national and 
local government and agribusiness, as for the farmers. 

Introduction 

Crop yield has been the subject of attention since the 
days of Malthus and earlier. More recently, many have 
pointed out that the improved supply of food calories 
per capita globally, and reduced real costs of grain, over 
the last 60 years has been due almost entirely to global 
crop yields growing faster than the burgeoning world 
population (e.g., [4]), for there have been only small 
increases in cropped area (Fig. 2). The world popula¬ 
tion will grow to over nine billion by 2050, and grain 
demand is projected to increase around 60-70% (com¬ 
pared to 2000) [5], with others projecting even greater 
increases, and certainly more rapid increase coming 
early in this period. Seventy percent over 50 years 
amounts to an exponential increase of 1.1%; output 
growth must match this or real prices will increase (see 
also later). There is some potential new arable land of 


reasonable quality in South America and sub-Saharan 
Africa (and in northern latitudes of Russia), but 
increase in arable area demands heavy capital invest¬ 
ment and brings negative environmental consequences; 
indeed, maintenance of arable area could be a challenge 
in densely populated fast-urbanizing regions like South 
and East Asia, and where there is continuing arable 
land degradation. Crop area can also increase through 
intensification of annual cropping but that usually 
depends on expansion in irrigation, which has in fact 
slowed markedly, especially in Asia where water avail¬ 
ability for agriculture is now constrained. As 
a consequence, most argue that the projected increase 
in grain demand must continue to be met by yield 
increase. 

Many grain crops have exceeded 1% exponential 
yield growth in the early decades of the modernization 
of agriculture last century, including the Green Revo¬ 
lution period for rice and wheat in Asia, but global 
yield growth trends for most crops over the last 20 
years have slowed and have become linear, with current 
growth rates at or below 1%; maize at 1.6% and soy¬ 
beans at 1.1% are the only major crops exceptions to 
this (Fig. 3; note growth rate is expressed here by 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 2 

Change in world population, global arable area, and cereal yield. Adapted with permission for Evans (1998) 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 3 

World average yield versus time for wheat, rice, maize, and soybeans over last 20 years. Source: FAOStat 


calculating the linear slope as a percentage of current 
predicted yield for each crop). Increasing the propor¬ 
tion of a crop which is irrigated can increase yield 
growth, as happened with wheat in Asia in the 1960s, 
but further increases in irrigation area are unlikely, due 
to cost and/or lack of water, except possibly in sub- 
Saharan Africa. This leaves the maintenance, or prefer¬ 
ably boosting, of rates of yield increase in existing 
cropped situations as the major route for meeting 
growing demand for grain, and the major challenge 
facing agricultural science. And despite some media 
sentiment to the contrary, agricultural scientists are 
fully aware that this goal has to be met while sustaining, 
or better still, improving the natural resource base of 
cropping, and while confronting the uncertain and 
predominantly negative projected impacts of climate 
change. 

From the definition of the subject, it can be seen 
that there are two sources of yield increase: raising PY 
and closing the PY to FY gap (see also Figs. 1, 4 and 5). 
In most situations, both processes contribute simulta¬ 
neously to FY increase, with farmers adopting new 
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Crop Yields Around the World: Closing the Gap and 
Raising the Potential. Figure 4 

Change in PY and in FY for rice in Egypt. Source: Badawi [3]; 
A. E. Draz, 2009, personal communication 

technologies (varieties, management techniques, 
input levels) that have been developed years or decades 
earlier by researchers at which time the associated rise 
in PY occurred. Recently, two trends in this process are 
becoming evident. Firstly, management or agronomic 
innovations for increasing PY, including their positive 
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Year 

Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 5 

Wheat PY and FY changes with time, Yaqui Valley, Mexico. Source: [5] 


interactions with new varieties (e.g., the universal 
nitrogen by variety interaction in wheat and rice, or 
the plant density by hybrid variety interaction in 
maize), are becoming fewer, leaving a greater propor¬ 
tion of PY progress to breeding advances. This is 
a somewhat controversial observation but is clearly 
supported by analysis of the winter wheat yield changes 
in the UK [6]. Secondly, in some special situations, as 
mentioned earlier, the yield gap is approaching the 
minimal 30% or so dictated by current economics; 
examples include winter wheat in the UK (Fig. 1) and 
probably all of Western Europe), rice in Egypt (Fig. 4), 
and maize in the US corn belt (e.g., Iowa, see Fig. 6). 

While it is not always easy to separate PY advance 
from gap closing, since this volume has given more 
attention to the former, in particular the possibilities 
arising with new molecular biological techniques 
becoming available to plant breeders, this entry will 
give more attention to closing the yield gap than to 
raising PY. Both issues have been more fully discussed 
in Fischer et al. [5]. Another reason for concentrating 
here on closing the yield gap is that, notwithstanding 
the evidence of small yield gaps in some situations, it is 
likely that yield gap closing offers better prospects 


globally for quickly lifting FY progress, as is needed, 
than does boosting PY progress. The entry continues by 
looking at the size of the yield gap in various key 
situations around the world. Passing then to gap clo¬ 
sure, traditionally this is considered to arise from 
farmer adoption of yield-enhancing technologies, the 
area of agricultural extension or technology transfer. 
While there is a lively body of socioeconomic research 
on this process, which will be considered briefly, the 
way the yield gap is defined here, it is hard to see the 
new products of breeding and agronomic research 
reducing the yield gap. This is because their initial 
impact should be on PY as defined, actually increasing 
the gap, followed by their gradual adoption which, if 
complete, might be expected to reduce the gap to the 
same relative value as before, other things being equal. 
Needless to say the real situation is more complex and 
examples will be given of how such research, including 
progress in molecular biology, can help directly reduce 
the gap (e.g., more adoptable technologies, less costly 
ones, and better biotic stress resistance). The entry 
finishes by discussing briefly the current situation 
with progress in PY and PYw themselves. What will 
not be discussed further is the likelihood that 
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Year 


Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 6 

Change in maize FY in Iowa compared to that in PY as estimated by Pioneer (Hammer et al. ([7], red squares ) and de Kalb 
(Edgerton [8], green triangles) hybrid yields plotted against year of hybrid release 


atmospheric C0 2 increase will continue to improve all 
grain yields, other things remaining equal, but the rate 
of increase is currently around only 0.2% in a C3 crop 
like wheat [9], and less for C4 crops. 

Estimates of Yield Gaps 

Fischer et al. [5] attempted to estimate yield gaps and 
their rate of change for a number of key representative 
cereal-producing regions in the world. The most reliable 
estimates are summarized in Table 1, supplemented 
with a few new crops and numbers. 

The yield gap estimates for wheat are remarkably 
consistent at around 50% except for the UK, where as 
mentioned before the 25% gap probably implies that 
FY is approaching AY, an AY largely determined by 
ruling world prices in the UK. Figure 5 illustrates the 
Yaqui Valley data, where there was substantial yield gap 
closing as the modern semidwarf varieties took over in 
the 1960s; lately, however, the yield gap has remained 
fairly steady at 50%. Lobell et al. [12] recently 
presented 12 published estimations of the yield gap 
for wheat in developing countries, the gap ranged 
from 5% to 150% but the average was close to that in 
Table 1, namely 55%. 


For rice, Table 1 shows gap values around 100% or 
more for most developing country situations except for 
Central Luzon in the wet season (65%), and 
a remarkably low 15% for Egypt (see also Fig. 4). The 
value of 55% for Japan reflects the heavy emphasis on 
producing high-quality grain, acting as brake on FY 
progress, because food-quality rice varieties and agron¬ 
omy deliver lower yields. From the 41 rice estimates 
from developing countries in [12], the average gap was 
65%, with modeling evidence that the gap was 120% in 
northwest India (cf. 110% in the Indian Punjab in 
Table 1). 

Table 1 has only two cases for maize: Iowa at 25% 
contrasts with Kenya at 200+%; sub-Saharan Africa as 
a whole is similar to Kenya. Tittonell et al. [13] in 
a detailed survey of farm yields in three districts in 
the favored Kenyan Highlands confirms the figure 
shown in Table 1, recording a mean FY of 1.5 t/ha 
when PY was 6-7 t/ha. Lobell et al. [12] give nine 
examples of rainfed maize in tropical and subtropical 
developing countries (average gap 200%) and examples 
of irrigated and rainfed maize in Nebraska based on 
simulation of PY and PYw which suggest gaps of 35% 
and 55%, respectively, both larger than that for rainfed 
maize in the adjacent wetter state of Iowa (Table 1). 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Table 1 Estimates of current yield gaps for 
key crops and regions (compiled from [5] and elsewhere as noted) 





Yield (t/ha) a 

and gap(%), 2007 or 2008 

Region 

Environment 

Crop area (M ha) 

FY 

PY 

Gap 

Wheat 

Yaqui V, Mexico 

Irrigated, low latitude 

0.16 

6 

9 

50 

Punjab, India 

Irrigated, low latitude 

3.9 

4.3 

6.25 

45 

Western Australia 

Subhumid, low latitude 

4.5 

1.8 

2.6 b 

45 

N. Dakota 

Subhumid, high latitude 

3.4 

2.5 

3.7 b 

50 

UK 

High rain, winter wheat 

1.8 

8.2 

10.4 

25 

Eastern China 

Irrigated, winter wheat 

16 

4.7 

7 

50 

Kansas 

Subhumid, winter wheat 

3.6 

2.6 

3.9 b 

45 

Rice 

Central Luzon 

Wet season + irrigation 

0.8 

3.8 

6 

60 

Punjab, India 

Wet season + irrigation 

2.4 

3.8 

8 

110 

Japan 

Wet season + irrigation 

3 

6.5 

10 

55 

South Asia 

Wet season, rainfed 

28.5 

1.8 

3.6 

100 

Central Luzon 

Dry season, irrigated 

0.4 

4.5 

9 

100 

Egypt 

Dry season, irrigated 

0.7 

10.1 

11.6 

15 

Maize 

Iowa 

Temperate, high rainfall 

5.3 

10.8 

15.5 C 

45 

Kenya 

All altitudes, moderate rain 

1.75 

1.8 

6 b 

200+ 

Soybeans 

USA d 

Temperate, high rainfall 

30 

2.8 

3.6 

30 

India 6 

Subtropical, tropical, moderate 
rainfall 

9 

1 

2.2 b 

120 

Millet 

lndia f 

Subtropical, rainfed 

11 

0.9 

1.8 b 

100 


Predicted from linear trends 
b Actually PYw 

c Reestimated in Fischer and Edmeades [9] 
Estimate by R. A. Fischer, 2010 unpublished 
e Bhatia et al. [10] 
f Murty et al. [11] 


While there is no doubt about the huge yield gap with 
maize in Africa, some uncertainty exists in these critical 
estimates from the USA, where maize is probably the 
most intensively researched and promoted crop in the 
world. No doubt the gap is quite small in Iowa and 
Nebraska, where it has been closing lately, and FY may 


be close to AY (Fig. 6), although more PY progress data 
is needed for a clearer conclusion. 

The soybean examples in Table 1 also show a large 
difference in the yield gap between a developed and 
developing country, but do not show developing coun¬ 
tries Argentina and Brazil, where FY across more than 
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35 M ha with favorable rainfall is as high as the USA, 
and PY is unlikely to be any higher. Finally, millet, 
grown in India under very harsh conditions, shows 
a gap no worse that of soybeans in adjacent more 
favorable areas of India. 

Closing Yield Gaps 

Yield gaps are reduced by farmers adopting new tech¬ 
nologies or practices (new at least for them) or 
adopting higher rates of inputs (e.g., many farmers in 
western Kenya use added nutrients, either fertilizer or 
organic ones, but the rate of nutrients applied may be 
very low relative to the need/optimum (e.g., [13])). 
Some of this is usefully illustrated in Fig. 7 derived 
from Byerlee [14]. 

FY under traditional practices with low inputs is 
shown by point A on curve 1. Adopting new technol¬ 
ogies such as an improved new variety and practices 
(e.g., line sowing and fertilizer) lifts technical efficiency 
and brings curve 2 into play. The farmer may move 
from point A to point B, as in the first years of the 
Green Revolution. Allocative efficiency could then rise 
further as fertilizer levels increase, following curve 2 to 
point C, approaching some economic optimum. An 
even newer variety and the best available practices 
(e.g., herbicides, conservation tillage) which together 
define current PY (point F) could lift the technical 


frontier further (the uppermost curve 3). Farmers 
might choose to further increase technical efficiency 
by moving to the uppermost curve, maximizing 
allocative efficiency at around point D, or sacrificing 
allocative efficiency, but not yield, by reducing inputs 
(point E). Of course, curve 3 is not static: for any given 
environment it may be moved further upward with 
new technologies, and it may also shift downward due 
to problems of resource degradation. This figure also 
serves to illustrate that the yield gap is not the same as 
an efficiency gap, gaps between curves are technical 
efficiency ones, gaps along curves are allocative ones, 
both combine to make up the FY to AY gap. 

Adoption of New Technology 

There is a rich agricultural economic literature on the 
adoption of new technologies by farmers. Examples for 
the adoption of two new technologies in the rainfed 
cropping in Australia are shown in Fig. 8a (a new crop, 
lupins, and a new practice, no till). The new crop was 
adopted faster than the new agronomic technique. 
Developing countries have proved equally fast in 
adopting new varieties (Fig. 8b) when the conditions 
are favorable. The well-recognized patterns of lag, rate 
of adoption, and ceiling adoption level clearly differ 
between technologies and farmers (regions); in addi¬ 
tion, the infrastructural and institutional context is 



Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 7 

Illustration of pathways from low FY to close the gap with PY; modified from Byerlee [14] 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Figure 8 

Adoption of new technologies by (a) farmers in regions of Australia (lupins in Wongan Hills district [15] and no till in 
Loddon district [16]), and (b) semidwarf wheat varieties in Mexico and in Bangladesh [17] 


important. For the technology itself, key elements are 
its perceived relative advantage (return over cost, risk of 
loss, convenience, etc.) over that which it replaces, and 
its trialability, meaning the ease with which a new tech¬ 
nology can be tested on a small scale. Farmer charac¬ 
teristics influencing adoption are very diverse and can 
also interact with the nature of the innovation: educa¬ 
tion, age, health, exposure to extension and related 
media and demonstrations, group pressure, and sup¬ 
port can be important. But there is little doubt that 
farmers around the world, large and especially the 
small, women and men, aspire to increase monetary 
return and given the means to do so, will adopt more 
profitable practices. 

It is in the area of rural infrastructure and institu¬ 
tions that developed and developing country farmer 
circumstances differ most. One obvious example is the 
effect of poor roads on increasing input costs and 
reducing prices for surplus farm output. For example, 
the price ratio of N fertilizer to grain is on average 
double that in other regions of the world and higher 
still in inland landlocked regions of sub-Saharan Africa 
[ 18] . The high cost of credit, uncertainties surrounding 
contracts (including land tenure), the risk of poor- 
quality inputs, theft or unrest, and the prevalence of 
unfavorable and uncertain tax and pricing policies, all 
make adoption less attractive in many developing 


situations. Sometimes, price subsidy on inputs and out¬ 
puts can compensate for these brakes on adoption, such 
as in the recent situation where a well-crafted fertilizer 
supply and subsidy policy permitted Malawi, a nation of 
small holders, to produce an unprecedented surplus of 
maize and to do so without a maize price collapse. But 
in general and especially in sub-Saharan Africa the lack 
of public investment in infrastructure, institutions and 
agricultural extension, and the lack of sound policy are 
the major contributors to the yield gap. 

Looking at farm-level technical constraints that 
contribute to the yield gap, in any situation there are 
usually multiple constraints, and the challenge is to 
determine which constrains should take priority, 
while recognizing that interventions often interact pos¬ 
itively and are thus more effective when adopted 
together [19, 20]. This can only be answered by on- 
farm survey and experimentation. Such work started 
many years ago with farming system research, farm 
management clubs, and rapid rural appraisal. It con¬ 
tinues in many guises in the developed world, especially 
influenced by the privatization of agricultural exten¬ 
sion, the use of remote sensing and ICT advances, and 
the entry of input suppliers, in particular seed compa¬ 
nies, into agricultural extension. 

In the developing world the more traditional 
approaches remain, although with growing emphasis 
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on farmer participation [21]. Lobell et al. [12] recount 
how IRRI conducted on-farm rice experiments in Asia 
in the 1970s to test high inputs, learning that farmer 
yields varied greatly between fields, as did responses to 
inputs especially fertilizer and insecticide, which were 
often uneconomic. This pointed to the importance of 
field-to-field variability, and the need to adjust inputs 
accordingly and as the season unfolds, whether by site- 
specific nutrient management, which reached maturity 
some 20 years later [22], or via field-level pest moni¬ 
toring as part of IPM packages. Titonell et al. [13] 
recount a similar picture of substantial variability in 
soil fertility, resource use intensity, and yields among 
small farmer maize fields in western Kenya. Another 
lesson is surely that this is scientist-intensive expensive 
research, usually taken over by the farmer and his 
advisers in the industrial world, and explaining why 
large yield gaps often persist in the developing one, 
where circumstances demand innovative approaches 
in order to reach the billion small farmers (e.g., [21]). 

Very recently IRRI again looked at rice yield gaps, 
this time using expert knowledge to assess constraints 
and possibilities for irrigated rice in South Asia [23]. 
For this crop, FY is currently 5.1 t/ha over 34.3 M ha; it 
was estimated that on average yield was constrained 1.9 
t/ha (37%) by yield-limiting factors, which included 
individual constraints from nutrient insufficiency 
(10%), diseases (7%), weeds (7%), water shortage 
(5%), and rats (4%). The exercise was repeated for 
the 28.5 M ha of rainfed lowland and upland rice in 
South Asia with a current FY of 1.8 t/ha: yield-limiting 
factors amounted to 68% of FY, including nutrients 
(23%), disease (15%), and weeds (12%). The IRRI 
paper predicted that with a substantial research, devel¬ 
opment and extension effort, the adoption of existing 
technology and ongoing breeding over the next 15 
years could reduce these losses by one third (irrigated 
rice) or one quarter (rainfed rice) adding about 1% to 
FY growth rate. 

With wheat in the Yaqui Valley we have a recent 
concerted effort to understand the yield gap, PY-FY 
(currently 50%, Fig. 5, Table 1), this time using the 
latest high-resolution satellite imagery to estimate 
field-level yields for all fields in the Valley [24] and 
supplement a long history of farm surveys. Despite 
the moderate size and wealth of farms relative to 
India or Kenya, again field-to-field variability in yield 


was substantial. It was estimated from images over 
several years that wheat yields were constrained by 
late planting, delays in the first post-plant irrigation, 
and summer fallow weeds [25]. Improved institutions 
and farm management decisions could largely elimi¬ 
nate these constraints, which averaged over years 
totaled about 10-15% of FY, and would bridge about 
half of the gap to estimated AY in the Valley. These 
authors [26] used inter alia classification and regression 
trees to relate yield to constraints in their complex data 
sets. The same technique was used by Tittonell et al. 
[ 13] to explore management and soil constraints to 150 
field-level maize yields in highland Kenya: the rate of 
added nutrients was the strongest explanatory variable, 
followed by date of planting and plant density, as yield 
ranged from 1.2 (low nutrients, late plant) to 4.2 t/ha 
(high nutrients, normal plant, high density). 

Two recent studies illustrate for rainfed cropping 
the power of simulation modeling and water produc¬ 
tivity boundary functions in understanding yield gaps 
in surveyed farmer fields and in particular removing 
that part of the apparent gap which is actually due to 
non-manageable weather (mostly rainfall distribu¬ 
tion). Grassini et al. [27] found these distribution 
effects quite important for sunflower yield variation 
among fields in the western pampas of central Argen¬ 
tina. Hochman et al. [28] looking at farm wheat yields 
across Australia, found less influence of rainfall distri¬ 
bution, and more scope for farmers to lift water pro¬ 
ductivity via improved management (e.g., earlier 
planting, higher seeding density, higher N input). 
A weakness of both these studies is that they refer to 
better farmers rather than to a random sample of fields, 
for which the opportunities of gap-closing interven¬ 
tions are likely to be greater. 

The persistence of large yield gaps in the developing 
world especially draws attention to situations where 
these gaps have been closed. Rice in Egyptian is an 
obvious example where strong R, D, and E engaged 
a small geographically focused industry of small 
holders under a sound price policy (Fig. 4). A second 
example of dramatic technology adoption, albeit with 
lesser immediate implications for FY than for sustain¬ 
ability of the whole cropping system, relates to the 
uptake of conservation tillage for wheat, maize, and 
soybeans in southern South America (Argentina, 
Brazil, and Paraguay), rising from nothing in 1970 to 



Crop Yields Around the World: Closing the Gap and Raising the Potential 


C 


2731 


24 M ha in 2000. This was very much driven by farmer 
groups and the farmers themselves faced with the threat 
of serious soil degradation and by the opportunity 
provided by knock-down herbicides and knowledge 
spillover from the North (e.g., [29]). This revolution 
has yet to reach other developing continents (but is 
beginning in northwest South Asia). A third success 
story among small poor farmers has recently appeared 
with the introduction and expansion of winter maize in 
northeastern India and Bangladesh. 

Despite individual success stories like rice in Egypt, 
yield gaps in general appear to be quite persistent and 
close only slowly; this happens even when gaps are well 
above that to be expected from economics and risk 
aversion and even when PY progress has slowed such 
that catch-up through eliminating excessive lags in 
varietal adoption is not a big issue. The problem is 
that gap closing on the large scale needed requires 
massive investments in rural infrastructure and insti¬ 
tutions as well as technology transfer, and these are not 
forthcoming, as maize in sub-Saharan Africa exem¬ 
plifies. Elsewhere public sector agencies, in particular 
reaching the billion small farmers in Asia [21] , aided by 
the private sector, in particular in Latin America, have 
made some inroads on the yield gap; they should con¬ 
tinue to do so largely in proportion to the investments 
made, but there is also scope for innovation, for exam¬ 
ple, based on modern ICT technologies. The employ¬ 
ment of agronomists by private seed companies is 
a pattern that is bound to be followed in the developing 
world as its seed industry grows in strength and com¬ 
petitiveness. With gap closing, there are no spill-ins as 
there are in the case of PY advance through R and D, 
innovations need to be adapted locally, but it can be 
argued that the Internet and mobile phones are rele¬ 
vant spill-in technologies for delivering information to 
farmers small and large, a role which could greatly 
expand. 

Innovative Adoption-Friendly Crop Management 
and New Varieties 

Research can facilitate the adoption of technologies to 
lessen those constraints which contribute to the FY to 
PY gaps. Table 2 lists common constraints, separating 
research targeting agronomic solutions from that 
involving breeding, while institutional and 


infrastructural solutions already mentioned are shown 
in the last column. Some agronomic technologies are 
still under development (e.g., improved seasonal 
weather forecasts), but much of the necessary agro¬ 
nomic research is adaptive, generally fitting technolo¬ 
gies from more advanced farming systems. For example, 
the widespread adoption of direct seeding of wheat after 
rice in northwest India, which by saving on land prep¬ 
aration time has meant that sowing dates are less likely 
to be late and yield expectations are hence improved, 
required research and development on appropriate 
small-scale drills for this direct seeding operation. 
Sometimes, the agronomic technology comes from 
a “less-advanced” field cropping situations or from 
farmers themselves: for example, the use of plastic film 
mulches, quite common in northern China with wheat, 
maize, and oilseeds. Such mulches give substantial yield 
benefit through retaining soil moisture and aiding soil 
warming in the spring, and challenge agronomic 
researchers to adapt them in a sustainable fashion else¬ 
where, including developed countries. 

It is with breeding that there are greater gap-closing 
opportunities. Considering that actual losses from dis¬ 
eases and insects globally exceed 20% of yield with 
wheat, rice, and maize [30], any improvement in 
genetic resistance has an immediate benefit for FY 
when the improved varieties are adopted. Conventional 
host plant resistance breeding continues to make pro¬ 
gress on this front, while genetic engineering has 
brought exciting new opportunities. Although 
engineered resistance does not lift PY, it lifts FY wher¬ 
ever farmers cannot control pests and diseases with 
traditional means. The experience with Bt cotton in 
India shows how important this can be: small holder 
yields are at least 30% better simply because before the 
advent of Bt cotton they were unable to eliminate 
damage no matter how much insecticide was used 
[31]. It is likely that the advent of GM corn resistant 
to root worm is boosting farm yields in the USA 
because root worm damage went unnoticed before¬ 
hand (and is very difficult to treat with chemicals). 
Fungal diseases have yet to succumb to genetically 
engineered host plant resistance, but it is reasonable 
to expect varietal releases in this area in the next 
decade; engineered viral resistance has already been 
deployed in some crops [31]. Actual yield losses 
due to weeds are estimated globally at about 10% by 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Table 2 Constraining factors contributing 
to the farm yield-potential yield gap and their alleviation so that farm yield can approach the attainable yield 
corresponding to the current potential yield with realistic economics 



Resolution 


Research 


Constraint 

Agronomic 

Breeding 

Institutional/infrastructural 

General farmer constraints 

Lack of farmer 

awareness or conviction 
or skill 

On-farm demonstration 

On-farm testing and selection 

Education, media campaigns, 
extension 

Risk aversion by farmer 

Forecasts, tactical decision 
making (e.g., for N top 
dress) 

Tolerance of extreme weather 
events, like drought, flooding, 
hail, frost, wind 

Insurance schemes, favorable 
credit terms, price stability 

Inadequate labor 
supply 

Mechanization, reduced 
tillage, herbicides 

Select for uniform maturity to 
favor mechanical harvesting 

Facilitate labor migration; credit 
for mechanization 

Technical constraints 

Lacking major long¬ 
term soil amelioration 

Drainage, land leveling, 
liming, deep tillage, 
gypsum 

Waterlogging, acidity, 
and salt tolerance 

Long-term credit for major soil 
amelioration, secure tenure 

Excess tillage and loss 
of moisture, soil 
compaction 

Conservation tillage 
options and suitable 
machinery, controlled 
traffic 

Suitable varieties: disease and 
herbicide tolerance 

Credit for new machinery 

Manageable topsoil soil 
toxicities 

Ameliorate (e.g., lime for 
acidity) 

Acidity, aluminum tolerance 

Input suppliers, credit for lime 

Suboptimal nutrient 
supply 

Diagnostics, application of 
nutrients, slow release 
forms, tactics to match 
supply with need 

Some scope for improved N, P, 
and Zn uptake and utilization 
efficiency 

Infrastructure, input suppliers, 
credit, quality control 

Soil variation within and 
between adjacent fields 

Diagnostics to aid 
adjustment of application 
rates 

Greater tolerance of soil 

stresses 


Growing old varieties, 
or use of poor seed 

Better on-farm seed 
management and storage 

FI hybrids, licensed traits, 
royalties to encourage strong 
seed industry 

Strong seed industry and proper 
regulation, credit 

Incorrect time of sowing 

Mechanize, reduced tillage 
to speed sowing; 
treatments to warm soil for 
spring sowing 

Make available varieties with 
range of maturities; herbicide 
and cold tolerant varieties 

Policy to favor mechanization, 
contract seeding 

Poor plant population 

Better drilling procedures 
and machines, quality seed 
storage 

More robust varieties (e.g., 
long coleoptile in wheat, more 
tillering) 

Seed testing and regulations 

Diseases and pests, 
above and below 
ground 

Biocides, sanitation, crop 
rotation, IPM 

Host plant resistance 

Input suppliers, pesticide 
quality control 
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Crop Yields Around the World: Closing the Gap and Raising the Potential. Table 2 (Continued) 



Resolution 


Research 



Constraint 

Agronomic 

Breeding 

Institutional/infrastructural 

Weeds 

Herbicides, cultivation, 
sanitation, crop rotation 

Enhance crop plant 
competitiveness, herbicide 
tolerance 

Herbicide quality control, 
release regulation 

Poor water 
management in 
irrigated systems 

Improve water application 
techniques and skills, land 
levelling 

Greater tolerance water 
shortage and excess 

Efficient supply systems to farm 

Long-term soil 
degradation 

Crop rotation, fertilizer, 
green manuring, farm yard 
manure, conservation 
tillage, amelioration 

Varieties adapted to biotic and 
abiotic stresses of high plant 
residue levels, and with good 
residue production 

Tenure regulations ensuring 
land ownership by farmer 


Oerke [30], to which should be added the costs of 
current control measures. More competitive cultivars 
can help, but again genetic engineering has brought 
revolutionary advances in ease, cost, and effectiveness 
of weed control. Of course, with any chemical-based 
susceptibility of biotic stress agents, there will be the 
risk of resistance evolving in the target organism, but 
integrated management, albeit requiring greater farmer 
skills, can prevent that. Given proper R, D, and exten¬ 
sion, the potential impact of herbicide-tolerant culti¬ 
vars on FY in labor-limited African cropping is likely 
substantial. Of course, all breeding solutions to aid gap 
closing presuppose an effective seed production and 
distribution system. Systems are gradually improving 
in developing countries and most commercial farmers 
grow improved varieties although the rate of turnover 
of varieties is often too slow. There is little doubt that 
wherever plant breeding is privatized, competition 
drives quicker variety turnover; in advanced systems, 
it also drives significant agronomic extension by the 
breeding firms keen to maximize variety by manage¬ 
ment interactions and to retain clients. 

Lifting Potential Yield 

Although separated by a yield gap and a time lag, many 
situations show a close relationship between progress 
in PY and that in FY, as new technologies eventually 
find their way to farmers. Often the process is facili¬ 
tated by new varieties linked to agronomic advances, 


for new varieties are fairly quickly adopted in most 
commercial farming. Sometimes, the PY advances owe 
more to innovative farmers than to researchers, as for 
example, when farmers moved to earlier planting per¬ 
mitted by direct seeding in southern Australia. In all 
cases, increase in PY is an important component driv¬ 
ing increased FY, inevitably so where the yield gap is 
approaching the economic minimum (e.g., wheat in 
the UK), but less so where it is large (e.g., paddy or 
rainfed rice in India), and even less when the gap is 
huge (e.g., maize in sub-Saharan Africa). 

Current rates of PY progress were summarized for 
the key cereals in [5]. These numbers come from 
breeders’ and researchers’ trials containing varieties of 
different vintage grown under modern agronomic 
management with little or no biotic stress; thus, the 
rates capture as well the positive interactions between 
new varieties and modern management. Linear regres¬ 
sion is used to calculate the absolute rate of progress 
which is then expressed as a percentage of most recent 
predicted PY in the series, hopefully a recently released 
variety (>2006) but such data is not always available. 
For wheat, progress ranged from 0.3% to 1% p.a. with 
an average of 0.6% (n = 6) and with little difference 
between water-unlimited and water-limited PY pro¬ 
gress (see also Figs. 1 and 5). For rice, the range was 
0.2-0.7% (average of 0.4%, n = 4). Progress in PY at the 
International Rice Research Institute (IRRI) is disap¬ 
pointing, at close to zero; however, there is a one-off 
yield jump of 10-15% through the recent exploitation 
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of FI hybrids feeding into the tropical rice regions, and 
following upon on a similar gain in China over the last 
20 years as hybrids moved to >50% of their acreage. 
For maize, there is only one estimate in [5], coming 
from Iowa and showing 1%. Data of Luque et al. [32] in 
a somewhat similar environment in Argentina give 
a rather similar number for hybrids released up to 
1997 (1.3%), but since then PY progress may have 
slowed (M. Otegui, 2009, personal communication). 
A subsequent report from Iowa with hybrids of a rival 
company showed progress at 1.7% but the breeding 
period sampled was only 2001-2006 [8] . In an irrigated 
favorable Mediterranean environment, data presented 
by Campos et al. [33] indicate PY progress in the Iowa 
Pioneer hybrids of 1% (but interestingly progress in 
PYw of 1.5% under artificial severe mid-season water 
stress). PY progress in cereals is discussed in more 
detail in [9]. Potential yield for soybeans in the USA 
appears to be progressing at 0.7% p.a. (R. A. Fischer, 
2010, unpublished). 

From the above, it is apparent that even in advanced 
situations with substantial breeding investment, poten¬ 
tial yield progress from this source is currently at or well 
below 1%, with maize showing more progress than other 
major crops. Such PY progress is unlikely alone to drive 
FY progress at the rates needed. Prospects of lifting this 
rate of PY progress were now discussed briefly. 

New Yield-Positive Agronomic Techniques 

Improved agronomy has played a large if not dominant 
role in past yield increase (e.g., increased fertilizer use, 
better weed control, better seeding techniques for ear¬ 
lier seeding and more reliable crop stands, and water 
conserving reduced tillage), and most improvements 
have interacted positively with variety improvement 
(e.g., [20]). While many authors have consistently 
failed to anticipate agronomic advances in the past 
[34], currently it is hard to see any new ones raising 
PY or PYw (as distinct from advances specifically 
targeting improved input efficiency). One example 
might be improved seasonal forecasts which would 
permit the farmer to better tailor management and 
variety to expected weather, something currently lim¬ 
ited by the low skill of forecasts. Another, which would 
clearly lift PYw, would be the reduction of soil evapo¬ 
ration with inexpensive plastic films (e.g., spray- 


applied nanofilms), copying a common practice seen 
in northwest China field crops, but designed so as not 
to contaminate the environment. Also it is possible 
advances can still come from better management of 
soil-root-microbe interactions, a complex and 
neglected area of research: poorly explained observa¬ 
tions of “yield decline” and “break crop effects” point 
in this direction, as do reports of positive effects of 
increased soil organic matter (independent of nutri¬ 
tion), controlled traffic and reduced soil compaction. 

Greater Investment in Current Conventional 
Breeding Efforts for Yield 

Yield improvement through plant breeding is generally 
regarded as delivering benefits far in excess of the costs 
[35, 36]. Nevertheless, it has been suggested that 
returns from the investment in breeding personnel are 
diminishing, for example in maize, and that yield gains 
are getting harder to achieve [37]. This is occurring 
despite greater efficiency through computerization, 
advanced biometrics, mechanization, robotics, and 
techniques for rapid generation advance, all part of 
any conventional breeding today. What is quite unclear 
however is the marginal return on investment in con¬ 
ventional breeding; for example, what would happen if 
the number of breeders, and of crosses, selections, and 
yield trials, were increased say 50% in any currently 
substantial program? In crops where breeders are 
devoting many resources to maintaining disease resis¬ 
tance (e.g., wheat) or eating quality (e.g., wheat, rice), 
and thus unable to give full attention to yield, PY 
progress is likely to increase, although probably less 
proportionally that the proportional investment 
increase. A related question is whether any extra 
funds would be better spent on new breeding tools. 

New Tools to Increase Progress in Conventional 
Breeding 

Genetic variation is the basis of yield progress. It is 
unlikely that all or even the major part of the existing 
genetic variation has been utilized in any crop. Sam¬ 
pling germplasm collections is a formidable task, but 
new tools are becoming available for seeking out new 
and useful variation, such as the Focused Identification 
of Germplasm Strategy (FIGS, [39]) and Ecotilling 
[38]. Because of the low chance of success, this work 
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tends to be in the area of publically funded 
prebreeding, with only medium to long-term impacts 
on breeding, but it is worth noting that the steady 
reduction in genome sequencing costs is also opening 
up new ways of allele searching [40]. 

Both parental and especially early generation 
selection can benefit from the use of low cost perfor¬ 
mance-related markers. For the last 50 years or so, 
physiologists have attempted to identify such markers 
or traits, physiological or morphological, or even whole 
ideotypes, which would lead to greater yield. There 
have been some successes, for example, the first high- 
yielding semidwarf tropical rice variety, IR8, resulted 
from the pursuit of a specific ideotype [41], and breed¬ 
ing for resistance to some toxicities is usually based on 
early generation physiological screening (e.g., alumi¬ 
num tolerance). However, indirect selection for both 
PYw and especially for PY has proved challenging, and 
much physiology has been restricted to the retrospec¬ 
tive identification and understanding of traits which 
have changed as direct selection has increased yield. 
A good example are the many traits that have been 
found to change as maize yields increased in North 
America under direct selection for yield, lodging resis¬ 
tance, dry down, and disease resistance alone [37, 42]. 
As understanding of yield determination improves, 
including the recognition that seemingly useful traits 
can carry trade-offs, and as measurements of many 
traits become cheaper, especially through the use of 
remote sensing in the field, the possibilities for physi¬ 
ological traits have improved. At CIMMYT, maize 
breeding for drought (PYw) has definitely benefitted 
from the inclusion of physiological traits such as anthe- 
sis to silking interval in managed drought environ¬ 
ments [43]. More recently, with wheat there has been 
testing of canopy temperature as an early generation 
selection criterion, a surrogate for stomatal conduc¬ 
tance and possibly photosynthetic rate under irriga¬ 
tion, or possibly for rooting depth under drought; it 
shows promise under both conditions [44], as does 
multispectral canopy reflectance under irrigation 
[45] . In CSIRO, Australia, ideotrait selection for greater 
PYw has been under way for several decades, with focus 
on traits such as water use efficiency via carbon isotope 
discrimination, coleoptiles length, low tillering, and 
high stem carbohydrate content at flowering, with 
moderate success [46]. Very recent research points to 


genetic differences in wheat for grain set under drought 
and ways for screening this trait which is likely to be 
a significant bottleneck in PYw determination [47]. 

While physiology may be beginning to look more 
useful in early generation yield selection, it is now being 
challenged by the rapidly growing use of molecular 
markers for traits, initially qualitative ones but lately 
also quantitative traits, including yield itself. The huge 
decline in cost of detecting molecular markers, and 
their increasing density across crop genomes is driving 
this. Heffner et al. [48] review the possibilities of such 
marker-based genome selection in crops. Earthington 
et al. [49] describe in some detail the extent to which 
molecular markers are being incorporated into the 
Monsanto crop breeding programs: rates of progress 
in multiple trait indices (e.g., including yield, grain mois¬ 
ture, standability, and test weight for maize) can be dou¬ 
bled; indeed, for these authors marker-aided selection is 
the “new” conventional breeding. These approaches pre¬ 
sume accurate phenotyping of genotypes for yield and its 
interactions with target environments, a challenge in all 
breeding. However, it is interesting that physiology hardly 
features in this new scenario, although other groups are 
busily mapping putatively useful physiological traits at 
the molecular level (e.g., [50, 51]). It remains to be seen 
whether physiological traits, either observed directly 
or identified by accurate markers, will have a place 
in marker-aided yield breeding, but it is possible some 
combination of markers and early generation trait 
phenotyping using smart remote sensing will prove an 
even more efficient strategy for PY advance. In addition 
physiology, just like plant pathology, will continue to 
identify individual traits whose incorporation into lead¬ 
ing varieties may lift PY or PYw, a process which could 
benefit from the use of molecular trait markers, just as is 
becoming the case with new hard-to-screen host plant 
disease resistance genes. 

Multilocation multiple-year yield testing remains 
the essential final step in all plant breeding for yield; 
testing under different management regimes (plant 
density, planting date, soil stored water) is also recog¬ 
nized as increasingly important. Crop simulation 
models are now good enough to help breeders under¬ 
stand the environmental and sometimes the genetic 
bases of the interactions encountered, and thus deploy 
the testing more efficiently, and more efficiently 
indentify the genotypes best suited to major 
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environment x management combinations. An excel¬ 
lent example is the analyses of multiple environment 
rainfed trials of sorghum in Queeensland [52]. 

Genetic Engineering for Yield 

There are many claims of engineered traits that may 
increase crop yield, but very few of these have been 
backed up by yield measurements in the field. Crop 
physiologists have regularly pointed out how different 
is the performance of a genotype growing in a pot in the 
glass house from that of one at normal plant density in 
the field environment [53, 54]. For this reason, they 
remain skeptical of the value of these claims and await 
with much interest the results of field tests, for, to date, 
it appears that only a handful of papers describe per¬ 
formance in proper field tests. In an excellent example, 
wheat transformed with a more active ADP glucose 
pyrophosphorylase in the grain to increase grain sink 
strength and grain size, was thoroughly field tested, but 
there was no yield benefit [55]. More recently, 
Monsanto has reported increased PYw in field plots 
with maize transformed with two separate events [56, 
57]; results in the latter case point to modest PYw 
increases and no change in PY, but the nature of the 
drought and of the crop physiological responses are not 
well described. More experimental data is needed 
before much confidence can be placed in this event, 
or in engineering for yield in general, for both genetic 
and physiological understanding of yield determina¬ 
tion indicates that many processes are involved, and 
that the impact of any single transformation is likely to 
be small even if key regulatory genes or bottleneck 
processes are targeted. Indeed, it is now being recog¬ 
nized that if functional genomics is to have a significant 
impact on yield through engineering yield processes, it 
must pay more attention to the relevant plant and crop 
physiology connecting gene action to field performance 
[58]; even the current fashion for automated 
phenotyping in controlled facilities must be balanced 
by substantial investment in phenotyping in field plots 
in managed environments. 

Some Tentative Conclusions on Future Potential 
Yield Progress 

As PY increases, relative rates of PY progress are 
decreasing; this is probably more than the consequence 


of a linear rate of progress relative to a rising denom¬ 
inator, and may, in situations where the breeding 
investment is large (e.g., maize in North America, 
wheat in the UK and the rest of Western Europe), also 
reflect diminishing returns as biological limits are 
approached. Theoretical limits to harvest index are 
already being reached, but those to radiation use effi¬ 
ciency appear well above current measured values [59] . 
There have been a couple of attempts to project PY into 
the future. For example, Sylvester-Bradley et al. [60] 
propose that a longer period of light capture and 
a higher radiation use efficiency could lift PY of winter 
wheat in the UK to 17.4 t/ha by 2050, a linear rate of 
progress of 167 kg/ha/year, or two and half times the 
current rate of progress seen in Fig. 1. Taking a different 
tack, Monsanto has predicted a doubling of US maize 
yield between 2000 and 2030, based on equal contribu¬ 
tions from conventional breeding, marker-aided selec¬ 
tion, and genetic engineering, and amounting to 
a linear annual rate of progress which is 3.3% of today’s 
yields. Physiological theory and commercial optimism 
both, however, have their limitations. Looking more 
cautiously at the ways of impacting PY progress 
outlined above, it is concluded that breeding progress 
will dominate in the future, that conventional breed¬ 
ing, including the exploitation of heterosis, will con¬ 
tinue to deliver small but declining gains, while new 
approaches may boost gains enough to maintain cur¬ 
rent rates of linear PY progress at between 0.5% and 
1%, other things such as climate remaining equal (note 
that given the way PY is defined here, this progress is 
independent of any direct effect of atmospheric C0 2 
increase). Observers expect the investment in the rele¬ 
vant R and D to remain large, as the private sector 
involvement increases to balance a declining public 
sector, and many argue that it should grow further, 
although the marginal rate of return is not well known. 

Future Directions and Synthesis 

The key issue for future food security is the real grain 
price needed to balance burgeoning demand against 
increasing supply. World food production demand 
and supply to 2050 have recently been projected by 
Hubert et al. [61] with the IFPRI Impact model, and 
by Tweetin and Thompson [62] . Thus, the baseline case 
of the former paper projects a 56% increase in cereal 
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production over 2000, accompanied by substantial real 
price increases over the 2000 base, with negative effects 
on malnutrition. Demand growth is greater for maize 
and least for rice, such that the projected real price 
increases over 2000 are maize (97%), wheat (90%), 
and rice (60%). Clearly, a 56% increase in production 
is not enough. For Tweetin and Thompson [62], cereal 
supply rises somewhat more, by around 75% in 2050, 
but real prices still increase (on average 44%). The 
IFPRI projections allow for biofuel demand (peaking 
in the late 2020s at 16% of total maize consumption, 
after which second-generation biomass biofuel takes 
over), rising energy costs and likely restrictions on 
irrigation water supply. They also point to a much 
greater increase in grain demand in the first 25 years 
(1.4% pa) than in the second 25 years (0.4%), clearly 
placing more pressure on short- to medium-term yield 
progress. The IFPRI model contains crop area as 
a variable but, given the cost of developing new lands 
and its scarcity, this only increases significantly if prices 
more than double under a very low yield progress 
scenario. 

In the IFPRI baseline projection, yield grows to 
meet demand at around 1% exponential projected 
over the whole period. In [62], it is expected to grow 
linearly at a rate equivalent to 1.4% of current yields. 
These rates compare to the current rates of world yield 
growth, which are wheat and rice (0.9%), maize (1.6%), 
and soybeans (1.1%), but it must be recalled that the 
actual rates all appear to be linear, not exponential, and 
that the projected rates in neither of the models are 
enough to hold real prices steady at 2000 levels. In 
addition, the elasticity of demand is such that a supply 
shortfall due to below baseline yield growth would be 
disastrous (as we saw 2 years ago although some other 
factors added to the sharp doubling of prices then): the 
projections in [61] suggest a 40% fall in yield growth 
below baseline (to 0.6% exponential) would lift 2050 
real cereal prices to more than 200% above 2000 real 
prices!! Alternatively, 1.4% yield growth would mean 
continuing declines in real grain prices. 

These numbers highlight the importance of yield 
growth that has been the subject of this entry and reveal 
that current FY growth rates are inadequate to hold real 
prices down. Since PY growth can contribute to FY 
growth everywhere, and is the only source of growth 
in advanced agricultural systems where FY is 


approaching PY, it assumes great underlying impor¬ 
tance, even as current rates of growth, except possibly 
for maize, seem to be well below 1% and gradually 
falling as a percentage of yield. Unfortunately, the 
brief examination here suggests that the prospects of 
boosting PY growth, largely now relying on plant 
breeding, are small, or at least rather uncertain, espe¬ 
cially in the short to medium term when yield increases 
are most critical. Much will depend on whether the 
molecular marker-aided yield selection can boost 
rates, for example, double them, as claimed by some; 
genetic engineering seems unlikely to lift PY in the 
short to medium term, although the prospects to lift 
PYw may be somewhat better if water stress-related 
yield bottlenecks can be relieved. And none of these 
projections have factored in the possible negative 
effects of climate change. 

As a consequence of the above, yield gap closing 
assumes great importance for future world food secu¬ 
rity, especially as there still appear to be large exploit¬ 
able gaps (>50%) in many situations, and as the way 
forward is clearer. There are few uncertainties regarding 
what needs to change at the farm level, but less opti¬ 
mism regarding how to achieve this quickly enough 
and from where the necessary resources will come. The 
biggest gaps are associated with small farmers in the 
developing world, and there is little doubt that strate¬ 
gies to help these farmers will have the greatest benefits 
for alleviating poverty and food insecurity. And note 
that the association between large yield gaps and small 
farmers is not an inevitable association, as can be seen 
among small holders in Egypt or parts of eastern China. 
This entry has emphasized that improvements in rural 
infrastructure and institutions are critical, but that 
breeding and agronomic research also has a role to 
play in gap closing, as is illustrated by the impact of 
simple direct seeding machinery and earlier sowing on 
wheat yields in the Indo-Gangetic Plains or of Bt cotton 
with Indian small holders. It is a great pity, however, 
that the types of on-farm adaptive research and devel¬ 
opment needed for gap closing in such situations are 
often thwarted by policy inertia toward agriculture in 
general, and poor rewards by science funders in partic¬ 
ular, although the situation may be gradually improv¬ 
ing. In developed countries, new paradigms seem to be 
emerging in which private sector companies, especially 
breeding and seed companies, invest not only in 
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developing better varieties but also in applied agro¬ 
nomic research linked to variety development and pro¬ 
motion, and this may account for some of the gap 
closing in the USA and Western Europe. In addition, 
and especially in Australia and the southern cone of 
South America, privately hired agronomists are assum¬ 
ing a critical role in yield gap closing and are rewarded 
accordingly. Taken overall such changes auger well for 
ongoing yield gap closing, even in the short to medium 
term, in the developed world, but their impact at the 
farm level in developing countries, and that of more 
traditional approaches, will remain limited by infra¬ 
structure and institutional constraints in the near to 
medium-term future. The cautious optimism 
expressed a decade ago by leading agricultural scientist 
Evans [4] regarding world food security may well be 
overly optimistic. 
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Glossary 

Agriculture Agriculture and farming are often consid¬ 
ered to be the same concept. However, both con¬ 
cepts can vary in their territorial scope of 
application. Agriculture is the production of food 
and goods through farming and forestry. Agricul¬ 
ture encompasses a wide variety of specialties and 
techniques. For this reason, its definition has devel¬ 
oped to become: The science, art, and business of 
cultivating soil, producing crops, and raising live¬ 
stock. The major agricultural products can be 


broadly grouped into foods, fibers, and raw mate¬ 
rials. As of late, agriculture also uses plants to pro¬ 
duce biofuels, biopharmaceuticals, and bioplastics. 

Agricultural practices Agricultural practices are a set 
of techniques applied to on-farm production and 
postproduction processes, resulting in food and 
nonfood agricultural products. 

Agronomic crops Agronomic crops typically involve 
a crop that is grown for grain, feed, or for 
processing into oil, starch, protein, or flour. Major 
agronomic crops include corn (grown for feed, 
ethanol, or processing), soybeans, wheat, hay 
(alfalfa and legume and grass mixtures), rice, pea¬ 
nuts, and cotton. Hay is also considered forage [ 1 ]. 

Agroecosystem An agricultural system or agricultural 
ecosystem is the basic unit of study for an 
agroecologist. This term is somewhat arbitrarily 
defined as a spatially and functionally coherent 
unit of agricultural activity, and includes the living 
and nonliving components involved in the unit as 
well as their interactions. An agroecosystem can be 
viewed as a subset of a conventional ecosystem. As 
the name implies, at the core of an agroecosystem 
lies the human activity of agriculture. However, an 
agroecosystem is not restricted to the immediate 
site of agricultural activity (e.g., the farm), but 
rather includes the region that is impacted by this 
activity, usually by changes to the complexity of 
species assemblages and energy flows, as well as to 
the net nutrient balance. 

Agronomy The science which establishes the theory 
and practice of crop production and soil manage¬ 
ment [1]. 

Ecosystem A functioning community of nature that 
includes fauna and flora together with the chemical 
and physical environment with which they interact. 
Ecosystems vary greatly in size and characteristics, 
and can be a mud puddle, a field, or orchard, or 
a forest. An ecosystem provides a unit of biological 
study and can be a unit of management [2]. 

Environment The totality of the surrounding external 
conditions (biological, chemical, and physical) 
within which an organism, community, or object 
exists. The environment can be defined at any scale. 
The term is not exclusive in that organisms can be 
and usually are part of another organism’s environ¬ 
ment. Thus, one can speak of the environment as 
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that within which humankind lives (i.e., separate 
and external) or, of humankind as a component of 
the environment [2]. 

Definition of the Subject 


(no-tillage CS), method of fertilization (organic CS), or 
agroecosystem (semiarid CSs) to simplify the designa¬ 
tion of the CSs. However, it is important to note that 
a CS involves much more than a rotation, tillage sys¬ 
tem, fertilization method, or climatic conditions. 


c 


Cropping system (CS) is a general term that describes 
how a producer or farmer might grow a crop [2]. 
Pragmatically, CSs effectively address the what to 
grow , when to grow it , and how to grow zt considerations 
of crop production in the context of optimizing mul¬ 
tiple goals [3] . A CS must bring together the biological, 
technical, economic, and sociological aspects of 
the land area farmed [4]. Therefore, a CS is a set 
of agronomic or agricultural practices used in 
a crop under specific conditions. The CS generates an 
agroecosystem, agri-environment, or agricultural sys¬ 
tem. Common synonyms of CS include crop manage¬ 
ment system, crop production system, farming system, 
crop production practices, etc. For example, a farming 
system is defined as a population of individual farm 
systems that have broadly similar resource bases, enter¬ 
prise patterns, household livelihoods, and constraints, 
and for which similar development strategies and inter¬ 
ventions would be appropriate [5]. 

The classification of CSs has been based on 
the following criteria: available natural resource base, 
including water, land, grazing areas, and forest; climate, 
of which altitude is one important determinant; 
landscape, including slope; farm size, tenure and orga¬ 
nization; dominant pattern of farm activities and 
household livelihoods, including field crops, livestock, 
trees, aquaculture, hunting and gathering, processing 
and off-farm activities; and taking into account the 
main technologies used, which determine the intensity 
of production and integration of crops, livestock, 
and other activities [5]. Multiple cropping, in which 
several crops are grown sequentially in 1 year, and 
intercropping, when several crops are grown at the 
same time, are other kinds of annual CSs known as 
polycultures [6]. 

In general, each and every agronomic practice used 
in a CS affects different aspects of the system, i.e., the 
tillage system prepares the seedbed but affects the 
availability of water and nutrients, carbon cycle, ero¬ 
sion, diseases, etc. The use of the term CS is often 
applied to a rotation (wheat-fallow CS), tillage system 


Introduction 

Since the dawn of settled agriculture about 10 millennia 
ago, human beings have integrated different agronomic 
practices for application to selected plants (crops), and 
as a result generated CSs that transformed the natural 
ecosystem into agroecosystems. Agricultural practices 
such as genotype selection, crop rotation, natural fer¬ 
tilizers, irrigation, etc., were developed long ago, but 
have made great strides in the past century. The history 
of agriculture has played a major role in human history 
as agricultural progress has been a crucial factor in 
worldwide socioeconomic change. When human 
beings began to produce food beyond their needs, 
others activities began to take place. It is this agricul¬ 
tural surplus that made the development of civilization 
possible. A crucial time in the history of agricultural 
began with the discovery of America. A global exchange 
of previously local crops and livestock breeds occurred. 
Key crops involved in this exchange from the New 
World to the Old included the tomato, maize, potato, 
manioc, cocoa bean, and tobacco, with several varieties 
of wheat, spices, coffee, and sugar cane going from the 
Old World to the New. With the rapid rise of mecha¬ 
nization in the early twentieth century, particularly in 
the form of the tractor, farming tasks could be accom¬ 
plished with a speed and on a scale previously impos¬ 
sible. Also during this period, the Haber-Bosch 
method for synthesizing ammonium nitrate 
represented a major breakthrough. Synthetic nitrogen, 
along with mined rock phosphate, pesticides, and 
mechanization, allowed crop yields to overcome previ¬ 
ous constraints. Furthermore, global yield increases 
were experienced later in the twentieth century when 
high yield varieties of wheat, corn, and rice were intro¬ 
duced as a part of the Green Revolution. The Green 
Revolution exported technologies (including pesticides 
and synthetic nitrogen) from the developed world to 
the developing world. However, concerns have been 
raised over the sustainability of intensive agriculture. 
Intensive agriculture has become associated with 
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decreased soil quality, and there has been increased 
concern over the effects of fertilizers and pesticides on 
the environment, particularly as the population 
increases and food demand expands. 

As a result, in the past few decades, a move toward 
sustainability in agriculture has also developed, inte¬ 
grating ideas of socioeconomic justice and conserva¬ 
tion of resources and the environment within an 
agricultural system [7, 8]. Sustainable agriculture is 
defined as practices that meet current and future soci¬ 
etal needs for food and fiber, for ecosystem services, 
and for healthy lives, and that do so by maximizing the 
net benefit to society when all costs and benefits of the 
practices are considered. If society is to maximize the 
net benefits of agriculture, there must be a fuller 
accounting of both the costs and the benefits of alter¬ 
native agricultural practices, and such an accounting 
must become the basis of policy, ethics, and action. 
Additionally, the development of sustainable agricul¬ 
ture must accompany advances in the sustainability of 
energy use, manufacturing, transportation, and other 
economic sectors that also have significant environ¬ 
mental impacts [9]. Sustainable agriculture has 
renewed research in alternative technologies such as 
no-tillage, integrated pest management, etc. Recent 
mainstream technological developments include 
genetically modified (GM) crops. Also, agriculture 
has needed - and still needs - subsidies from many 
governments to ensure an adequate food supply and its 
competitivity in global markets. However, at times, 
these subsidies, especially when instituted by developed 
countries, have been noted as protectionist, inefficient, 
and environmentally damaging [10]. 

This chapter is not an overview of a subject as 
complex and broad as worldwide CSs, where oversights 
are inevitable. It must first be pointed out that about 
85% of the Earth’s cultivated land is planted with 
annual crops [11], and thus the principle CSs are used 
in herbaceous plants. Of these, wheat, rice, and maize 
provide more than 60% of human dietary calories, 
either as cereals for direct human consumption or 
embodied in livestock products produced from ani¬ 
mals fed with feed grains and their by-products 
[9, 12]. Moreover, it is likely that these same cereal 
crops will continue to account for the bulk of future 
human food supply because they produce greater yields 
of human-edible food, are easily grown, stored, and 


transported, and require less fuel and labor for 
processing and cooking than other food crops [12]. 
Given their importance, this chapter focuses primarily 
on these crops, but also provides a general view that can 
be applied to any CS. The chapter starts off with a brief 
description of CS components and factors that must 
be taken into consideration during their design 
(Components and Design). To provide a new focus on 
defining CSs, and as a result agroecosystems, an 
anthropological view of a CS is then provided 
(Anthropological View). Following, the most important 
effects of these “human ecosystems” on the environ¬ 
ment are reviewed ( Agricultural Practices' Footprint). At 
this point, the chapter goes on to analyze the achieve¬ 
ments of agricultural research and their importance to 
man ( Praise of Agricultural Science). From here, agri¬ 
cultural practices that were unsuccessful in CSs and 
which must be urgently eliminated are studied 
(Eradication of Fallow, Tillage, and Monoculture). After¬ 
ward, the basic principles on which current and future 
CSs should be inspired are discussed. In fact, the appli¬ 
cation of these principles to a certain degree involves 
returning to techniques from the past, based on mon¬ 
itoring technologies of the future ( Biological Synergism 
and Technical Synchronization). The application of 
these principles logically leads to what are called 
Dynamic Cropping Systems, which shift away from the 
application of a series of rigid agricultural practices to 
the agroecosystem at each growth stage. Finally, the 
chapter tackles the Future Challenges for CSs in the 
twenty-first century. 

Components and Design 

All agroecosystems generated by a CS break away from 
“natural” ecosystems, and the main effects are the loss 
of biodiversity and the export of resources. This causes 
environmental imbalance which means more or less of 
a struggle, depending on the intensity of the CS, to 
stabilize it. The more distant it is from a “natural” 
stable ecosystem, the more inputs are required by the 
CS as the result of a greater imbalance. In fact, one of 
the greatest concerns caused by agriculture, which has 
been a problem from the beginning due to the selection 
of few species, is the lack of biodiversity in many CSs. 
Advocates of diversification argue that it provides 
greater income stability [2]. 
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Many CSs throughout the world are characterized 
by variable climate and soils, resulting in a high-risk 
condition for agricultural producers due to extreme 
variability in precipitation and seasonal temperatures 
[13]. In other words, crop production occurs in an 
environment that is always changing. With every grow¬ 
ing season, producers must attend to numerous factors 
that influence their management decisions. Cropping 
systems vary among farms depending on the available 
resources and constraints; geography and climate of the 
farm; government policy; economic, social, and polit¬ 
ical pressures; and the philosophy and culture of the 
farmer [5, 6]. This is a daunting challenge, especially 
when one considers that producers’ decisions are car¬ 
ried out in a financial environment of diminishing 
economic returns, where one wrong decision could 
mean financial hardship and potentially the end to 
a way of life [14]. 

A CS involves numerous interactive factors that 
may limit or facilitate crop production. The factors 
can be divided into the following groups [14]: (1) bio¬ 
physical environment: weather/climate and soil; 
(2) socioeconomic externalities: market conditions 
and government programs; and (3) available technol¬ 
ogy. These three factors affect or guide the agricultural 
practices of the CS (crop election, rotation, tillage sys¬ 
tem, planting date, fertilization, crop protection, etc.), 
and are conceived on the basis of productive and eco¬ 
nomic objectives, as well as the social and environmen¬ 
tal objectives that must be imposed by sociopolitical 
externalities. 

Most developed countries’ agriculture operates in 
a market-driven economy, although government poli¬ 
cies can have an influence on what farmers produce and 
how they produce it. As with other businesses, agricul¬ 
tural producers respond to economic incentives and 
disincentives, and make decisions to maximize their 
welfare; usually measured as net income. Specifically, 
the farmer must consider the influence of technological 
advancements, income supports embodied in farm leg¬ 
islation, and changes in market structure and consumer 
demand. Many of the technological advancements have 
required large-scale production units to justify the 
investment. The influence of commodity support pro¬ 
grams has been ambiguous. As farm legislation has 
evolved to decouple production decisions from pro¬ 
gram benefits, the incentives to specialize in program 


crops (crops that receive price and/or income benefits 
under governmental legislation) have diminished. 
However, wealth and risk effects, albeit small, may 
have promoted or inhibited the adoption of a more 
sustainable system. Changes in market structure, chan¬ 
nels, and consumer demand in the past five decades 
have been dramatic with consolidation and specializa¬ 
tion in both production and marketing sectors. How¬ 
ever, the diversity of consumer demand has also created 
opportunities for more integrated farm operations. 
There is an increasing number of consumers who 
have become concerned about how and where their 
food has been produced (food security). While price 
and income supports may have been biased toward 
specialization (as these programs were targeted to spe¬ 
cific commodities), the reduction in risk associated 
with the programs has enabled producers to expand 
the number and diversity of their production enter¬ 
prises [15]. 

The rapid change in the agricultural industry driven 
by continuously arising challenges (climate change, 
market globalization, environmental concerns) 
requires the development of new methods of produc¬ 
tion in order to guarantee sustainable agriculture [16]. 
The design of a CS undergoes a study of the biophysical 
environment and socioeconomic externalities, and an 
inventory of available technologies. The first step in 
developing a CS is the definition of goals and con¬ 
straints for the new CS. Constraints may result from 
soil and climate but also from environmental or eco¬ 
nomic concerns [17]. Afterward, a crop portfolio must 
be established, usually based on climate, and 
containing a diverse array of adaptable crop species, 
economic potential, crop production practices, and soil 
and water management considerations. The crop port¬ 
folio is used to screen adaptable crops for a region and 
includes the best management practices for the pro¬ 
duction of each adaptable crop [ 14] . In others words, it 
is the selection of crops compatible with the set of 
constraints. Next is the comparison and choice of the 
most satisfying crop and management options [16]. 
Designing new CSs is a long process and occurs in 
a rapidly changing environment. This is exemplified 
by climate but also by the economy (prices and poli¬ 
cies) and the changing demands and functions that 
society assigns to agricultural systems. The design pro¬ 
cess must therefore integrate objectives of the resilience 
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and flexibility of CSs in an unpredictable environment 
[ 18] . Obviously, the use of models for the design of CSs 
can be an important tool for the farmer, although there 
are some drawbacks [16]. Geographic information sys¬ 
tems, qualitative decision-matrix analyses, a simple 
rule-based model using multi-criteria evaluations, 
and a machine learning-based land-transformation 
model can be used harmoniously to study complex 
socio-ecological systems. Models evaluate how each 
technique performs in the study of complex socio- 
ecological systems using a multi-tier framework, detail¬ 
ing how each method analyzes the resource system, 
resource units, governance system, users and interac¬ 
tions, and outcomes in the system. Model use enhances 
our understanding of the land-use decision-making 
process [19]. 

Anthropological View 

Beginning around 8,500 BC, the transition from the 
hunter-gatherer lifestyle to food production enabled 
people to settle down next to their permanent agricul¬ 
tural lands, instead of migrating to follow seasonal 
shifts in wild food supplies. Food production was 
accompanied by a human population explosion that 
has continued unabated to this day, resulting from two 
separate factors. First, the sedentary lifestyle permitted 
shorter birth intervals. Nomadic hunter-gatherers had 
previously spaced out birth intervals at 4 years or more 
because a mother shifting camp can carry only one 
infant or slow toddler. Second, plant and animal spe¬ 
cies that are edible to humans can be cultivated in 
much higher density in our agricultural land than in 
wild habitats [20]. 

Agriculture also led to an explosion of technology 
because sedentary living permitted the accumulation of 
heavy technology such as forges that nomadic hunter- 
gatherers could not carry and because the storable food 
surpluses resulting from agriculture could be used to 
feed full-time craftspeople and inventors. By also feed¬ 
ing full-time kings, bureaucrats, nobles, and soldiers, 
those food surpluses led to social stratification, political 
centralization, and standing armies. All of these over¬ 
whelming advantages are what enabled farmers to 
eventually displace hunter-gatherers [21]. 

Like all species, humans have exercised their 
impulse to perpetuate and propagate themselves. In 


doing so, humankind has “domesticated” ecosystems 
in ways that enhance its food supplies, reduce exposure 
to predators and natural dangers, and promote com¬ 
merce. On average, the net benefits to humankind of 
“domesticated” nature have been positive. We have, of 
course, made mistakes, causing unforeseen changes in 
ecosystem attributes, while leaving few, if any, truly 
wild places on Earth [22]. Domestication of plants 
and animals may be the single most important feature 
of the human domination of our planet (Fig. 1). 
Domestication involves the selection of traits that fun¬ 
damentally alter wild species to become more useful to 
us. Conservation has often been framed as the science 
aimed at protecting nature, and especially protecting 
nature from people. However, there really is no such 
thing as nature untainted by people [24]. Facing this 
reality should change the scientific focus of environ¬ 
mental science. Instead of recounting doom-and- 
gloom statistics, it would be more fruitful to consider 
the domestication of nature as the selection of certain 
desirable ecosystem attributes, such as increased food 
production, with consequent alteration to other eco¬ 
system attributes that may not be desirable. Under this 
paradigm, our challenge is to understand and thought¬ 
fully manage the trade-offs among ecosystem services 
that result from the inescapable domestication of 
nature [22]. 

It is clear that cities are the main consumers of most 
ecosystem services. This is important because the desire 
and value for these services determines the traits that 
humans select for preservation or elimination. For 
example, if humans want to maximize food produc¬ 
tion, landscapes will be domesticated to accommodate 
a few high productivity species, plus the human- 
associated species able to survive in these modified 
landscapes. If people want more wildlife for recrea¬ 
tional hunting, populations of predators of game spe¬ 
cies will be reduced, and the edge habitat that a few 
game species prefer will be increased. The choices and 
actions of urban dwellers influence nature far removed 
from cities, yet urban dwellers are increasingly unaware 
of these impacts [22]. 

Agricultural Practices' Footprint 

The huge magnitude of human impacts in the environ¬ 
ment is recent, but the presence of impacts such as 
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Cropping Systems: Shaping Nature. Figure 1 

Stages in "domestication" of nature by human being in relation with the area occupied (Adapted from [23]) 


purposeful wildfires goes back thousands of years. The 
reality of the human footprint renders discussions 
about what areas of the world to set aside as wild and 
protected areas as somewhat irrelevant; more germane 
is a discussion of what trade-offs we are willing to 
accept as a result of the domestication of nature [22]. 

The main environmental impacts attributed to 
agriculture come from the conversion of “natural” eco¬ 
systems to agroecosystems by CSs [9] . Clearing land for 
agriculture, humans target wild species for harvest or 
elimination. Humans have so tamed nature that few 
locations in the world remain without human influence 
[22] (Fig. 1). The whole notion of a “virgin rainforest” 
may be erroneous, with extensive prehistoric human 
activity evident in what were once thought to be 
untouched forests in the Amazon and Congo [25]. 
Global maps of human impact indicate that, as of 
1995, only 17% of the world’s land area had escaped 
direct influence by humans [22]. Between 1700 and 
1980, the total area of cultivated land worldwide 
increased 466%, and yields increased dramatically, par¬ 
ticularly because of selectively bred, high-yielding vari¬ 
eties, fertilizers, pesticides, irrigation, and machinery. 
For example, irrigation increased corn yields in eastern 
Colorado by 400-500% from 1940 to 1997 [9]. 


Roughly 50% of the world’s surface area has been 
converted to grazed land or cultivated crops [26]. 
Cropping systems (areas where at least 30% of the 
landscape is in croplands, confined livestock produc¬ 
tion, or freshwater aquaculture) now cover a quarter of 
the Earth’s surface, partly by conversion of temperate 
grasslands, Mediterranean climate forests, and many 
tropical ecosystem types. More than half of the world’s 
forests have been lost in this land conversion [26]. 
Forests have essentially disappeared from 25 countries, 
with 9.4 million hectares lost annually from the Earth’s 
surface [27]. Species and populations of species are 
being lost at unprecedented rates, while at the same 
time, the global biota is becoming homogenized, owing 
to the introductions of alien species to new regions. 
These examples represent major losses of pieces of the 
biosphere machinery, which have a serious impact on 
the delivery of ecosystem-regulating services - impacts 
such as greater prevalence of infectious diseases in 
disrupted ecosystems, adverse effects on local climates 
by ecosystem modification, and the loss of flood 
protection [27]. 

Environmental damages have come from agricul¬ 
tural nutrients that pollute aquatic and terrestrial hab¬ 
itats and groundwater, and from pesticides, especially 
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bioaccumulating or persistent organic agricultural pol¬ 
lutants. Agricultural nutrients enter other ecosystems 
through leaching, volatilization, and the waste streams 
of livestock and humans. Pesticides can also harm 
human health, as can pathogens, including antibiotic- 
resistant pathogens associated with certain animal pro¬ 
duction practices [9]. Many CSs have degraded soil 
quality and necessitated the expense of increased fertil¬ 
ization, irrigation, and energy to maintain productivity 

[28] . Today, only 30-50% of applied nitrogen fertilizer 

[29] and ~45% of phosphorus fertilizer [30] is taken 
up by crops. A significant amount of the applied nitro¬ 
gen and a smaller portion of the applied phosphorus 
are lost from agricultural fields. Nitrogen fertilization 
can increase the emission of gases that have critical 
roles in tropospheric and stratospheric chemistry and 
air pollution. Nitrogen oxides (NO*) emitted from 
agricultural soils and through combustion increase 
tropospheric ozone, a component of smog that impacts 
human health, agricultural crops, and natural ecosys¬ 
tems. NO x from agroecosystems can be transported 
atmospherically over long distances and deposited 
in terrestrial and aquatic ecosystems. Finally, nitrogen 
inputs to agricultural systems contribute to emissions 
of the greenhouse-gas nitrous oxide. Rice paddy agri¬ 
culture and livestock production are the most impor¬ 
tant anthropogenic sources of the greenhouse-gas 
methane [31]. Much of agriculture in the developed 
world had for a time embraced potentially 
non-sustainable systems for economic reasons, 
over-utilizing monocropping, specialization and 
mechanization, which was damaging to soils and the 
environment [32]. Modern agroecosystems are also 
depleted in biodiversity and habitat heterogeneity, 
often with a reduction in resilience as a result of their 
biological monotony [22]. 

Cities are a good place to start when considering the 
broader implications of agroecosystems. The cumula¬ 
tive resource demands of cities are often expressed 
as the total land area required to supply those 
resources, called the “ecological footprint” [33]. Every 
city imports resources and exports waste into a region 
that is spatially much larger than the city’s area. How¬ 
ever, there is substantial variation in per capita ecolog¬ 
ical footprints between rich and poor regions, with the 
average resident of the United States using 6 times the 
area of the average sub-Saharan African [34]. 


Praise of Agricultural Science 

To a conservationist interested mainly in biodiversity, 
we have degraded nature, but to an agronomist, we 
have shaped the nature to make it better serve humans 
[22]. It is paradoxical that current urban dwellers look 
down upon and lack an appreciation of agriculture 
when it is the key to our existence and to obtaining 
the energy needed for their daily activities. There is no 
question that humans have been successful in their 
efforts to produce food, thereby enhancing their well¬ 
being [22]. Contrary to Malthus’s predictions, food 
production has kept up with, and even outpaced, 
human population growth [35]. Malthus did not take 
into account that “there is no gene for the human 
spirit” [36]. The massive increase in food supply has 
been achieved by focusing efforts on planting and con¬ 
suming a small variety of plants [22] . As of 1999, barley, 
maize, rice, and wheat occupied almost 40% of global 
cropland [37]. 

Nowadays, few scientists think of agriculture as the 
chief, or model science. Many, indeed, do not consider 
it a science at all. Yet it was the first science - the mother 
of all sciences; it remains the science that makes human 
life possible; and it may well be that, before the century 
is over, the success or failure of Science as a whole will 
be judged by the success or failure of agriculture [38]. 
For too many years, the agricultural sciences have been 
disparaged in the science and education communities, 
perhaps because agronomy, soil science, plant pathol¬ 
ogy, and animal science use a problem-solving 
approach rather than simply seeking knowledge [39]. 
But as we move into a new era of shared accountability 
and responsibility, let’s keep in mind that agricultural 
sciences affect us all, and when agricultural science is 
thriving, our communities likely are thriving too [39]. 
Agriculture is not seen as a source of solutions to many 
of the world’s most pressing challenges. When science 
research funds are handed out, agriculture often gets 
left off the list. It is possible to suspect this because 
policy-makers and some scientists see “agriculture” as 
synonymous with “agribusiness” rather than as 
a purely scientific discipline, and they assume private 
funding will take care of agriculture-related research 
needs [39]. 

The benefits of agriculture have been immense. 
Before the dawn of agriculture, the hunter-gatherer 
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lifestyle supported about four million people globally 
[40]. Modern agriculture now feeds 6,000 million peo¬ 
ple. Global cereal production has doubled in the past 
40 years, mainly from the increased yields resulting 
from greater inputs of fertilizer, water and pesticides, 
new crop strains, and other technologies of the 
“Green Revolution” [41] (Fig. 2). This has increased 
the global per capita food supply, reducing hunger, 
improving nutrition (and thus the ability of people to 
better reach their mental and physical potential), and 
sparing natural ecosystems from conversion to agricul¬ 
ture [43]. 

Agriculture is the science that has had and con¬ 
tinues to have the greatest impact on humanity. The 
advancements are the result of countless researchers 
unknown to the general public and who deserve proper 
tribute and acknowledgement. We should be eternally 
grateful to people, many of whom are consigned to 
oblivion, such as Justus von Liebig, (1803-1873), who 
is known as the “father of the fertilizer industry,” for his 
discovery of nitrogen as an essential plant nutrient 


and his formulation of the Law of the Minimum 
which described the effect of individual nutrients on 
crops; Sir John Bennet Lawes, (1814-1900) who 
experimented with crops and manures at his farm at 
Harpenden, nowadays known as Rothamsted, and 
established the first long-term field experiment in the 
world, the Broadbalk (1843); Daniel Albone, who 
invented, in 1902, the world’s first successful light 
farm tractor; Fritz Haber and Carl Bosch, whose work 
led to the synthesis of the first N fertilizer in 1913, 
seeing that without the use of synthetic fertilizers, 
world food production could not have increased at 
the rate it did and more natural ecosystems would 
have been converted to agriculture [9]; Sir Ronald 
Aylmer Fisher, who, working in the long-term experi¬ 
ments at Rothamsted in 1925, established the basis of 
the modern statistics that so greatly benefited agricul¬ 
tural research and all other sciences; Erling Johnson, 
who in 1927 developed an industrial method for pro¬ 
ducing nitrophosphate; Paul Hermann Muller, who 
discovered that DDT was a very effective insecticide 
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Cropping Systems: Shaping Nature. Figure 2 

Changes with time in winter wheat yields from 1850 to 2000 with explanations for the trends due to new 
agricultural practice technologies introduced. Treatments show how important is nitrogen fertilization and crop 
rotation (Adapted from [42]) 
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in 1939, in spite of the fact that it was later discovered to 
be hazardous for many living beings, the good it 
accomplished for humanity is immeasurable as it con¬ 
trolled mosquitoes spreading malaria and lice trans¬ 
mitting typhus; Judah Hirsch Quastel, who working 
at Rothamsted, developed the first herbicide (2,4-d) 
in 1946; Norman Ernest Borlaug, father of the “green 
revolution,” led the introduction of high-yielding vari¬ 
eties, doubling wheat grain yield in many countries 
during the 1960s (Fig. 2); John E. Franz, who discovered 
Glyphosate in 1970, a broad-spectrum systemic herbi¬ 
cide; Marc Van Montagu and Jeff Schell discovered the 
gene transfer mechanism, developing the first transgenic 
plant in the 1980s; in addition to a long list of anony¬ 
mous researchers who contributed their part to provid¬ 
ing humanity with the food needed through techniques 
which have become increasingly more respectful of the 
environment and safer for consumption. 

Eradication of Fallow, Tillage, and Monoculture 

There are three agricultural practices that must tend to 
disappear from present and future CSs because it has 
been proven, in general, that they are more harmful 
than beneficial. Two of these, tillage and fallow, have 
been used since the beginning of agriculture, while the 
third, monoculture, emerged during the last century as 
a consequence of the incorporation of other techniques 
that made it apparently viable in the context of a great 
demand for food. These practices must be considered 
errors of the past that the present has allowed us to 
discover. 

Soil is not naturally intended to be in a state of bare 
fallow. CSs must be inspired by or consider nature in 
their design. Soil is meant to be covered by vegetation 
as can be observed in most terrestrial ecosystems; there¬ 
fore, when man included bare fallow in its group of 
agricultural practices, he committed a big mistake. 
Even in the conditions employed to allow low-fertility 
soils to rest, the most ideal option would be a change in 
crop, or if necessary, another option would be to not 
remove the crop. Although it was already known at the 
end of the last century that bare fallow was not useful, 
there have been policies, as is the case of the EU, which 
encouraged this practice as a result of market forces. 
Efforts to stabilize production of cereal crops led to 
the adoption of wheat-fallow CSs. This system, while 


popular with producers because it required limited 
equipment and managerial skills, has proven to be 
agronomically inefficient and environmentally 
unsustainable as shown through poor precipitation- 
use efficiency and decreased soil quality [44, 45]. In 
fact, at least 60% of the precipitation received during 
fallow is lost to evaporation [46]. Recognition of the 
drawbacks of wheat-fallow as well as advances in weed 
and residue management technology led to a reduction 
in the frequency of fallow [13]. Fallow time must be 
limited to those periods in which no water is available. 
Leaving a soil without residue shifts away from natural 
processes since in naturally dry ecosystems, the dry 
pasture leaves its residue protecting the soil against 
aggressive climatic events. All of the suggestions for 
intensifying cropping under dry-land conditions are 
contingent on maximum water capture and minimum 
losses to weeds, runoff, and evaporation. Ideally, crops 
should be synchronized so that there is always one crop 
in the field available to intercept possible rainfall [44]. 
Increased emphasis on crop diversity within annual 
CSs has allowed producers to take advantage of positive 
agronomic benefits derived from crop rotations [14]. 
Realizing these benefits requires knowledge of soil- 
water depletion and recharge characteristics for indi¬ 
vidual crops. Such knowledge is especially critical for 
areas where soil-water status can vary greatly between 
growing seasons [47]. 

Tillage, as is, is also not observed in nature. It was 
man, with the purpose of domesticating nature, who 
introduced it. No-tillage was used since ancient times 
by indigenous cultures. This was because tillage to any 
depth required more energy and power than was gen¬ 
erally possible with hand labor. The ancient Egyptians 
and the Incas in the Andes of South America used 
a stick to make a hole in the ground and put seeds by 
hand into unprepared soil [48]. For thousands of years, 
agriculture and tillage were considered synonymous. 
It was simply not thought possible to grow crops without 
first tilling the soil before planting and for weed control 
[49]. Intensive tillage and use of heavy machinery have 
accelerated soil erosion, soil-C loss and nutrient deple¬ 
tion, soil compaction, acidification, pollution, and sali¬ 
nization [50]. The advent of modern herbicides 
permitted no-tillage to be developed and practiced on 
actual working family farms. No-tillage is generally 
defined as planting crops in unprepared soil with at 
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least 30% mulch cover [49]. Adoption of no-tillage 
after its successful demonstration in the 1950s was 
slow. Today, approximately 23% of the total cropland 
in the United States is planted using no-tillage [49]. 
No-tillage has revolutionized agricultural systems 
because it allows individual producers to manage 
greater amounts of land with reduced energy, labor, 
and machinery inputs. Maintenance of surface residue 
cover is essential to optimize CS performance as resi¬ 
due coverage minimizes erosion, enhances retention 
of limited precipitation, and improves soil quality 
[51]. Moreover, no-tillage substantially improves soil 
carbon, an area that is currently of great importance 
[45]. Crop residues, including the presence of crop 
stubble, can alter the soil environment in a number of 
ways by acting as a physical barrier, exudate of chemical 
suppressants (allelochemicals) from the residue, or 
enhancer of biological activity, and by providing 
a habitat for weed seed predators. These factors, as 
well as the buffering effect of crop residues on soil 
moisture and soil temperature and impacts on light 
availability, have a significant impact on weed-seed 
germination and emergence [52]. Lastly, the adoption 
of no-tillage has an economic benefit for farmers, 
reducing crop costs [53]. 

Monoculture, the lack of biodiversity, was 
a contributing factor to several agricultural disasters 
in history. Monoculture causes a loss of biodiversity 
in the “ecosystem” soil as a result of the monotony, 
leading to a yield loss with respect to a crop in rotation 
[54]. Practices that change species’ composition or 
reduce biodiversity in agricultural systems may also 
diminish goods and services because the ability of eco¬ 
systems to provide some services depends both on the 
number and type of species in an ecosystem [9]. The 
supply of agricultural products and ecosystem services 
are both essential to human existence and quality of 
life. However, recent agricultural practices that have 
greatly increased global food supply have had inadver¬ 
tent, detrimental impacts on the environment and on 
ecosystem services, highlighting the need for more 
sustainable agricultural methods [9]. Wheat, rice, and 
maize crops have become the three most abundant 
plants on Earth. A central conclusion of epidemiology 
is that both the number of diseases and disease inci¬ 
dence should increase proportionally to host abun¬ 
dance, and this disconcerting possibility illustrates the 


potential instability of a global strategy of food pro¬ 
duction in which just three crops account for so high 
a proportion of production [9]. The relative scarcity of 
outbreaks of diseases on these crops is a testament to 
plant breeding and cultivation practices. For all three 
cereals, breeders have been successful at improving 
resistances to abiotic stresses, pathogens, and diseases, 
and at deploying these defenses in space and time so as 
to maintain yield stability despite low crop diversity in 
continuous cereal systems [9]. However, it is unclear if 
such conventional breeding approaches can work 
indefinitely. Both integrated pest management and bio¬ 
technology that identifies durable resistance through 
multiple gene sources should play increasingly impor¬ 
tant roles [55]. Nonetheless, the evolutionary interac¬ 
tions among crops and their pathogens mean that any 
improvement in crop resistance to a pathogen is likely 
to be transitory [9]. Although the “Green Revolution” 
significantly increased rice yields in Asia, yield increases 
have not occurred in the past 15-20 years. The genetic 
“yield potential” has increased for wheat, but the yield 
potential for rice has not increased since 1966, and the 
yield potential for maize has “barely increased in 35 
years.” It takes a decade or two for herbicide-resistant 
weeds to emerge, and insects become resistant to insec¬ 
ticides within about a decade. Within about one or two 
decades of the introduction of each of seven major 
herbicides, herbicide-resistant weeds were observed. 
Crop rotation helps to prevent resistances [9]. 

Biological Synergism and Technical 
Synchronization 

Agriculture should fulfill the economic, environmental, 
and social objectives of sustainable development. It is 
thus necessary to tailor current CSs to meet these needs 
as they are often too intensive and dependent on exter¬ 
nal inputs [56]. It is clear that the more we imitate 
nature or encourage natural processes, the greater the 
benefits for the CSs and fewer inputs that will be 
required. Two basic concepts should illustrate the 
design and management of CSs of the future: biological 
synergism and technical synchronization. 

Biological Synergism 

Biological synergic CSs should synchronize 
edaphoclimatic resources with the natural rhythms of 
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crops, this being a question of common sense that has 
not always been applied, and clearly speaks of the 
diversity which has been lost with intensive systems. 
Therefore, the increase in biodiversity in CSs is a clear 
consequence of the search for biological synergism in 
addition to the promotion of natural processes. Biodi¬ 
versity is essential to ecosystem processes in ways that 
are not yet fully understood [57], and it is considered 
worth protecting in its own right. Crop diversification 
by itself, however, is of limited use without knowledge 
of how individual crops affect each other in a sequence. 
Consequently, a thorough understanding of crop 
sequencing effects - both positive and negative - on 
agronomic parameters is necessary to optimize CS 
performance [13, 46]. One problem associated with 
synergic CSs is how to choose and sequence crops to 
develop the inherent internal resources of the system 
while taking advantage of external resources such as 
weather, markets, government programs, and new 
technology [14]. Present CSs rely on extensive use of 
fertilizer and pesticides and the low cost of fossil fuel 
energy [46]. Future challenges for CSs will exploit 
synergism through crop sequencing to improve crop 
yields without additional inputs and to reduce deteri¬ 
oration of the environment [58]. 

Diversification of farming systems cannot be 
managed on a farm or field scale alone but also requires 
management on a regional scale. To improve biodiver¬ 
sity, the interaction with other stakeholders in 
the region is necessary as part of regional planning 
processes. This has consequences for agronomic 
research which thus far has had only limited attention 
for the regional scale. Spatial planning aimed at 
multifunctional agriculture can be seen as a negotiation 
process on environmental, social, and economic aspects 
of land use. Complexity arises due to the high number 
of stakeholders and due to limited knowledge, which is 
often organized along disciplinary divides [59]. 

Crop rotation is defined as the growing of different 
crops (spatial and temporal), in recurring succession, 
on the same land in contrast to monoculture cropping. 
Rotation usually is done to replenish soil fertility, create 
a diverse soil organism, and reduce pest populations in 
order to increase the potential for high levels of pro¬ 
duction in future years [2, 46]. Each crop or a closely 
related crop species should not be grown more than 
every 4 years because of increased pest problems 


[60, 61]. One of the most effective and inexpensive 
methods to control plant diseases in CSs is through 
crop rotation. Through the use of this practice, the 
need to breed for new disease resistance and to discover 
new pesticides can be reduced. Recently, an important 
and costly pathogen of rice was controlled in a large 
region of China by planting alternating rows of two rice 
varieties [62]. This tactic increased profitability and 
reduced the use of a potent pesticide. Specific crop- 
rotation effects on plant diseases, however, are poorly 
understood and contradictory results have been 
reported [13]. 

Changes in the weed flora of agroecosystems can 
occur as long-term changes or temporary fluctuations 
in species composition [52]. Agricultural weeds are 
a unique group of plant species because of their ability 
to infest and thrive in intensively disturbed habitats 
despite extensive efforts to eliminate them. Weed floras 
differ between fields, farms, regions, climatic zones, 
and CSs. Weeds are successful because they are gener¬ 
ally plastic plants that adapt to and survive changes in 
the environment. In continuous CSs, changes in the 
environment may be as a result of differences in crop 
species, tillage, fertilizers, herbicides, and other weed- 
control tactics. Long-term population shifts are usually 
observed after repeated use of a control measure, which 
exerts a high selection pressure on a population, for 
example, the increased incidence of some annual grass 
weed species due to the intensification of cereal pro¬ 
duction and selective herbicides [52]. Studies have 
shown that the more diverse the CS, the more diverse 
the weed community, with less dominant and trouble¬ 
some species as would occur in a simple rotation. 

With diversified continuous cropping, there is 
greater opportunity to utilize crops that vary in 
N requirements [63]. Rotations with legumes are of 
great interest for strengthening the synergism of a CS. 
Legume crops are often planted to enhance nutrient 
cycling and availability to other crops in the rotation. 
Crops with high N demands are usually grown after 
legume crops. Grain legumes conserve soil N by fixing 
atmospheric N, thereby leaving residual N in place for 
the next crop. Wheat yields are often higher following 
legumes than following other cereals [54] and the N use 
efficiency of a wheat crop is greater following legumes 
than others crops [64]. The inclusion of a legume crop 
decreases its reliance on external fertilizer, supplying 
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a significant proportion of the N for the next crop and 
may moderate nitrate levels in the soil to avoid the 
potential for nitrate leaching [65] (Fig. 3). 

Alternatives such as plurispecific CSs are of consid¬ 
erable interest for biological synergism because plant 
associations can provide environmental benefits in 
terms of better use of resources and provision of eco¬ 
logical services [67]. Yet they should also be able to 
maintain a level of productivity and production stabil¬ 
ity that fits with farmers’ objectives. Intercrops (grass 
cover in the inter-rows) are now being introduced to an 
increasing extent due to the potential positive impacts 
on perennial crop and their environment: increased 
infiltration rate and decreased runoff due to modified 
soil surface characteristics [68], mitigation of soil ero¬ 
sion [69], limitation of herbicide use and weed control, 
better water-resource utilization by roots [70], and 
limitation of risks of diseases by reducing vegetative 
development [71]. However, intercropping also 
induces competition for soil resources, and vegetative 
development and yield can consequently be limited 


[56]. These different functions should be promoted 
through simultaneous adapted management of the 
two crops of the system. 

Finally, an important link of biological synergism is 
the return provided by livestock to the CSs. There was 
always a crop-livestock pairing in ancient agriculture, 
but specialization caused this very important aspect of 
biodiversity and synergism to disappear. Research is 
needed to better understand interactions between 
crops and livestock with the intention of identifying 
management practices that maximize agroecosystem 
productivity and operational efficiency. This would 
save having to remove residues, part of which would 
be used directly by the livestock, partially allowing the 
recycling of nutrients. Inclusion of livestock in a CS 
complements both crops and livestock by adding value 
to grain, improving nutrient-use efficiency, and pro¬ 
viding alternative uses for forages and crop residues 
[72]. It may create synergisms between the two enter¬ 
prises, resulting in far greater productivity than either 
enterprise could attain alone [13, 73]. The inclusion of 


c 


Relative other crops yield 
3.0 


2.5 


2.0 


1.5 


1.0 


0.5 


Benefit only for others crops 


Chickpea 


Sunflower 


Maize 


Proso millet 

Buckwheat t* 


Benefit for both crops 


\ FLAX 


\ Safflower 


1 Lentil 


Dry pea 
^Barley 
A Crambe 


Canola 1 

. •• . 

Wheal, Soybean #Dlybean 


Detrimental for both crops 

_t_ 


Benefit only for wheat 

_ ! _ 


0.8 


0.9 


1.0 1.1 1.2 
Relative wheat yield 


Cropping Systems: Shaping Nature. Figure 3 

Effect of 14 previous crops on relative wheat yield and the effect of wheat, as a previous crop, on those crops in the Great 
Plains (Adapted from [46, 66]) 









2752 


Cropping Systems: Shaping Nature 



livestock in CSs would also help alleviate the problems 
generated by livestock: high-density animal production 
operations can increase livestock disease incidence, the 
emergence of new, often antibiotic-resistant diseases, 
and air, groundwater, and surface water pollution asso¬ 
ciated with animal wastes. Current livestock operations 
are vulnerable to catastrophic loss of animals to disease. 
The handling and disposal of animal wastes are signif¬ 
icant problems of high-density animal confinement 
facilities. Manure lagoons can release high levels of 
hydrogen sulfide and other toxic gases, volatilize 
ammonium that greatly increases regional nitrogen 
deposition, and contaminate surface and ground 
waters with nutrients, toxins, and pathogens. These 
animal wastes pose health and environmental risks 
similar to those of human wastes and should be treated 
accordingly. For example, animal wastes could be 
treated by composting to create a crop fertilizer that 
no longer harbors pathogens, and that is applied at 
appropriate rates and times and with methods that 
minimize nutrient leaching. This closing of the nutri¬ 
ent cycle decreases dependence on synthetic fertilizer 
production, and is more efficient when animal and 
crop production are combined locally [9]. 

Technical Synchronization 

Technical synchronization must be defined as the con¬ 
cept on which precision agriculture is based, i.e., the 
temporal and spatial synchronization of agricultural 
techniques. All agricultural practices must be synchro¬ 
nized in time with the rhythms of the agroecosystem 
(time-specific management). This synchronization 
must take into account the growth stage of the crop, 
different soil characteristics (nutrients and water), 
weather forecasts, etc. On the other hand, there is the 
spatial synchronization that divides the land into 
homogeneous units for management (site-specific 
management) [63]. A basic principal of technical syn¬ 
chronization is the energetic efficiency when making 
decisions in relation to the agricultural practices to be 
carried out. 

The key to the application of technical synchroni¬ 
zation in a CS is the acquisition of data in order to 
make management decisions in time and space. There 
are an increasing number of data acquisition devices 
available to assist with decision making: global 


positioning (GPS), satellites or aerial images, informa¬ 
tion management tools (GIS), weather stations, soil 
sensors (water content, temperature, etc.), optical sen¬ 
sors (e.g., chlorophyll meters), and remote sensors 
(visible and near infrared spectral), etc. These tools 
allow the farmer to have a better understanding of the 
interaction of climate and management that causes 
tremendous year-to-year variation in on-farm yields 
and crop requirements. 

Nonetheless, technical synchronization poses 
a problem when applied by the farmer. There is 
a large body of published research on technologies for 
increasing the technical synchronization of agricultural 
practices, but relatively few have been adopted by 
farmers because they are not cost-effective or practical. 
Adoption of improved technologies typically requires 
additional skills and labor or investments in new equip¬ 
ment. Information on expected costs and economic 
returns from such investments is required to convince 
farmers of the benefits from adoption [12]. A possible 
solution is cooperativism for the acquisition and shar¬ 
ing of these new technologies. 

Dynamic Cropping Systems 

It could be said that, until now, CSs have been static, 
i.e., year after year the same group of agricultural prac¬ 
tices have been repeated, causing soil monotony in the 
agroecosystem. A dynamic CS must be flexible and go 
against repeated management practices, and the deci¬ 
sions must be made based on measured parameters and 
their determining factors. A dynamic CS is only the 
consequence of the application of the two previously 
stated principles: biological synergism and technical 
synchronization. Cropping systems need to be inher¬ 
ently flexible to take advantage of economic opportu¬ 
nities and/or adapt to environmental realities [13, 18]. 
The implementation of a management strategy 
requires knowing, understanding, planning, measur¬ 
ing, monitoring, and record keeping at each step of 
the production process. Dynamic CSs require the 
development and implementation of agricultural pro¬ 
duction systems that are highly productive, energy 
efficient, and environmentally non-degrading [12]. 

Dynamic CSs are a form of agricultural production 
that relies on an annual or pluri-annual strategy to 
optimize the outcome of (1) production, (2) economic, 
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and (3) resource-conservation goals using ecologically 
based management principles [74]. Implicit to this 
strategy is the need for producers to possess informa¬ 
tion necessary to respond to continual change. The key 
factors associated with dynamic CSs are diversity, 
adaptability, reduced input cost, multiple enterprise 
systems, and awareness of environment and informa¬ 
tion [14]. Greater crop diversity and sequencing flexi¬ 
bility within dynamic CSs may result in reduced weed, 
pest and disease infestations, greater nutrient- and 
precipitation-use efficiency, decreased requirements of 
exogenous inputs, and lower production risk [74]. 
To remain competitive in agricultural markets, 
increased diversity of systems will be required to min¬ 
imize inputs and improve economic margins. 

To increase responsiveness to externalities, oppor¬ 
tunity/flex CS [75] or dynamic CS [14] concepts have 
been developed. These systems allow producers to 
adjust CS intensity and/or diversity based on external¬ 
ities as well as management goals, such as soil-water 
status at planting, soil residual nitrogen, market 
demands, etc. This approach to crop sequencing pos¬ 
sesses an inherent flexibility to adapt to high-risk con¬ 
ditions, and therefore may be more economically and 
environmentally sustainable than other approaches to 
crop sequencing [13]. However, the authors who 
established the principles of dynamic CSs have focused 
solely on rotation selection, which is erroneous, since 
a CS is more than its rotation selection, although the 
crop selection is without a doubt the most important 
decision made in a CS. Perhaps the concepts should be 
called dynamic or flexible crop election system. How¬ 
ever, a dynamic CS makes more sense if each year all of 
the crop techniques used are reconsidered, i.e., crop 
selection, tillage, residue management, nitrogenous fer¬ 
tilization, etc. in such a way that there are dynamics in 
the selection of each and every one of the agricultural 
practices that depend on externalities. 

Future Challenges 

Agricultural science is ripe for a renaissance [39] as 
a consequence of the future challenges it must con¬ 
front. The widespread adoption of CSs that are sustain¬ 
able and environmentally benign is essential for the 
long-term survival of civilization [32]. Furthermore, 
through the use of strategic alliances, cooperation 


among producers on a regional basis may eventually 
lead to greater integration and diversification than 
could be achieved for the individual farm operation 
[ 15] . The primary challenges facing CSs in this century 
are: covering the demands of an increasing world pop¬ 
ulation; water shortage; the inclusion and management 
of GM crops; harmonization of the production of 
biofuels and food production; and climate change, 
and the focus it brings to agricultural research. Obvi¬ 
ously, there are many other challenges which have not 
been presented here, but which are intimately related to 
those mentioned such as: the biodiversity of CSs - as 
already previously mentioned - which should redound 
to a reduction of inputs and, therefore, a reduction 
of the negative effects on the environment, strategies 
to manage anthropogenic carbon emissions from 
terrestrial systems as well as fossil fuel and industrial 
sources, etc. 

Nevertheless, together with these future challenges 
are a series of forces that may contribute or hinder their 
fulfillment: research policies, transference, and financ¬ 
ing. In view of these challenges, governmental policies 
play an important role that may help or hurt the 
achieving of these objectives. For this reason, 
policymakers have a great responsibility, and their deci¬ 
sions should be guided by agricultural science profes¬ 
sionals. Political intervention has an economic facet 
that arises as the result of the following question: 
How can society accomplish the dual objectives of 
improving food production and stability and of pre¬ 
serving the quality and quantity of ecosystem services 
provided by the Earth’s land and water resources? 
Clearly, appropriate incentives are needed [9]. What 
incentives and policies could lead to the adoption of 
sustainable agricultural practices? Several policy initia¬ 
tives have tried to level the playing field between agri¬ 
cultural production and the production of ecosystem 
services. 

Another problem in the attempt to fulfill these 
objectives is the transference and application of new 
technologies in CSs. The earlier paradigm of science 
being developed at the international or perhaps 
national level and then disseminated to farmers should 
be replaced by an active exchange of information 
among scientists and farmers. Scientists in developing 
countries who understand the ecosystems, human cul¬ 
ture, and demands on local agricultural systems must 
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be actively trained, promoted, and brought into the 
international scientific community [9]. 

The economic investment in agricultural research is 
the third problem. Organized public and private invest¬ 
ment in agricultural R&D was a primary driver of the 
comparatively rapid growth in agricultural productiv¬ 
ity experienced in the latter half of the twentieth 
century [76]. Substantially greater public and private 
investments in technology and human resources 
are needed internationally, especially in low-income 
nations, to make agricultural systems more sustainable. 
Global research expenditures are less than 2% of agri¬ 
cultural gross domestic product worldwide [77], being 
roughly 5.5% of agricultural GDP in developed coun¬ 
tries, but less than 1% in developing countries (where 
most of the increased food demand will occur during 
the next 50 years). At present, there are few incentives 
for the private sector to increase investments in lower- 
income developing countries [78]. Moreover, funds 
have been redirected away from farm productivity 
toward other concerns, such as the environmental 
effects of agriculture; food safety and other aspects of 
food quality; and the medical, energy, and industrial 
uses of agricultural commodities. Without adequate 
investments, yield gains and environmental protection 
may be insufficient for a transition to sustainable 
agriculture [9]. 

Feeding the World: Priority Aim 

In a recent update of earlier estimates, the Food and 
Agriculture Organization [79] of the United Nations 
reported that more than one billion people now suffer 
malnutrition [80]. By 2050, global population is 
projected to be 50% larger than at present and global 
grain demand is projected to double [28, 82], especially 
the economic growth of the fast-growing economies of 
Asia [81]. This doubling will result from a projected 
2.4-fold increase in per capita real income and from 
dietary shifts toward a higher proportion of meat 
(much of it grain fed) associated with higher income. 
Further increases in agricultural output are essential for 
global political and social stability and equity. Dou¬ 
bling food production again and sustaining food pro¬ 
duction at this level are major challenges [82] . Doing so 
in ways that do not compromise environmental integ¬ 
rity and public health is a greater challenge still [9]. 


Agricultural productivity growth will be a pivotal 
determinant of long-term growth in the supply, avail¬ 
ability, and price of food over the coming decades [81]. 
However, a slowdown in growth of agricultural 
productivity and grain yields has been documented. 
If this slowdown in productivity persists, it could 
have profound implications for food-price trends in 
the future [81]. 

A key trade-off in cultivated systems is between 
increasing the amount of cropland needed to meet 
growing food needs versus increasing the productivity 
of each hectare of cropland. The “land-sparing” impact 
of modern farming practices has mainly been achieved 
by yield increases from the use of crop monocultures 
with improved crop varieties, fertilizer inputs, and 
irrigation. For example, if yields of the six major crop 
groups that are cultivated on 80% of the total cultivated 
land area had remained at 1961 yield levels, it would 
require an additional 1.4 billion hectares of land in 
2004 - more than double the amount currently used. 
This represents 34% of the total land area suitable for 
crop cultivation and would have required conversion of 
large areas of uncultivated land that support rain for¬ 
ests, grassland savannahs, and wetlands. In Asia alone, 
it would require an additional 600 million hectares, 
which represent 25% more land area than is suitable 
for cultivation on this continent. Asia would now be 
heavily dependent on food imports if crop yields had 
remained at 1961 yield levels. Although this increase in 
productivity has saved some land from conversion, 
it has resulted in greater impact on other services 
through water withdrawals, excessive nutrient loads 
and pesticide use. The key ecological question is, there¬ 
fore, whether environmental services - other than food 
production at regional and global scales - would be 
enhanced by focusing food production on less land 
under intensive management with high yields, versus 
expanding cultivated area in lower-yielding systems 
using farming practices that preserve environmental 
services at the field and local levels. Few studies have 
addressed this issue using sound, ecological, analytical 
methods [27]. 

Water: Colorless Gold 

Water is the principal factor limiting crop yield and is 
considered a luxury in many agroecosystems. In many 
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developing countries, water is a major factor 
constraining agricultural output, and income of the 
world’s rural poor [83]. Forty percent of crop produc¬ 
tion comes from the 16% of agricultural land that is 
irrigated [84]. Irrigated lands account for a substantial 
portion of increased yields obtained during the Green 
Revolution [9] . Unless water-use efficiency is increased, 
greater agricultural production will require increased 
irrigation. However, the global rate of increase in irri¬ 
gated area is declining, per capita irrigated area has 
declined by 5% since 1978, and new dam construction 
may allow only a 10% increase in water for irrigation 
over the next 30 years [84]. Irrigation return-flows 
typically carry more salt, nutrients, minerals, and pes¬ 
ticides into surface and ground waters than in source 
water, impacting downstream agricultural, natural sys¬ 
tems, and drinking water. Technologies such as drip 
and pivot irrigation can improve water-use efficiency 
and decrease salinization while maintaining or increas¬ 
ing yields. They have been used in industrialized 
nations on high-value horticultural crops, but their 
expanded use currently is not economically viable for 
staple food crops. In developing countries, 15 million 
hectares have experienced reduced yields owing to salt 
accumulation and waterlogging [85]. The water¬ 
holding capacity of soil can be increased by adding 
manure or reducing tillage and by other approaches 
that maintain or increase soil organic matter. Cultiva¬ 
tion of crops with high water-use efficiency and the 
development - through the use of biotechnology or 
conventional breeding - of crops with greater drought 
tolerance can also contribute to yield increases 
in water-limited production environments. Investment 
in such water-efficient technologies, however, is 
best facilitated when water is valued and priced 
appropriately [9]. 

However, most of the world’s agricultural area cor¬ 
responds to dry-land areas and therefore the challenge 
is to increase the efficiency of water use in these areas. 
Efficient use of limited water supplies in dry-land CSs is 
critical to system success. In the future, increases to 
semiarid dry-land system water-use efficiency and pre¬ 
cipitation-use efficiency may come from continued 
improvement in managing residues (especially no- 
tillage), cropping sequences that minimize fallow 
periods, herbicides, and crop choice. However, contin¬ 
uous cropping under dry-land conditions remains 


risky due to the limited precipitation (erratic in distri¬ 
bution and frequency) and high-potential evaporation. 
In the future, the following are potentially fruitful areas 
of research that may improve system water-use effi¬ 
ciency [86]: (1) Increasing the amount and persistence 
of crop residues. (2) Implementation of flexible rota¬ 
tions (i.e., opportunity cropping). The occurrence of 
precipitation and, hence, the availability of adequate 
stored soil water for a crop is highly variable, especially 
in semiarid regions. Sometimes stored soil water at 
normal planting times for a crop in a given CS is 
limited; at other times, adequate water for a crop is 
available when the planting of a crop had not been 
planned. By practicing opportunity cropping, some 
crop generally could be planted when water becomes 
available. The goal should be to grow a crop whenever 
conditions are or become favorable and not according 
to some predetermined schedule [87]. (3) Matching of 
crop cultivar selection to prevailing weather condi¬ 
tions. Genetic yield potential is linked positively with 
maturity, so cultivar evaluation trials conducted under 
conditions of adequate soil water often favor longer- 
maturity cultivars and influence farmer choice. 
(4) Improvement of the timeliness of cultural opera¬ 
tions, including early seeding of crops and optimum 
timing of weed control, and time operations to coin¬ 
cide with favorable conditions as predicted by short¬ 
term (48-72 h) weather forecasts. 

Genetic Modified Crops: Essential 

Genetically modified crops are a very important part of 
the “second Green Revolution,” which may help to 
break the tendency toward the stabilization of work 
agricultural production. Genetically modified crops 
may offer the best hope for producing crops that can 
withstand drought, impoverished soils, and disease. 
Future GM products are likely to carry traits that will 
improve nutrition and health, help guard against 
drought, heat and cold, and allow plants to access and 
more efficiently utilize plant nutrients. All of these 
technologies have more benefits to offer society, and 
especially, poor farmers and consumers even more than 
rich ones [88]. 

Therefore, GM crops are an essential tool in feeding 
the growing world population and reducing the nega¬ 
tive effects on the environment from the use of 
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pesticides. Nevertheless, this technology must be used 
properly by growers. Growers who adopt herbicide or 
pest tolerant crops in all cycles of the crop rotation will 
be at greater risk than those who choose to rotate crops 
and/or use other management strategies [89]. 
However, farmers must be proactive in protecting the 
longevity of the technology by reducing the likelihood 
of resistance occurring and providing alternative selec¬ 
tion pressures on weeds and pests [52]. We must not 
forget that life always finds a way through barriers [90] . 

Biofuel Crops: A Controversial Birth 

Recent analyses of the energy and greenhouse-gas per¬ 
formance of alternative biofuels have ignited 
a controversy that may be best resolved by applying 
two simple principles. In a world seeking solutions to 
its energy, environmental, and food challenges, society 
cannot afford to miss out on the global greenhouse-gas 
emission reductions and the local environmental and 
societal benefits when biofuels are done right. However, 
society also cannot accept the undesirable impacts of 
biofuels done wrong [91]. Biofuels done right can be 
produced in substantial quantities with little or no 
competition with food production. 

The biofuel industry could have many positive 
social and environmental attributes, but it could also 
suffer from many of the sustainability issues if not 
implemented the right way. Putting biofuel crops on 
marginal lands, rather than on our most productive 
croplands, could mean preventing competition with 
food production and concomitant effects on commod¬ 
ity prices, as well as minimizing or even avoiding the 
carbon debt associated with land clearing. However, 
marginal lands can also be rich in biodiversity, 
may require sizable inputs of nutrients and water to 
make production economically viable, and may carry 
the opportunity cost of forgone future carbon 
sequestration [92]. 

Globally, the production of an important amount 
of energy with biofuels will require a large amount of 
land - perhaps as much as is in row crop agriculture 
today. This will change the landscape of Earth in 
a significant way. The identification of unintended 
consequences early in the development of alternative 
fuel strategies will help to avoid costly mistakes and 
regrets about the effects on the environment. Policies 


that support long-term sustainability of both our land¬ 
scapes and our atmosphere are essential if we are to 
chart a low-carbon economy that is substantially better 
than business as usual. Sustainable biofuel production 
systems could play a highly positive role in mitigating 
climate change, enhancing environmental quality, and 
strengthening the global economy. But it will take 
sound, science-based policy and additional research 
effort to make this so [93]. 

Until now, the most convincing option for the 
production of biofuel has been the use of perennial 
plants as they require fewer inputs and can be grown 
on degraded lands abandoned from agricultural use 
[91]. Use of such lands minimizes competition with 
food crops. This also minimizes the potential for direct 
and indirect land-clearing associated with biofuel 
expansion, as well as the resultant creation of long¬ 
term carbon debt and biodiversity loss. Some initial 
analyses on the global potential of degraded lands sug¬ 
gest that they could meet meaningful amounts of cur¬ 
rent global demand for liquid transportation fuels 
[92, 94, 95]. The option of using crop residues from 
food crops, such as corn stover and straw from rice and 
wheat is a more controversial matter. Recent research 
suggests that it is to the benefit of farmers to leave 
substantial quantities of crop residues on the land 
[96], but that, nonetheless, even conservative removal 
rates can provide a sustainable biomass resource about 
as large as that from dedicated perennial crops grown 
on degraded lands. 

Good public policy will ensure that biofuel produc¬ 
tion optimizes a bundle of benefits, including real 
energy gains, greenhouse-gas reductions, preservation 
of biodiversity, and maintenance of food security. 
Performance-based policies that provide incentives 
proportional to the benefits delivered are needed. 
This is a complex question that cannot be addressed 
with simplistic solutions and sound bites. It needs 
a new collaboration between environmentalists, econ¬ 
omists, technologists, the agricultural community, 
engaged citizens, and governments around the 
world [91]. 

Climate Change: Technical Plasticity 

The scenario produced, if indeed climate change 
becomes a greater reality, is unclear. There is 
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a “climate of suspicion” as published in the journal 
Nature at the beginning of January 2010. Climate 
researchers must emphasize that - although many 
holes remain to be filled - there is little uncertainty 
about the overall conclusions: greenhouse-gas emis¬ 
sions are rising sharply, they are very likely to be the 
cause of recent global warming and precipitation 
changes, and the world is on a trajectory that will 
shoot far past 2°C of warming unless emissions are 
cut substantially [97]. It is evident that CSs must have 
the ability to adapt to a more unpredictable amount 
of rainfall and high temperatures. According to the 
IPCC definition, the extent to which CSs are vulnerable 
to climate change depends on the actual exposure 
to climate change, their sensitivity, and adaptive 
capacity [98] . The adaptive capacity refers to the ability 
to cope with climate change, including 
climate variability and extremes, in order to (1) mod¬ 
erate potential damages, (2) take advantage of emerg¬ 
ing opportunities, and/or (3) cope with its 
consequences [99]. 

Nonetheless, in spite of the uncertainty regarding 
climate change, primarily as a consequence of the 
increase in atmospheric C0 2 levels, there are some 
positive effects. When you step into a commercial 
greenhouse, the chances are you are stepping into the 
future. To plants, C0 2 is food, and greenhouse opera¬ 
tors, knowing this, use it to fatten them up. While 
today’s atmosphere contains about 380 parts per mil¬ 
lion of C0 2 , commercial greenhouses often contain 
C0 2 concentrations of twice that or more - the sort 
of concentration that we might expect in the open air at 
the end of the century [100]. Many crop scientists 
believe that this carbon dioxide fertilization effect will 
go at least some way toward offsetting the losses in yield 
to be expected as a result of the higher temperatures, 
flooding, drought, and rising sea levels that the C0 2 
greenhouse effect will bring. But some are not so sure. 
These researchers point to the known negative effects 
that increased C0 2 concentrations have on the protein 
content of crops. When Bruce Kimball started out in 
the 1970s, available technology for high-C0 2 research 
was limited. Based on current knowledge, he says, the 
net effect of increasing C0 2 is a good one: “As far as 
crops go, I think higher C0 2 is a definite benefit. Yes, 
a little less nutrition than before, but we get more food” 
[100]. But while Kimball thinks that, in general, the 


gains in yield are the most important thing, he is not 
blind to the drop in protein levels. 

Agricultural Research: Return to the Field 

In recent years, an important movement in agricultural 
research has emerged on an economic as well as human 
resource level, toward molecular aspects of plant 
growth and development. Progress in understanding 
plant molecular biology has been impressive, and use¬ 
ful applications are evident [101]. However, agronomy 
has been marginalized, with its field laboratories which 
require much more time to produce results. This 
migration could have something to do with the scien¬ 
tific productivity stimulated by the present scientific 
system since many agricultural research areas that do 
not conduct field work require less than a year from the 
start of experimental design until the results are 
published. On the contrary, field research requires at 
least 2 years before the results can be accepted by the 
scientific community. Most records used to assess eco¬ 
system changes are based on short-term data or satellite 
imagery spanning only a few decades. In many 
instances, it is impossible to disentangle natural vari¬ 
ability from other, potentially significant trends in 
these records partly because of their short time scale 
[102]. It is evident that the improvement of CSs 
requires research in CSs, and specially long-term exper¬ 
iments. Unless a return is made to the roots of research 
under field conditions, many of the challenges faced 
will not be overcome. The return to field research will 
contribute to the global public good by restoring and 
sustaining productivity growth over the long run, 
which in turn will mitigate hunger and poverty and, 
at the same time, reduce pressure on the natural 
resource base [81]. 

Nevertheless, in recent years, some agronomists 
have begun to once again stress the importance of this 
marginalized dimension of agricultural research, which 
is time, in relation to the consequences that CSs 
have on productivity and the environment. Within 
this framework, long-term experiments acquire a 
noteworthy relevance. The aim is to compare the bio¬ 
logical and economic productivity of different CSs. The 
high annual fluctuations in rainfall, and resulting crop 
yields, may make several crop cycles necessary in order 
to detect significant differences. Likewise, the 
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differences between soils, water-soil relationship, and 
variability of pathogens and plagues could provide 
information that explains the differences in yield 
among CSs and suggest strategies in which crop 
sequences and management practices can be selected 
to increase efficiency in the use of water and nutrients, 
and control the populations of weed, plagues, and 
diseases. In this respect, long-term studies act as “lab¬ 
oratories” in which specific problems or mechanisms 
can be studied in continuous field conditions, where 
the crop and “input/output” history is well known, and 
also where tendencies of the quality of resources over 
time and crop yields can be examined [103]. The most 
convincing evidence regarding the sustainability of an 
agroecosystem comes from long-term experiments 
with positive results. 
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Glossary 


and around it; glacier flow directions in an icefield 
are dictated by the bed topography. 

Lake/river ice Floating ice on rivers or lakes, usually 
freshwater ice. 

Mass balance The overall gain or loss of mass for 
a component of the cryosphere over a specified 
time interval, typically 1 year. This can be expressed 
as a rate of change of mass (kg year -1 ), ice volume 
(m 3 year -1 ), or water-equivalent volume (m 3 w.eq. 
year -1 ). It is also common to express this as 
the area-averaged rate of change or the specific 
mass balance rate, with units of kg m -2 year -1 or 
m w.eq. year -1 . 

Permafrost Perennially frozen ground, technically 
defined as ground that is at or below 0°C for at 
least 2 years. 

Sea ice Floating ice from frozen seawater. 

Snow Ice-crystal precipitation that accumulates on the 
surface. 

Soil ice Ice in permafrost. 


c 


Ablation Snow and ice removed from an ice mass via 
meltwater runoff, sublimation, wind scour, or gla¬ 
cial calving (mechanical fracturing and separation). 

Accretion Increase in ice mass by basal growth in the 
case of floating ice, the compression of snow into 
ice, or freezing of water that has pooled on the ice 
or percolated into snow from rain, meltwater, or 
flooding of sea/lake/river water. 

Accumulation Snow and ice added to an ice mass via 
snowfall, frost deposition, rainfall that freezes on/in 
the ice mass, refrozen meltwater, wind-blown snow 
deposition, and avalanching. 

Glacier A perennial terrestrial ice mass that shows 
evidence of motion/deformation under gravity. 

Grounding line The transition zone between 
grounded and floating ice. 

Ice sheet A large (i.e., continental-scale) dome of gla¬ 
cier ice that overwhelms the local bedrock topogra¬ 
phy, with the ice flow direction governed by the 
shape of the ice cap itself. 

Ice shelf Glacier ice that has flowed into an ocean 
or lake and is floating, no longer supported by the 
bed. 

Icefield A sheet of glacier ice in an alpine environment 
in which the ice is not thick enough to overwhelm 
the local bedrock topography, but is draped over 


Definition of Subject 

The global cryosphere encompasses snow and ice in all 
its forms in the natural environment, including glaciers 
and ice sheets, sea ice, lake and river ice, permafrost, 
seasonal snow, and ice crystals in the atmosphere. 
Cryospheric models are mathematical and numerical 
descriptions of these components of the Earth system, 
designed to simulate snow and ice processes and feed¬ 
backs in the context of the global climate system. This 
entry covers all forms of ice except ice crystals in the 
atmosphere, which are more appropriately combined 
with clouds and appear in entry ► Solar Radiation 
Management, Cloud Albedo Enhancement. We focus 
on models that are appropriate for global climate 
modeling. There are regional climate modeling appli¬ 
cations that include some of the cryospheric compo¬ 
nents discussed here as well. 

The cryosphere is critical to understanding global 
climate change owing to its control on Earth’s surface 
reflectivity. In addition, storage of freshwater by the 
cryosphere and exchange of freshwater between the 
cryosphere and ocean are fundamental to ocean 
circulation and sea level rise. Sea ice and snow insulate 
the underlying surface and usually allow lower atmo¬ 
spheric temperatures than snow and ice environments. 
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Biogeochemical cycles in sea ice and permafrost 
influence carbon dioxide and methane concentrations 
in the atmosphere, and sea ice hosts organisms and 
nutrients important to marine ecosystems. Modeling 
cryosphere-climate interactions generally requires 
modeling the mass balance of ice on Earth and the 
key features of ice that interact with the climate system. 

Perennial ice covers 10.8% of Earth’s land, with 
most of this ice area in the great polar ice sheets 
in Greenland and Antarctica. Smaller glaciers 
and icefields are numerous - the global population is 
estimated at 160,000 - but these ice masses cover less 
than 1% of the landscape. An additional 15.4% of 
Earth’s land surface is covered by permafrost: frozen 
ground that ranges from a few meters to 100 s of meters 
deep. 

In contrast to this permanent land ice, seasonal 
snow and ice fluctuate dramatically. Snow cover is the 
largest-varying element of the cryosphere, with 
complete summer loss of this snow everywhere on 
Earth except over Antarctica, the interior of Greenland, 
and in the accumulation areas of other high-altitude 
and polar ice caps. In winter, the northern hemisphere 
snow cover reaches an average maximum extent of 
45.2 x 10 6 km [1], based on data from 1966 to 2004). 
This amounts to 49% of the Northern Hemisphere land 
mass. Because the southern hemisphere continents are 
situated at lower latitudes (excepting Antarctica), 
southern snow cover is less extensive, with the seasonal 
snow cover estimated at 1.2 x 10 6 km. This combines 
with the permanent blanket of snow over Antarctica to 
give a peak southern hemisphere terrestrial snow cover 
of 15.0 x 10 6 km. 

Relative to the snowpack, seasonal sea ice cycles 
are more hemispherically symmetric, although there 
are interesting north-south contrasts. Based on passive 
microwave remote sensing for the period 1979-2010, 
the average minimum Northern Hemisphere sea ice 
area is 4.4 x 10 6 km, typically reached in September 
[2]. Maximum ice cover is usually attained in March, 
with an average area of 13.1 x 10 6 km. The annual 
average Northern Hemisphere sea ice area is 
9.4 x 10 6 km. The Southern Hemisphere sea ice has 
a larger seasonal cycle, with relatively little multiyear 
ice. Annual mean sea ice cover in the south is 
8.7 x 10 6 km, varying from 1.9 x 10 6 km (February) 
to 14.5 x 10 6 km (September). 


Introduction 

The early energy balance modelers of the 1960s who 
investigated global climate recognized the importance 
of the high albedo of ice in the climate system [3, 4]. 
Their models parameterized snow and ice albedo by 
varying the land or ocean surface albedo with surface 
temperature, but no other physical characteristics of 
ice, such as its ability to insulate or store energy and 
freshwater, was simulated. When subject to climate 
forcing, such as a perturbation to the solar constants, 
the models nonetheless exhibited an amplification of 
temperature change in the high latitudes - a phenom¬ 
enon widely known as polar amplification. The albedo 
difference between ice-covered and ice-free regions 
determined the strength of the polar amplification. 
Today we know ice-albedo feedback is only one of 
many important ways that ice and snow contribute to 
the climate system. 

The next progression in ice modeling among global 
climate models was to include the major elements of 
the mass balance of snow on land [5] and sea ice [6], 
both implemented in the Geophysical Fluid Dynamics 
Laboratory (GFDL) in 1969. Yet the GFDL model and 
all other climate models for several more decades had 
no ice dynamics, continued to treat ice sheets as 
shallow snow fields with prescribed representative 
topography, and ignored soil ice altogether. The more 
advanced aspects of cryosphere modeling evolved in 
parallel with, but independent of global climate 
models. 

In the 1950s, John Nye [7, 8] laid the theoretical 
foundations for glacier models, through the elucida¬ 
tion of the essential physics as well as several 
well-judged simplifications that permit analytical 
solutions. Theoretical and laboratory analyses by 
a contemporary materials scientist, John Glen, led to 
the understanding that glacier ice deforms as a 
nonlinear viscous fluid. Glen established a constitutive 
relation for the rheology of glacier ice that endures to 
this day [9, 10] . Combined with the conservation equa¬ 
tions for mass, momentum, and energy, this provides 
the basis for modeling glaciers and ice sheets. By the 
late 1960s and 1970s, emerging computer power 
presented the opportunity to develop numerical 
models of glaciers [11-13]. Model development 
through the 1980s largely focused on regional 
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simulations [14] and ice flow in different regimes, such 
as ice shelves and ice streams [15, 16]. 

In the late 1980s, Philippe Huybrechts developed 
the first pragmatic, operational 3D thermomechanical 
ice sheet model [17, 18]. This model has been applied 
extensively to the Greenland and Antarctic Ice 
Sheets, helping to understand their past and present 
evolution, e.g., [19-22]. The Huybrechts model also 
underpins the projections of ice sheet response to cli¬ 
mate change in the IPCC reports. Similar models have 
now been developed in several research groups, and 
intercomparison exercises have been carried out to 
evaluate model strengths and weaknesses [23, 24]. 

A few global climate models have incorporated 
a Huybrechts-type ice sheet model such that changes 
to the global climate can interact with the shape and 
extent of ice sheets in Greenland and Antarctica. The 
interaction occurs through changes in surface albedo as 
the ice sheet retreats or advances over bare soil, eleva¬ 
tion-temperature feedbacks, and through changes in 
the atmospheric and oceanic circulation [21, 25-27]. 

The low-order glacier models implemented in 
global climate models to date are based on a simplified 
representation for ice sheet flow (see section “Land Ice 
Dynamics”), which is not well-suited to ice shelves, ice 
streams, and ice sheet margins. Because these features 
of ice sheets are known to be changing rapidly, current 
development efforts are focusing on higher-order solu¬ 
tions of ice dynamics [28-32]. Models of valley glacier 
dynamics have followed a similar evolution from sim¬ 
plified representations of ice dynamics, e.g., [33], to 
recent simulations that include a more complete 
representation of the glacier stress and strain regime 
[34-38]. Parallel to this recent effort are formal, com¬ 
munity-scale programs to couple more sophisticated, 
high-resolution ice sheet models into global climate 
models [39], although fully coupled efforts with global 
climate models are still in their infancy. 

The development of sea ice models took a rather 
different path than glacial models. Model development 
began later, but the methods, albeit often simplified, 
migrated more quickly into global climate models. 
Today the lag between developing new sea ice model 
physics and implementing them in global climate 
models is often only a few years. 

The first sea ice rheology was proposed in the 1970s, 
roughly two decades after the glacier rheology 


proposed by Nye and Glen. Initially a plastic rheology 
was put forward as a way to produce deformation 
(ridging and rafting) for the Arctic Ice Dynamics 
Joint Experiment (AIDJEX) model spearheaded by 
Max Coon [40]. Models attempting to treat ice as 
a plastic were only appropriate for local-scale problems 
of a few weeks to a month duration and could not be 
used to investigate ice-climate interactions owing to 
their inherent numerical complexity [41]. In the late 
1970s, William Hibler proposed a nonlinear viscous- 
plastic (VP) rheology - a simplification motivated on 
physical grounds [42]. In a 1979 landmark paper, 
Hibler applied his sea ice model to the whole Arctic 
Ocean basin and ran an 8-year simulation [43]. The 
AIDJEX and Hibler sea ice dynamics remains the foun¬ 
dation of modern sea ice models. 

A subgridscale parameterization of the variety of 
sea ice thicknesses that are found in a typical model 
grid box, known as an ice thickness distribution (ITD), 
was developed by Alan Thorndike and colleagues [44] 
and implemented by Hibler at the basin-scale in 1980 
[43] . Modeling the intricacies of sea ice thermodynam¬ 
ics to account for the thermal inertia of brine-pocket 
physics was developed slightly earlier by Norbert 
Untersteiner and Gary Maykut [45, 46], but it was 
only implemented in ID until the present decade. 

The first global climate models treated sea ice as 
a uniform slab without leads (openings among floes), 
melt ponds, or brine pockets, based on the simplifica¬ 
tions proposed by Albert Semtner [47]. If the sea ice 
moved at all, it was advected with the surface currents - 
in what is known as “free drift.” Once the sea ice 
thickness reached some threshold (4 m was common), 
it was then held motionless to prevent the sea ice from 
building to excess in regions of convergence [48-50]. 

It was not until Flato and Hibler [51] simplified the 
VP model by treating sea ice as a cavitating fluid (CF) 
that global climate modelers attempted to implement 
sea ice dynamics with a constitutive law. However, the 
lack of shear strength in the CF model degraded the 
accuracy of the simulation compared to the VP model. 
Soon after Hunke and Ducowicz [52] developed 
a technique of treating sea ice as an elastic-viscous- 
plastic (EVP) material - a numerical approximation 
to the VP model that asymptotes to the full VP solution 
and yet is efficient, highly parallelizable, and offers 
flexible grid choices. Zhang and Hibler [53] followed 
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suit by making the VP numerics more efficient and 
parallelizable. These new dynamical schemes ushered 
in a time of rapid improvement in the sea ice dynamics 
in climate models, and now EVP and VP dynamics are 
in wide use among climate models. Thermodynamic 
advances in global climate models have been slower. 
The use of an ITD and brine-pocket physics has only 
been recent in climate models [54, 55]. Melt pond 
parameterizations and radiative transfer that includes 
scattering have only been developed in one dimension 
and are in the early stages of production runs in the 
next generation of models. 

Snow and permafrost models used for one- and 
two-dimensional applications also progressed signifi¬ 
cantly before their developments were brought into 
climate models, e.g., [56-58]. Permafrost models are 
typically based on one-dimensional thermal diffusion 
in the upper 2 km of the Earth surface, consisting of 
bedrock, sediments, and soils. Where temperatures are 
below freezing, ground ice occupies pore space and 
fractures in the rock. A freezing front propagates to 
a depth that is limited by the geothermal heat flux. 
Permafrost models simulate the aggradation or degra¬ 
dation of permafrost, based on mean annual surface 
(ground) temperatures and subject to geothermal heat 
flux from below. Detailed near-surface models can be 
added to simulate the seasonal melting/freezing of the 
surface active layer. In reality, more complex thermo¬ 
dynamic and hydrological processes are involved in 
permafrost dynamics. For instance, water flow can 
advect heat and there are 3D thermal effects associated 
with horizontal gradients in surface temperature, 
arising from variable surface vegetation, micro-topog¬ 
raphy, snow cover, and surface lakes. In addition, freez¬ 
ing of subsurface water is not always limited to the pore 
space; ice can accrue as “massive ice” deposits (e.g., ice 
lenses) through migration of water to the interstices of 
soils and sediments, driven by low capillary pressures. 
Simple permafrost models, as used in climate models, 
typically neglect these processes and focus on large- 
scale predictions of permafrost depth and ID ground 
temperature structure. 

For snow models, the chief physical processes that 
prove challenging to model are snow aging and grain 
size evolution influences on the albedo and density; 
liquid water infiltration and storage; snow blowing, 
redistribution, and collection by vegetation; and active 


layer depth dynamics. Many global climate models 
have developed terrestrial snow schemes to treat all 
but snow redistribution in the last few decades because 
these processes have a major impact on surface albedo, 
surface hydrology, and soil carbon storage [59-64]. 
Subgridscale snow distribution models exist [65] but 
have yet to be implemented in any global climate model 
that we are aware of. Models of snow on sea ice tend to 
be more primitive because transporting every variable 
that describes the snow is necessary and expensive. 

In subsequent sections we describe the equations 
and methods used to model the cryosphere, followed 
by a brief outlook of future directions and priorities in 
cryospheric modeling. Components of the cryosphere 
are grouped into four types based on common charac¬ 
teristics and physics: Land ice refers to ice above the 
soil/bedrock (e.g., glaciers and ice sheets) and ice 
shelves; floating ice refers to river, lake, or sea ice; frozen 
soil includes permafrost and seasonally frozen soil; and 
finally snow, which may overly land ice, floating ice, or 
soil. The underlying equations for the dynamics of land 
and floating ice are rooted in a blend of continuum 
mechanics and fluid dynamics, but there are significant 
differences in their implementation, justifying separate 
descriptions. In contrast, thermodynamics cuts across 
all aspects of the cryosphere, and much about it can be 
described more generally. 

Thermodynamics of the Cryosphere 

Conservation of Energy 

The governing equation for the thermodynamics of the 
cryosphere is conservation of energy: 

+ V • (u v 4>q v ) = —V • (fcVT) + Q sw + ® 

( 1 ) 

which is written in terms of an enthalpy (q) for ice, 
snow, or mixed soil and ice, where q is expressed in 
units of Jm -3 , <p is the void (or pore) fraction within 
the solid where liquid water/vapor may exist, u v is the 
velocity of liquid water/vapor moving in the voids, T is 
temperature in the solid and voids, q v is the enthalpy of 
liquid water/vapor in the voids, k is the conductivity in 
solids and voids, Q sw is the absorption of shortwave 
radiation over a finite thickness of the ice (W m -3 ), 
which is assumed to be significant only in ice, and ® is 



Cryosphere, Modeling of 


c 


2765 


the strain heat production due to deformation work 
(significant for glaciers). The first term is a Lagrangian 
derivative of the heat required to raise the temperature 
and change the phase between solid and liquid. It is 
Lagrangian to account for the horizontal advection of 
heat in moving land and floating ice (see sections 
“Land Ice Dynamics” and “Floating Ice Dynamics”). 
The second term represents heat transport due to 
liquid water or vapor transport in the voids, and the 
third term is diffusion of heat in both solid and voids. 
In models of practical use today, vertical gradients in 
the enthalpy are far greater than those in the horizontal; 
therefore, the horizontal diffusion and horizontal 
liquid/vapor transport in the pore space are generally 
neglected. 

The enthalpy can be written to a good approximation 

q = q(T — T 0 ) + L 0 (1 — 0) (2) 

where q is the volumetric specific heat of ice or snow 
(or the ice and soil combination for permafrost), L 0 is 
the volumetric latent heat of fusion for ice or snow, and 
</> is the fraction of the volume that is not composed of 
ice. This includes air- and liquid-filled pore space in the 
volume, and in frozen ground, it also includes the soil 
or rock matrix. Modern models of snow and frozen soil 
may allow liquid water to infiltrate and possibly super¬ 
cool, and therefore, additional equations are needed to 
describe liquid infiltration and the conversion of liquid 
to ice and vice versa. Such equations may be devised 
such that the second term in Eq. 2 is not needed. The 
reader may refer to the Community Land Model, CLM 
version 4 for a description of one possible model of 
snow and frozen soil that is designed for a global 
climate model but is relatively complete [66]. 

The boundary conditions of Eq. 1 depend on the 
cryosphere component and possibly the climatic 
conditions. First consider the special case of floating 
ice where the basal temperature is always assumed to be 
at Tf. In these materials, the bottom boundary of Eq. 1 
has a temperature boundary condition, and the net flux 
into the surface excluding the conductive flux in the ice 
is equal to the heat flux from the water below: 
El bottom — Fw (fluxes are taken as positive toward the 
surface). In contrast, the base of land ice and the lowest 
point considered in a soil or snowpack model are 
generally not at the freezing point, in which case Eq. 1 
has a flux boundary condition at the base equal to the 


geothermal heat flux: F\ hottom = F G . At the top surface of 
ice or snow, one must test if the net flux into the top 
surface can balance the conductive flux (taking z as 
positive down): 


HT)L P 



top 


for a surface temperature below the freezing point 
(T < Tf). If so, then a flux boundary condition is used 
at that surface in Eq. 1. If the surface temperature is at 
or above the freezing point, then a temperature bound¬ 
ary condition is used at that surface in Eq. 1 with T = Tf. 


Ablation, Accretion, and Accumulation 

Models of land ice, floating ice, and snow usually 
employ a fixed number of layers and therefore layer 
thicknesses vary in time. This so-called moving bound¬ 
ary method requires a Stefan condition to describe 
ablation, accretion, and accumulation. The rate of 
change of the top surface position (z 0 , positive down) 
for snow or ice from ablation is 


dz 0 
^ dt 


ablation 


dT 

=F( T K- k ^ 


top 


if -F(T)| 

v n »p dz 


top 


( 3 ) 


An additional equation is needed to account for 
snow processes that contribute to increasing the mass 
of ice and/or snow at the top surface: 


= -S/a// + d, (4) 

accumulation 

where Sf a u is the rate of falling snow, and 3 is from snow 
densification, snow to ice conversion, snow redistribu¬ 
tion, etc. The rate of change of the bottom surface 
position (zip positive down) for ice from accretion 
(if liquid water is available) or ablation is 


dz 0 

dt 



-F(T) 


bottom 



bottom 


( 5 ) 


Floating ice may experience lateral accretion or 
ablation as well. Land and floating ice must also take 
into account the horizontal transport of ice, which 
is described in sections “Land Ice Dynamics” and 
“Floating Ice Dynamics”. 
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Surface Energy Balance, Radiation, Surface Albedo, 
and Melt Ponds 

The net flux entering the top surface of land and float¬ 
ing ice, soil, or snow is a sum of radiative and turbulent 
heat fluxes: 

F(T)\ top = Jv(l - a) - I 0 + F l - eaT 4 +F S + F e , 

( 6 ) 

where F r ( 1 — a) is the net downward solar irradiance 
above the top surface, a is the surface albedo, I 0 is the 
solar irradiance that penetrates the top surface, fi is the 
downward longwave irradiance, eaT 4 is the upward 
longwave irradiance (for T in Kelvin), e is the emissiv- 
ity, a is the Stefan-Boltzmann constant, F s and F e are 
the downward sensible and latent heat fluxes, 
respectively. 

In many ice and snow models, the surface albedo is 
a function of various quantities such as temperature, 
snow grain size, snow age, impurities, snow depth, ice 
thickness, and melt pond coverage. Often shortwave 
radiation is absorbed in the ice interior based on Beer’s 
law, although Beer’s law is inappropriate in materials 
with depth-dependent surface albedo parameteriza- 
tions. Usually the temperature dependence of the sur¬ 
face albedo is a proxy for modeling melt pond, grain 
size, and/or surface scattering characteristics. These 
relatively crude methods are being revamped consider¬ 
ably in models at this time. 

A better way is to design a highly interdependent set 
of physics for radiative transfer, ponding, and liquid 
infiltration. Ideally one would have radiative transfer 
account for multiple-scattering and be based on intrin¬ 
sic optical properties that vary with impurity concen¬ 
trations, snow grain size, ice bubbles, and brine 
pockets. Ponds would accumulate water above sea 
level when there is insufficient hydraulic connectivity 
to drain meltwater, and they would accumulate below 
sea level when there is hydraulic connectivity that is 
high enough to allow liquid water to rise up from below 
and flood the surface. 

Influence of Salts and Other Impurities 

Dissolved impurities such as salts in the pore water 
depress the freezing point. For ice to remain in local 
thermodynamic equilibrium, the pore water is always 


at the freezing point and freezing or melting must 
occur at the pore-ice interface to dilute or concentrate 
the solute. If we assume for simplicity that the voids are 
completely filled with liquid, then the void fraction is 
a function of temperature and the bulk solute concen¬ 
tration. The solute concentration in the voids, assum¬ 
ing freezing expels the solute completely into the voids, 
is a function of temperature according to the liquidus 
relation from the phase diagram of the binary material 
(ice plus solute). A few global climate models today 
have sea ice models that allow for such pore-ice inter¬ 
change (also known as brine-pocket physics in sea ice) 
[54, 55] . However, these models assume the bulk solute 
concentration is fixed in time, and hence, they neglect 
the heat and solute transport in the voids. These sim¬ 
plifications made it possible for the first step, but they 
cannot capture the important structural evolution of 
young sea ice or rapidly changing permafrost. These 
processes should not be neglected in models that aim to 
model biogeochemistry in the polar regions. 

Land Ice Dynamics 

Glaciers are perennial ice masses that are large enough 
to experience gravitational deformation: the flow of ice 
under its own weight. Glaciers and ice sheets nucleate 
where snow accumulation exceeds snow and ice abla¬ 
tion over a period of many years or decades. With time, 
the accumulated snow is buried and compressed, meta¬ 
morphosing into firn and then glacier ice. Ice behaves 
as a nonlinear, visco-plastic fluid; once the ice thickness 
is sufficient, internal gravitational stresses cause the ice 
to deform. 

Snow accumulation is primarily meteoric (derived 
from atmospheric precipitation), but snow can also 
accumulate at a site through wind deposition or 
avalanching. Ablation refers to the loss of snow and 
ice through melting, sublimation, wind erosion, or 
calving, a process where slabs of ice at the glacier 
margin mechanically fracture and detach from the 
main ice mass. Iceberg calving is a very effective abla¬ 
tion mechanism for glacier and ice sheets that are in 
contact with the ocean. Melting occurs at the glacier 
surface - the ice-atmosphere interface - but there is 
also melting internally (englacially), at the glacier bed 
(subglacially), and on vertical ice cliffs that are com¬ 
mon at the ice margin, particularly where glaciers reach 
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the sea and a large area of ice can be in contact with 
water. Melting only leads to ablation in the case where 
meltwater runs off and is removed from the system; 
some surface meltwater in the upper regions of glaciers 
and ice sheets percolates into the snowpack or ponds at 
the surface, where it can refreeze. 

A glacier’s mass balance is determined by the net 
accumulation minus ablation over a specific time 
period, typically 1 year. This can be expressed as 
a total balance (kg year -1 or m 3 year -1 water- 
equivalent for the entire glacier) or a specific balance, 
per unit area of the glacier (kgm -2 year -1 or m year -1 
water-equivalent). There is no simple “threshold 
temperature” for a glacier to be viable; a mean annual 
temperature below 0°C is not a necessary or sufficient 
condition for glacier ice to exist. Tidewater glaciers are 
vivid examples of this. Because ice flow delivers large 
fluxes of ice to low elevations, glaciers can extend to sea 
level environments where mean annual temperatures 
are several degrees above 0°C. While most glaciers do 
not reach the ocean, this feature is intrinsic to all 
glaciers; glacier ice in the ablation area does not grow 
in situ, but is a consequence of ice transport from the 
accumulation area to the ablation area. 

Glacier flow occurs through three different mecha¬ 
nisms: internal “creep” deformation, decoupled sliding 
at the ice-bed interface, and deformation of subglacial 
sediments (Fig. 1). The former is a function of ice 


z = s 
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Schematic of glacier flow mechanisms. The surface velocity 
u s = u b + u d (s). Basal velocity is the sum of deformation of 
underlying sediments and decoupled sliding at the ice-bed 
interface. Where ice is moving at the bed, these two 
processes can operate together, or only one of them may 
be active 


rheology and the stress regime in the ice, while the 
latter two mechanisms are governed by conditions at 
the base of the glacier [67]. These processes are 
described in more detail below. 


Governing Equations for Glacier Dynamics 

Similar to models of atmosphere or ocean dynamics, 
the flow of glaciers and ice sheets is mathematically 
described from the equations for the conservation of 
mass, momentum, and energy. For a point on the 
glacier with ice thickness H , the vertically integrated 
form of the conservation of mass is most commonly 
employed in glaciological models: 

— = -V-(u H) + b. (7) 

Here u is the average horizontal velocity in the 
vertically integrated ice column and b is the mass bal¬ 
ance rate. The mass balance is computed generally from 
Eqs. 3-5 by 


dz 0 

dz 0 

dzb 

dt 

ablation 

+ ~7~ 

accumulation 


(8) 


The first term on the right-hand-side describes 
the horizontal divergence of ice flux, while the second 
term describes the net local source or sink of 
mass associated with accumulation and ablation. The 
vertically averaged velocity includes ice flow due to 
both internal deformation and basal flow: 
u = uj + Ub- Glacial ice moves slowly, so a year is 
typically adopted as the most convenient unit of time; 
hence, ice velocities are reported in m year -1 and b is 
expressed as m year -1 of ice-equivalent gain or loss of 
mass. 

The main challenge in modeling glaciers and ice 
sheet is evaluation of the velocity field. Acceleration 
and inertial terms are negligible in glacier flow, so the 
Navier-Stokes equations that describe conservation of 
momentum reduce to a case of Stokes flow, where 
gravitational stress is balanced by internal deformation 
in the ice: 


v • a = -pg, (9) 

where o is the ice stress tensor, p is ice density, and g is 
the gravitational acceleration. Expanding this into the 
three directional components, 
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( 10 ) 


Ice deformation is independent of confining (i.e., 
hydrostatic) pressure, so ice rheology is usually couched 
as a function of the deviatoric stress tensor, a'. Under the 
assumption that ice is incompressible, the momentum 
equations can be analyzed to give the horizontal balances 


2 d<4 | | da 'xy | da'„ 

dx dx dy dz 

ay 

dx dy dy dz 


dh 

- pg d~x’ 


~Pg 


dh 
dy ' 


( 11 ) 


The terms on the right-hand-side represent the 
surface slope, for the glacier surface h(x, y). Greve and 
Blatter [129] present a detailed derivation of this full 
system of equations and their solution. A constitutive 
relation is needed to express internal stresses in terms 
of strain rates in the ice: ky = 0 .5(dui/dxj + duj/dxi). 
Equation 11 can then be rewritten as a function of the 
3D ice velocity field, providing a framework to solve for 
u and integrate Eq. 7 to model the evolution of glacier 
geometry in response to variations in ice dynamics or 
climate. The next section describes the constitutive rela¬ 
tion that is most commonly used in glacier modeling. 

Because ice rheology is strongly temperature- 
dependent, an additional equation is needed to solve 
for the 3D temperature distribution. As discussed in 
section “Thermodynamics of the Cryosphere,” the local 
energy balance gives the governing equation for tem¬ 
perature evolution in the ice sheet in Eq. 1. 

Given a 3D temperature distribution through the 
ice sheet, the effective rheology of the ice can be 
evaluated and the velocity field can be numerically 
determined. Knowledge of the temperature field is 
also essential to assessing whether the base of the ice 
sheet is at the pressure melting point or not; if so, liquid 
water can be present at the bed and the glacier or ice 
sheet is subject to basal flow. 


Internal Deformation: Ice Rheology In order to 
solve the Stokes flow diagnostic equation, ice sheet 


stresses need to be expressed as velocity fields, via 
a constitutive relation for ice. The rheology of poly¬ 
crystalline glacier ice is well-studied in laboratory and 
field environments [68]. Lab studies of tertiary ice 
deformation reveal that ice deforms as a nonlinear 
viscous fluid [9, 10, 69, 70]. The original form of the 
flow law proposed by Glen [9, 10] is broadly supported 
by field studies of tunnel and borehole deformation 
[7, 71], as well as observations and modeling of large- 
scale ice motion [72, 73]. 

This constitutive relation is known as Glen s flow 
law and is written as a function of the second invariant 
of the deviatoric stress tensor, = (a'- a^ /2, 

k = < 12 > 

B(T) is an “ice softness” term that follows an Arrhe¬ 
nius temperature dependence, 

B(T) = B 0 exp(^2). (13) 

B 0 is called the Glen flow law parameter, R is 
a constant, and Q is the creep activation energy. Lab 
and field studies of ice deformation suggest that B 0 can 
vary by a factor of about 10 [68]. Including the effects 
of strain-softening and temperature, the effective vis¬ 
cosity of ice varies by several orders of magnitude. 

This formulation is an isotropic flow law that allows 
the first-order effects of ice temperature and deviatoric 
stress regime to be incorporated in estimates of ice 
deformation. Ice deformation is typically modeled as 
an n = 3 process. Where shear stress and shear defor¬ 
mation are dominant, as is often the case, this is well- 
approximated by 

= B(T) <£. (14) 

Glens flow law is for pure, isotropic ice. There are 
numerous other complicating factors for ice deforma¬ 
tion, such as anisotropic ice fabric [74-76], the poten¬ 
tial impact of grain size [77, 78], and the presence of 
impurities and intergranular liquid water content [79]. 
These effects are not explicitly resolved in ice sheet 
models, but the flow rate parameter, b 0 , is typically 
tuned to approximate the bulk effects of crystal fabric, 
grain size, and impurity content. 

The strain rates in Eq. 12 or 14 can be expressed as 
velocity gradients, and then vertically integrated or 
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inverted and substituted into the momentum balance, 
Eq. 11, to give a set of equations for the horizontal ice 
velocity. Various numerical solutions to these equa¬ 
tions have been adopted in glacier and ice sheet model¬ 
ing, outlined in more detail in the next section. 

Basal Flow In addition to internal deformation, 
glacier ice can flow at the base where the bed is at the 
pressure melting point, through some combination of 
subglacial sediment deformation and decoupled slid¬ 
ing over the bed. Large-scale basal flow generally 
requires pressurized subglacial meltwater, which can 
lubricate the bed, float the ice, or weaken subglacial 
sediments. Subglacial hydrology therefore plays 
a pivotal role in fast-flowing glaciers and ice streams 
[80, 81]. High subglacial water pressures can decouple 
the ice from the bed by reducing or eliminating basal 
friction. On local scales this may not entice a significant 
ice-dynamical response, as resistive stresses 
can be taken up at adjacent well-coupled regions of 
the bed, by side drag from valley walls or adjacent ice, 
or by longitudinal stress bridging (upstream and 
downstream resistance to flow). However, numerous 
observational studies report occasions where inputs of 
surface meltwater to the bed overwhelm these 
resistive stresses and produce localized speedups 
in valley glaciers, e.g., [82, 83] and in polar icefields 
[84-86]. 

For large-scale ice stream flow or surging of outlet 
glaciers, subglacial water must occupy a significant 
portion of the glacier bed, at pressures which are suffi¬ 
cient to drown geologic and topographic pinning 
points [83, 87]. In this situation, widespread ice-bed 
decoupling can permit high rates of basal flow (100- 
1,000 s of m year -1 ) and a regime in which ice fluxes are 
dominated by basal flow. 

In ice sheets that exhibit high rates of basal flow, 
there is ongoing uncertainty as to the relative impor¬ 
tance of sliding flow along the ice-bed interface versus 
deformation of the underlying glacial sediment. High 
subglacial water pressures are conducive to both pro¬ 
cesses. Fast flow in West Antarctica’s Siple Coast ice 
streams is associated with plastic failure of a thin layer 
of saturated marine sediment [88, 89], and similar 
processes are expected to be important wherever sub¬ 
glacial sediments and topographic features offer 
a relatively smooth, low-friction substrate [90]. 


Models make some allowance for basal flow, usually 
through a local sliding “law” relating basal flow, u h , to 
gravitational shear stress at the bed, raised to some 
power m, Ub oc t™. In some applications, the effects of 
subglacial water pressure on basal flow are introduced, 
typically through the effective pressure p e . This is the 
difference between glaciostatic (ice) pressure and sub- 
glacial water pressure: p e = pi — p w . An effective pres¬ 
sure of 0 indicates that ice is floating, so low or negative 
effective pressures promote ice-bed decoupling and 
enhanced basal flow. While it is safe to assume that 
Ub oc p~ k , for some unknown power /c, there is likely no 
generalized local relationship between u b and p e ; actual 
basal flow is affected by regional-scale ice dynamics, 
not just local conditions. A prescription of the form 
Ub = At™ /p\ is unstable as this blows up as p e —* 0. 
Local flotation is commonly observed in nature, so 
p e = 0 is a physically acceptable possibility. The math¬ 
ematical instability is simply a failure of the local form 
of the basal flow law. An alternative is to introduce 
a parameterization in terms of the flotation fraction 
p w /ph with u b = 0 when p w = 0 and basal flow increas¬ 
ing with p w /pj. The local expression Ub = Az™f(p w /pi) 
can represent this, e.g., [91] . Basal flow observations are 
notoriously difficult to make so there is no clear rec¬ 
ommendation as to the functional form of f(p w /pi)- 
Hydrological enabling of basal flow is expected to be 
a nonlinear, threshold process [92]. 


Modeling Glaciers and Ice Sheets 


This section describes the physical approximations and 
numerical techniques in place for simulations of glacier 
and ice sheet dynamics, based on solution to Eqs. 7, 11, 
and 1. 

One common reduction in ice sheet modeling, 
known as the shallow-ice approximation, involves 
a number of scaling assumptions that are valid when 
horizontal gradients in ice thickness and velocity are 
small and ice flows dominantly by vertical shear defor¬ 
mation. Under these conditions, Eq. 11 simplifies to 


d °'iz 

dz 


= -Pg 


dh 

dxi 


(15) 


where subscript i refers to the horizontal direction of 
interest (x, y). Equation 15 can be vertically integrated 
to give 
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°iz( Z ) = -Pg( h - Z )fa i - ( 16 ) 

This vertical shear stress is commonly known as the 
gravitational driving stress, T d . It vanishes at the glacier 
surface ( z-h ) and T d = —pgH\7h at the glacier bed. 
Substituting for the strain rates based on Glens flow 
law and representing strain rates in terms of velocity 
gradients gives 


Ui(z) = u hi -2(pg) n \Vh\ n B(T)(h-z) n dz, 

(17) 


for basal velocity Ub- 

For the shallow-ice approximation represented in 
Eq. 17, the vertically averaged ice velocity due to defor¬ 
mation is nonlinearly proportional to local surface 
slope and ice thickness, 

u d oc (Vh) n H n+1 . (18) 

This treatment of ice dynamics has been adopted in 
most ice sheet modeling studies to date, e.g., [19, 20, 93] . 
The nonlinearity of Glen’s flow law gives ice deforma¬ 
tion rates that are exceptionally sensitive to the glacier 
thickness and surface slope. Under this approximation, 
ice velocity is solely a function of local ice geometry. 
This neglects “farfield” effects and other complicating 
influences on ice flow, such as longitudinal stretching/ 
compression of the ice and horizontal shearing due to 
the friction of valley walls. The influence of longitudi¬ 
nal stress coupling on ice dynamics is significant in 
complex terrain such as valley glaciers and mesoscale 
icefields [34, 94]. It is also important in settings such as 
tidewater glaciers, ice shelves, ice streams, transition 
regions from inland to floating ice dynamics, and ice 
sheet divides [14, 28]. Many of the most interesting 
questions in ice sheet behavior involve these parts of 
the system, so more complete representations of ice 
dynamics are of great interest. 

Doug MacAyeal [16, 130] introduced an approxi¬ 
mation for ice shelf and ice stream flow that is 
essentially the complement of the shallow-ice approx¬ 
imation in Eq. 15. Because flow in ice streams is pre¬ 
dominantly at the base, a plug-flow approximation, 
assuming no vertical shear, is reasonable. The gravita¬ 
tional driving stress is then taken up by longitudinal 


stress, horizontal shear stress, and basal friction. In ice 
shelves, the basal traction also vanishes. Under this 
approximation, Eq. 11 can be written 
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(19) 


Substituting for strain rates and defining an effec¬ 
tive viscosity p e from the inverted form of Glen’s flow 
law [16], 
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A vertically integrated form of Eq. 20 is readily 
derived under the assumption that there is no vertical 
variation in strain rates. These equations can be numer¬ 
ically solved to give the horizontal velocity fields, 
subject to prescription of a basal shear stress as 
a boundary condition at the bed. 

This set of equations provides a good representa¬ 
tion of ice dynamics where vertical shear deformation 
is negligible. MacAyeal and colleagues have had good 
success in simulating Antarctic ice stream and ice shelf 
dynamics with this method. This approach to model¬ 
ing ice dynamics has also been applied to former ice 
streams in the Laurentide Ice Sheet [95] and in studies 
of the inland propagation of ice-marginal thinning in 
the Amundsen Sea sector of West Antarctica [31]. 
Using control theory (inverse methods), these equa¬ 
tions also provide an opportunity to construct the basal 
friction from known surface velocity fields [96, 97]. 

The assumptions that underlie both shallow-ice 
models and the “plug-flow” equations are limiting in 
mixed flow regimes where vertical and horizontal shear 
stresses and longitudinal stresses are all important. This 
is the case where ice flow goes through a transition 
from grounded to floating conditions (the grounding 
line), where there are large spatial gradients in basal 
flow, at ice divides, and in valley glaciers, where the 
shallow-ice approximation is not valid because the ice 
thickness is of similar magnitude to horizontal varia¬ 
tions in ice thickness and velocity. To address this and 
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provide a modeling framework that is generic and self- 
consistent for all environments, recent efforts have 
explored solutions to the full Stokes system, Eq. 10, as 
well as intermediate stages of complexity between the 
shallow-ice approximation and the full Stokes solution. 
The development of theoretical and numerical solu¬ 
tions of full stress-field solutions is promising, and has 
become tractable on regional scales [28, 29, 32]. Full 
solutions are still computationally unwieldy on conti¬ 
nental scales, and have yet to be applied to whole ice 
sheet simulations in Greenland or Antarctica. 

Progress has been slower in numerical simulations 
of the subglacial geological and hydrological processes 
that give rise to basal flow. This remains absent or 
oversimplified in models, where basal sliding is often 
specified as a function of gravitational driving stress, 
subject to thermal regulation. Warm-based ice can 
slide, while ice frozen to the bed is subject to a no-slip 
boundary condition. Some recent efforts include 
explicit models of subglacial water flow and storage, 
permitting sliding-law formulations that model this 
control on basal flow [98-100], but the physical under¬ 
standing and the numerical representation of these 
processes remain limited. 


response of the ice sheets to recent climate change. 
Regions of the ice sheets are thickening and thinning 
as a result of the ice sheet and climate history, largely 
associated with the long timescales of thermal advec- 
tion and diffusion. The ice sheets are also adjusting to 
the geometric changes that attended the last deglacia¬ 
tion, e.g., [101]. These “secular” effects are further 
compounded by the slow (10 3 -10 4 year) timescale of 
isostatic adjustment to the last glacial maximum and 
the subsequent deglaciation, which also needs to be 
simulated for the ice sheet system. 

Finite difference and finite element approaches 
have both been applied to glacier and ice sheet models. 
Finite elements are more versatile and applicable to the 
complex geometry of mountain glaciers and fixed, lim¬ 
ited spatial domains such as ice shelves, and these have 
advantages for solution of the diagnostic equations for 
ice velocity. Time-adaptive, moving grids are needed to 
simulate the evolution of glacier geometry, however, 
and this combines with the simplicity of finite- 
difference methods to make these more popular for 
simulation of large-scale ice sheet dynamics. 


c 


Priorities and Challenges 


Computational Considerations 

For whole ice sheet simulations, the horizontal grid 
spacing is typically 20-50 km, with some 30 layers in 
the vertical. Recent simulations have adopted grid sizes 
of 5-10 km. Such resolutions are tractable but the 
computational demand increases by an order of mag¬ 
nitude for each factor of two reduction in the resolu¬ 
tion, due to a fourfold increase in grid cells and the 
necessary reduction in the solution time step. 

Part of the computational challenge for full-stress 
solutions arises from the need for long spin-up simu¬ 
lations for the polar ice sheets. Both Greenland and 
Antarctica contain ice that is more than 200 k year old, 
and it is essential to model the temperature and thick¬ 
ness evolution of the ice sheets through the last one or 
two glacial cycles in order to provide a reasonable 
internal temperature field for present-day studies or 
future projections of the ice sheets. Present-day dynam¬ 
ical adjustments are ongoing from the long-term evo¬ 
lution of the ice sheets, and this provides a background 
signal that must be understood in order to evaluate the 


Fundamental glaciological data are still sparse in large 
sectors of the polar ice sheets and mountain glaciers, 
including knowledge of ice thickness, thermal regime, 
and subglacial conditions. It is difficult to apply hydro- 
logical and basal flow models to much of Greenland 
and Antarctica, where groundwater drainage, sediment 
properties, and other details of the subglacial environ¬ 
ment are poorly known. Better understanding of the 
essential subglacial processes physics is also needed 
[67], as well as methods to parameterize these 
subgrid-scale processes in large-scale models. Similar 
physical and numerical challenges are involved in sim¬ 
ulation of iceberg calving; there is no established 
“calving law,” as the underlying physics and environ¬ 
mental controls are not fully understood. 

There is a recent emphasis on glacial systems 
modeling, including subglacial processes and hydro- 
logical evolution, e.g., [92], but these efforts are still 
in early stages. With these limitations in mind, there is 
potential for major progress on at least two fronts based 
on what is currently well-understood about glaciers 
and ice sheets: (1) through high-resolution, full Stokes 
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solutions, which are on the horizon, and (2) through 
improved coupling of ice sheet and climate models. 
Improved coupling with climate models is necessary 
and many research groups have initiated this effort in 
recent years, for both individual glaciers [102] and 
continental ice sheets [25, 103-105]. 

Improved representation of stress-strain regimes at 
ice sheet margins, within floating ice and grounding 
zones, and in drainage basins that feed fast-moving 
outlets will provide a better representation of 
interannual and decadal variability of the ice sheets. 
This will also improve model capability in simulating 
glacier and ice sheet sensitivity to climate change. 
Resolutions (and input bedrock and climate-forcing 
datasets) of order 1 km are needed for this advance in 
the polar ice sheets, at least in the areas of complex 
flow and steep gradients at the ice sheet margin. For 
valley glaciers, input fields and ice dynamics need to 
be simulated at resolutions of order 100 m, and closer 
to 10 m if one wishes to simulate interannual glacier 
terminus response to climate change. It is also 
important to recognize that even with full Stokes solu¬ 
tions, ice streams, surging behavior, summer speedups, 
ice shelf instabilities, and areas of fast flow within the 
ice sheets will not spontaneously arise in the correct 
places; these are associated with processes and forcings 
that are absent in most models. In particular, there is 
limited two-way interaction between oceans and ice 
sheets in modeling studies to date, despite recent evi¬ 
dence that ice sheets are highly responsive to ocean 
warming [106]. 

Floating Ice Dynamics 

State-of-the-art climate models today treat the jumble 
of floating ice floes as a continuum. At present, in 
global models, the only floating ice with dynamics is 
sea ice, and lake and river ice models are generally 
thermodynamics only. There are specialized models of 
lakes and river ice with dynamics, and they are based on 
the same principles as in sea ice models, so we discuss 
their dynamics together here. 

The physics of floating ice is strongly dependent on 
the thickness. Therefore, floating ice is generally 
described in terms of a distribution of ice thicknesses 
at the subgrid-scale. The ice motion is also considered 
for a continuum, rather than for individual floes. With 


this brief overview, a global-scale floating ice model can 
be developed from the description of thermodynamics 
given in section “Thermodynamics of the Cryosphere” 
and the additional governing equations for the dynam¬ 
ics and ice thickness distribution given in this section. 

Ice-Thickness Distribution 

The formulation of a floating ice model begins with the 
ice thickness distribution equation. The ITD is 
a probability density function (pdf), usually written 
g(h ), that describes the probability that the ice cover 
in a particular region has thickness h. A cruder alter¬ 
native is to model the mean thickness of the pdf and the 
total ice concentration. 

In a floating ice model, the ITD describes the pdf of 
a grid cell and thus it is sometimes called a subgrid- 
scale parameterization. A parameterization typically 
represents processes that are too small-scale or complex 
to be represented explicitly. For example, deformation 
is parameterized with a set of rules in a continuum 
model because there is no known differential equation 
that describes deformation in this type of model. The 
rules must select the portion of the ITD that will 
deform and then redistribute it within the ITD. In 
contrast, ice growth and melt alter the ITD in a way 
that is computed from first principles. Hence the ITD 
actually includes both parameterized and explicit 
physics. 

The ITD equation is 

s=-* v < 2i > 

where the left-hand-side is the Lagrangian derivative of 
g following an ice “parcel” and terms on the right- 
hand-side are the rate of change of g from parcel con¬ 
vergence, mechanical redistribution (T/ see Fig. 2), 
advection of g in ice thickness space from growth/melt 
(see Fig. 3), and the reduction rate ofgfrom lateral melt 
(£). Here, u is the horizontal ice velocity (vertical ice 
velocity is ignored) and /is the net growth rate. We use 
the variable / as is traditional in the ITD literature, 
though it is essentially equivalent to the mass balance 
b used to describe glaciers in Eqs. 7 and 8. The ITD 
equation was introduced by [44]. Models that specify 
the ITD, e.g., [ 107] , or only permit a single ice thickness 
in the ice-covered fraction of a model grid cell, 
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Cryosphere, Modeling of. Figure 2 
Illustration of deformation. The portion of the distribution 
labeled "lost to deformation" is also known as the ice that 
participates in redistribution. It subsequently is 
redistributed to thicker parts of g[h), where it is labeled 
"gain by deformation" 
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Illustration of advection in thickness space 


e.g., [108], would have an equation for the gridcell 
mean ice thickness instead. 

There are two parts to deformation: a rate of open¬ 
ing (creating open water) and closing (closing open 
water and/or deforming and redistributing the ice), 
which depend on u and g(h). The opening and closing 
rates depend on the convergence and/or shear in the ice 
motion field. It may not be obvious that shear would 
cause deformation. Imagine that the ice pack is com¬ 
posed of pieces with jagged edges. When shearing, the 
jagged edges can catch on one another and cause defor¬ 
mation, which converts kinetic energy into potential 
energy from piling up ice, or shearing can cause fric¬ 
tional loss of energy and no deformation. Thus the 


closing rates also depend on assumptions made about 
frictional losses, see e.g., [51, 54]. 

A mathematical representation of lead opening and 
mechanical redistribution is most conveniently written 
in terms of the divergence and shear of u, Gj and Gjj, 
respectively. The parameterization follows from [44]: 

ii= |e| [a o (0)8(h) + cc r (0)w r (h, g)], (22) 

where |g| = (ej + e 2 n ) 1/2 , 9 = tan _1 (Gjj/Gj), 8(h) 
(the delta function) is the opening mode and w r (h,g) 
is the ridging mode. The coefficients |G|a o (0) 
and |G|a r (0) are known as the lead opening and clos¬ 
ing rates, respectively, and they are related such that 
their difference equals the divergence, G|ao(0) — 
|G|a r (0) = Gj. 

For the redistribution process, some portion of g(h) 
is identified as potentially able to “participate” in redis¬ 
tribution (see Fig. 2). This is usually the thinnest 15% 
of g(h). If the open water fraction exceeds 15%, then no 
redistribution takes place, and instead the open water 
closes under convergence and nothing happens under 
shear. The so-called participation function is weighted 
according to its thickness, so that the thinnest ice is most 
likely to deform. Then a rule is needed to redistribute the 
ice that ridges. Originally, Thorndike et al. [44] pro¬ 
posed that ridged ice would end-up five times thicker 
than its starting thickness. Other more complex redis¬ 
tribution schemes have been used since then [43, 109]. 
The interested reader can refer to these references, for 
examples, of parameterizations for w r (h,g). 

The other primary mechanism that changes g(h) is 
ice growth or melt, which cause g(h) to shift in thick¬ 
ness space. This process is illustrated in Fig. 3. The 
growth/melt rate depends on thickness, so g(h) 
becomes distorted in the process. 

An explicit conservation equation for ice volume 
(or mass) is not given in this section because conser¬ 
vation of volume is contained in the equation for g( h ) 
(or at least it depends on the way g(h) is discretized). 

Momentum Equation 

The second governing equation is conservation of 
momentum: 

Du 

m— = — mfk x u + t„ + t w - mg r VY + V • a, 

( 23 ) 
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where the left-hand-side is the Lagrangian derivative of 
u following an ice parcel, the right-hand-side begins 
with a term representing Coriolis force, air and water 
stresses, the force due to ocean surface tilt, and the ice 
internal force. In this equation m is mass per unit area, 
f is the Coriolis parameter, g r is gravity, Y is the sea 
surface height, and a is the ice stress. 

The force balance at two locations, one just north of 
Greenland and the other in the Weddell Gyre, is given 
in Fig. 4. The internal stress constitutes a very large 
internal force in the sea ice near Greenland, where the 
sea ice is converging against the coast. In contrast, at 
the Weddell location, the ice is nearly in free drift and 
the main force balance is between air and water drag. In 
both cases, the ice motion is perpendicular to the 
Coriolis Force, to the left in the upper panel and to 
the right in the lower. 

Ice Rheology and the Constitutive Law 

A constitutive law characterizes the relationship 
between the ice stress and strain rate and defines the 
nature of the ice dynamics. A simplistic picture of 



Cryosphere, Modeling of. Figure 4 

Illustration of force balance from CCSM4 in October at 85N 
65W {upper) and 64S 164W {lower). The residual force is the 
force needed to balance the other forces that are shown, 
and it is equal to the acceleration minus the force due to 
ocean tilt, the latter normally is very small. The forces from 
internal stress and the residual are negligible at the 
location of the lower panel 


a converging ice pack with uniform thickness under 
an imposed compressive wind force is given in Fig. 5. 
Floating ice generally repels the compressive force 
somewhat even if it is deforming. The resulting internal 
force is associated with a nonzero stress state. In Fig. 5 
the ice pack is converging such that its length L on one 
side decreases by 3L in some time 3t, so the ice experi¬ 
ences a strain e = 3L/L and a strain rate e = 3L/L3t. 
A modeler chooses the constitutive law to relate o and 
e, which are actually two-dimensional tensors, not sca¬ 
lars as shown in Fig. 3 for illustrative purposes only. In 
two dimensions, the divergence and shear of the ice 
velocity can be conveniently used to describe the strain 
rate tensor as they are invariant across grid 
transformations. 

The momentum equation depends on the ice inter¬ 
nal force, which in turn depends on the ice stress tensor. 
The most common rheology used today is the viscous- 
plastic rheology from [108] (or a close derivative 
thereof) where the ice behavior is plastic at normal 
strain rates and viscous at very small strain rates. 

The constitutive law for the viscous-plastic model 
of floating ice can be cast in terms of the invariants of 
stress (cjj and a n ) and strain rate (ej and e n ) with the 
pair of equations 

= C4 ' “ P/2 (24) 

On = T]e II: 

where P is the ice strength and £ and 77 are bulk and 
shear viscosities, respectively. The relationship between 
the viscosities and strain rate invariants is chosen so 
that the stress state lies on an elliptical yield curve, 
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Ice slab side view 
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After applying a converging wind stress for time 8\ 
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Illustration of ice slab that deforms under compressive 
force 
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(25) 


where e is the ratio of the principal axes of the ellipse 
(the original Hibler model [108] used e - 2, which is 
still common today). Thus requiring 
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(26) 


for plastic behavior. To avoid infinite viscosities as 
A —> 0, £ and p are assigned large constant values. In 
this case, the stress state lies inside the elliptical yield 
curve and the ice behaves like a viscous fluid, exhibiting 
creep. 

The ice strength must be parameterized somehow. 
For a model with ridging, P can be based on the ener¬ 
getics argument proposed by Rothrock [110] for plastic 
deformation, where the compressive strength is 
equated to the potential energy increase per unit strain 
in pure convergence. Following [43], the potential 
energy is multiplied by a constant to account for the 
dissipation of kinetic energy by frictional energy loss 
during ridging, 

POO 

P = ZC p / h 2 w r (h)dh (27) 

Jo 

where C p = 0.5(p x /p w )g (p w — pj) and Z is the ratio of 
total energy dissipated to potential energy gain (p { and 
p w are the densities of ice and water and g is the 
acceleration due to gravity). [Ill] and [112] recom¬ 
mend Z- 17. 


Computational Considerations 

Computation time given to the sea ice component of 
a typical global climate models is roughly 10-20% for 
a ~100 km resolution ice grid, although it could easily 
be more or less depending on the relative resolution, 
sophistication, and optimization among components. 
Lake and river ice in global climate models take a trivial 
portion of the total computation time because their 
area is so small compared to the rest of the globe. 

A key consideration for climate modelers is to 
design a model that can run the scenarios of interest 


in a time to meet publication deadlines. Choices must 
be made to balance model resolution, physics, param- 
eterizations, and numerics. High-performance com¬ 
puters today have tens of thousands of cores and 
many have been constructed for the purpose of running 
climate models. Codes must parallelize well to take 
advantage of these machines, and sea ice codes gener¬ 
ally do because there are relatively many processes that 
operate at the grid-scale (e.g., vertical heat equation) 
and subgrid-scale (e.g., deformation). The Los Alamos 
Sea Ice model CICE has already been successfully scaled 
beyond 10,000 cores on Cray XT equipment [113]. 

Future Directions 

Most of the progress to date in modeling glaciers, ice 
sheets, permafrost, and snow science has stemmed 
from independent efforts by individual researchers. 
Advances have paralleled the much larger, coordinated 
research programs dedicated to the development of 
climate models, with little crossover or integration. 
This contrasts with modeling developments in sea ice 
studies, which are now fully integrated in most ocean- 
atmosphere general circulation models. 

To date there have been few efforts to couple climate 
and ice sheet models. Most ice sheet simulations use 
“offline” forcing where climate model-derived fields 
are used to drive simulations of ice sheet evolution, 
through one-way forcing. The long timescale of ice 
sheet evolution means that this may be a reasonable 
approach, but evolution of ice sheet albedo and topog¬ 
raphy feeds back on atmospheric conditions on time- 
scales of decades to millennia. Further, there are direct 
seasonal-timescale links between climate and the 
dynamics of glaciers and ice sheets, including the effects 
of surface meltwater and a possible link between coastal 
ocean temperature and sea ice conditions and outlet- 
glacier dynamics. These processes all require improved 
coupling between climate and ice sheet models. This is 
particularly true at the ice-ocean interface, where mass 
and energy exchanges are not physically modeled in 
current simulations. 

In general, the mass balance fields (accumulation 
minus melt) simulated by climate models are not accu¬ 
rate enough for fully coupled ice sheet-climate model¬ 
ing, nor can sophisticated atmospheric models be 
integrated over the millennial time scales of interest 
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for ice sheet evolution. However, the development of 
improved regional-scale meteorological and glacier 
mass balance models, e.g., [114-116], permit direct 
estimates of surface mass balance from meteorological 
models, and offer a good physically based method to 
simulate how these fields can be expected to change 
with ice sheet geometry. 

The challenge is greater for the mass balance fields 
and surface climatological forcing of mountain glaciers 
and permafrost. Mountain glaciers reside in complex 
terrain where temperature and precipitation gradients 
are steep. The topography and relevant meteorological 
processes are not faithfully resolved in even regional 
climate models, so some form of climatic downscaling 
is needed to prescribe mass balance fields for glacier 
modeling [102]. These methods generally do not 
conserve energy or mass, and improved treatments are 
needed. There are different considerations for perma¬ 
frost modeling, but they also relate to the resolution of 
the landscape and surface climate. The mean annual 
surface temperature that governs permafrost aggrada¬ 
tion and degradation depends on local-scale vegetation, 
snowpack depth, hydrology, and soil properties, which 
commonly vary over a spatial scale of meters. Hence, 
global-scale permafrost and snow models should be 
interpreted “statistically” for a region (i.e., via 
a distribution of permafrost thickness and snow depth 
in a given climate model grid cell, based on the range of 
ground cover and snow conditions in the region). 

In contrast to the terrestrial components of the 
cryosphere, the needs of global climate models often 
drive sea ice model developments. For example, models 
that use eddy-resolving ocean components are 
approaching the large floe-scale, where sea ice 
(assumed to be on the same grid) can no longer be 
considered a continuum. Rheologies that attempt to 
treat ice as a granular material hold promise as a means 
to extend the utility of continuum models to smaller 
length scales. Such rheologies often use Mohr- 
Coulomb or decohesive type yield curves, which depart 
from the standard elliptical yield curve particularly 
where the stress states on the ellipse resist breaking 
under tension. The floe models to date do well at 
modeling deformation but the process of joining floes 
at freeze-up has not been solved [111, 117]. 

Recent observations of sea ice reveal very long, 
narrow openings, or leads, in the ice that suggest 


oriented weaknesses occur. These regions have been 
dubbed linear kinematic features [118]. Such features 
occur in sea ice models with isotropic rheologies, such 
as the viscous-plastic type, with some level of realism at 
fine resolution [119]. However, models that account 
for such anisotropy explicitly by keeping track of lead 
orientation and computing resistance to opening 
depending on the orientation with respect to leads, 
e.g., [120, 121], may be able to match observations 
better [122]. 

The drive to model the carbon cycle and hence 
ecosystem dynamics in global climate models has 
spawned an effort to model sea ice algae and nutrient 
cycling. When seawater freezes, algae stick to the ice 
particles, and as the ice particles combine they trap 
brine, nutrients, and organisms in brine pockets. Bio¬ 
mass concentrations can be hundreds of times greater 
in sea ice than in seawater, and carbon and key nutri¬ 
ents in sea ice are a substantial fraction of the total in 
ice-covered regions. When sea ice melts it deposits 
these materials into the ocean precisely when solar 
input and meltwater runoff is highest, creating prime 
conditions for an ocean bloom. Models of these pro¬ 
cesses have been developed offline [123-127] and will 
soon be implemented in sea ice models of global cli¬ 
mate models. Models of ecosystem dynamics in sea ice 
also require a treatment of seawater infiltration, brine 
drainage, and meltwater flushing, which necessarily 
involves modeling sea ice salinity [127]. 

Modeling the cryosphere today is concerned with 
far more than describing Earth’s surface albedo. Global 
climate models need to be coupled to components 
of the cryosphere with adequate sophistication to 
investigate modern scientific problems involving sea 
level rise, Arctic sea ice retreat, permafrost thawing, 
and more. There is also evidence that models with 
better physics are among the models that agree best 
with observations [128]. 
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